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Award for Distinguished Service to Dr. Shirley A. Hill 


JAMES T. FEY AND KENNETH M. HOFFMAN 


During the past 30 years, no subject in the curriculum of elementary and 
secondary schools has received more critical attention or proposals for reform than 
mathematics. No subject in the college curriculum has received more focused 
attention. Throughout that crucial period of challenge and change, no individual 
has provided more distinguished leadership for the mathematical community than 
Shirley A. Hill. 

Shirley Hill has played key roles in dozens of curriculum development projects 
and policy boards that have shaped the character of teaching and learning in 
mathematics at all levels, across the United States and in many foreign countries. 
It is of more than symbolic significance that the decade of educational ferment 
through which our country has just passed began with An Agenda for Action, 
developed by the National Council of Teachers of Mathematics (NCTM) under 
Shirley’s leadership as president, and ended with the even more ambitious agenda 
of Everybody Counts, developed during her tenure as the first chairman of the 
Mathematical Sciences Education Board (MSEB). 

Events have given our discipline the leading role in implementing new national 
education goals during the 1990s. Surely no single individual can be credited with 
having placed us in this challenging position. Yet, we can say with equal certainty 
that as a discipline we could not have come this far without the remarkable 
leadership of Shirley A. Hill. It is an especially appropriate time to recognize her 
contributions through the Yueh-Gin Gung and Dr. Charles Y. Hu Award for 
Distinguished Service. 


Shirley A. Hill 
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A Phi Beta Kappa graduate of the University of Missouri, Shirley Hill taught in 
the schools of Kansas City, Missouri, before heading west to earn a Ph.D. from 
Stanford University in 1961. While at Stanford she became the assistant director of 
the Stanford University Arithmetic Project. This was the first in a series of 
contributions to prominent curriculum development projects that included the 
School of Mathematics Study Group, the African Mathematics Project, a project 
for teaching mathematics by radio in Nicaragua, and the Comprehensive School 
Mathematics Project. 

In 1967, Shirley joined the Teacher Training Panel of the Committee on the 
Undergraduate Program of the Mathematical Association of America (MAA). She 
helped draft and promote the adoption of CUPM guidelines for the education of 
mathematics teachers—a problem area in which she has maintained a strong 
interest throughout her career. 

Teacher education has been the focal point of her work as a professor, since 
1963, at the University of Missouri-Kansas City. Her skill and dedication to this 
work have been widely recognized, including awards as Missouri Mathematics 
Teacher of the Year in 1978 and Amoco Award for Excellence in Undergraduate 
Teaching in 1981. 

The recognition of Shirley’s talents grew quickly, and in 1972 she was elected to 
the Board of Directors of the National Council of Teachers of Mathematics and 
the Board of Governors of the MAA. She chaired the National Advisory Commit- 
tee on Mathematical Education and was a principal author of its influential report 
Overview and Analysis of School Mathematics K-12. In 1976 she was appointed 
chair of the United States Commission on Mathematical Instruction and the U.S. 
representative to the International Commission on Mathematical Instruction. In 
1978 she was elected to a two-year term as president of the National Council of 


Teachers of Mathematics. 

During the 1980s, Shirley served as a consultant to more than 15 states and to a 
broad array of curriculum development, assessment, and teacher education pro- 
jects, including the University of Chicago School Mathematics Project, the 
National Assessment and Second International Study of Mathematics Achieve- 
ment, and the National Board for Professional Teaching Standards. She was 
instrumental in the creation of the Mathematical Sciences Education Board and as 
chairman led its development from 1985 through 1989. 

While serving in so many active and advisory roles on diverse projects and 
boards, Shirley has also managed to publish widely. Her contributions have ranged 
from First Course in Mathematical Logic (with Patrick Suppes) to the text, 
Elementary Geometry, for teachers (with Vincent Haag and Clarence Hardgrove) 
and chapters in the yearbooks of the National Council of Teachers of Mathemat- 
ics, the NSSE, and the National Education Association. She was an associate 
editor of The American Mathematical Monthly from 1973-1977, and a member 
of the editorial panel for the 1973 NCTM Yearbook, The Slow Learner in Mathe- 
matics. 
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The numerous, elected, and appointed positions on national and international 
professional policy and advisory boards, and her frequency selection as leader of 
those groups, are unmistakable evidence of the respect that Shirley has earned 
throughout the mathematical community. She has, for many years, been a highly 
sought speaker at local, regional, national, and international professional meetings, 
where she conveys an inspirational sense of dedication to mathematics teaching. 

In 1987, her institution, the University of Missouri at Kansas City, recognized 
her accomplishments by appointing her to the distinguished Curator’s Professor- 
ship, through which she carries on both the dedicated teaching and the outstand- 
ing leadership for which we honor her. 


Conway’s Challenge Sequence 


CoLin L. MaLLows, AT&T Bell Laboratories, Murray Hill, New Jersey 07974 


CoLin MAL.Lows has been doing research in statistics at AT & T Bell Labs 
since 1960. He sees his main challenge to be that of making his subject seem 
less like sorcery. 


1. Introduction. In a lecture at AT&T Bell Labs [1], J. H. Conway introduced 
the sequence 


1,1,2,2,3,4,4,4,5, 6,7, 7,8, 8, 8,8,9,... 
defined by the recursion (for n > 3) 
a(n) = a(a(n — 1)) + a(n — a(n — 1)). 


He had proved that a(n)/n — 1/2, but admitted to being unable to establish the 
rate of convergence. He offered a modest prize for determining the rate, and a 
most immodest one for the exact n at which |a(n)/n — 1/2| last exceeds 1/20. 
Notice that this is not a “find the next term” problem. Here we already know the 
rule for generating the sequence; the challenge is to develop enough understand- 
ing of its structure so that rigorous proofs can be given. To claim the larger prize it 
would not be enough merely to find a number that “looks like” the right answer. 
No (finite) amount of computation will suffice to prove that apparent regularities 
persist indefinitely. 

It turns out, as we shall see, that this sequence does have much hidden 
structure. In fact, there are simple rules that work much faster than one-step- 
at-a-time; there is even a formula that enables us to compute a(m) for any n. 
However it is not immediately clear how to prove these results. 

What follows is an account of my search for structure in Conway’s sequence, 
culminating in a proof that the structure that is found does in fact persist. Then 
there is an explanation of how Conway’s number was calculated. Finally, there are 
comments on some related problems. 

My own interest in this problem is two-fold. First, there is the simple pleasure 
to be found in taking up a challenge. But also, since I am by profession a 
statistician, I am interested in the process by which hidden structure (usually in 
statistical data, rather than a mathematical puzzle, as in this case) can be found. 
John Tukey and I have remarked [2] that success in an exploratory investigation is 
crucially dependent on 


(i) a willingness to collect and study the data, 
(ii) use of diagnostic techniques to show the unexpected, 
(iii) an ability to recognize striking patterns, 
(iv) enough understanding of the context of the problem to enable these 
patterns to be recognized as potentially meaningful, 
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(v) avoidance of precipitate commitment to models of clearly inadequate 
complexity, and 
(vi) energetic following-up of the clues obtained. 


All these components, except possibly (v), operate in the present case. 


2. First steps. Let us look at some values of a(n). 


n= 1-20 1 12 2 3 4 4 4 5 6 7 7 8 8 8 8 9 10 11 12 
21-40 12 13 14 14 15 15 15 15 16 16 16 16 17 18 19 20 21 21 22 23 
41-60 24 24 25 26 26 27 27 27 28 29 29 30 30 30 31 31 31 31 32 32 


61-80 32 32 32 32 33 34 35 36 37 38 38 39 40 41 42 42 43 44 45 45 
81-100 46 47 47 48 48 48 49 50 51 51 52 53 53 54 54 54 55 56 56 57 


Some striking patterns are readily apparent. The differences 
Aa(n) = a(n) — a(n — 1) 


are always 0 or 1, and this property is clearly persistent; if Aa(m — 1) = 0 then 
a(a(n — 1)) = a(a(n — 2)) so the first component of a(n) stays constant, while if 
Aa(n — 1) = 1 then the second component of a(n) stays constant. This shows that 
a(n) <n for all n > 3, so that the sequence is well-defined. 

Next, we notice repeated occurrences of 2’s, 4’s and 8’s; in fact the last 2 occurs 
at n = 4, the last 4 at n = 8, etc. Applying strategy (ii) above, when we find 
anything, we should try to “subtract it out” so that a deeper level of structure can 
be seen. Here, we look at the function 


A(n) =2a(n) -—n 


which will be zero when n is a power of 2. Now successive values will differ by +1. 
Notice that 


A(n) = A(a(n — 1)) + A(n — a(n — 1)). 


FiGurE 1 shows the first 384 values of A(n). It shows that A(n) is positive and 
(surprise!) symmetric between its zeroes! If the positivity (and the humps!) persist, 
the values of A(n) in any hump will be generated from the values in the 
immediately preceding hump; there will be no longer-range dependence. Let us 
number the humps so that the kth hump extends from 2* to 2*t?. 


0 100 200 300 


Fic. 1. A(n) = 2a(n) — 1 for l=n = 384. 
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It is not clear how a hump is generated from the previous hump. The process is 
not local. For example, a(100) (in the sixth hump) is computed as a(a(99)) + 
a(100 — a(99)) = a(56) + a(44) = 31 + 26 = 57, with the two components being 
far apart in the fifth hump. 

Let us look at the heights of the humps in the middle. 


n = 3 A(n) = 1 n=6 A(n) = 2 
12 2 24 4 
48 6 96 12 
192 20 384 40 
We recognize the sequence 1, 2,6,20,... as being central binomial coefficients, 


with 2,4,12,40,... being just double these. At this point it is worth computing 
more terms in the sequence, to see whether these regularities persist. FIGURE 2 
shows 6144 terms of A(n), and indeed they do, at least this far. 


100 200 300 400 500 


0 


0 1000 2000 3000 4000 5000 6000 


Fic. 2. A(n) for 1=n =—6144. 


3. The breakthrough. How can we describe the shapes of the humps? Let us 
define 
D(n) = A(n) — A(n — 1) = 2 Aa(n) - 1 
and 
D,(j) = D(2* +j) for 1<j <2*. 
Some values are 


D;;111-1 1-1-1-!1 
Dz: 111 1-1 #1 =+#21-11-1-11-1-1-1~-!1 


etc., which are conveniently encoded as 


dz; 3-1 1-3 

ds) 4-1 2-1 1-2 1-4 
ds) 5-1 3-1 2-1 1-2 2-1 — 

ds) 6-1 4-1 3-1 2-1 1-2 3-1 2-1 1-2 _ (negative reverse) 
We notice that d, has 2*~' elements and is skew-symmetric. Also the sum of the 
absolute values of the elements is 2* (we knew that!). However, it is not clear how 
one d can be derived from the previous one. (Even assuming that this is what 
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continues to happen!) Earlier, we found a hint that odd and even humps should be 
considered separately, so let us compare d, with d,, and d, with d,, etc. Now we 
find another regularity: for k = 3,...,9, d,,, can be obtained from d, by the 
following operation, which we call Rule L (for Local). 


Rule L. 
(i) replace each element x in d, by the string of elements /(x) where 


1) =2 -1 1 

(2) =3 -1 2 

(3) =4-1 3-1 2 

(4)=5 -1 4-1 3-1 2 

(5) = 6 -—1 5-1 4-1 3-1 2 

L(6) = 7-1 6-1 5-1 4-1 3-1 2, 
etc. 


I(—x) = —reverse of /(x). 


(ii) increase the first element of the new string by 1 and decrease the last 
element by 1. 

Notice that /(1) and /(—1) are anomalous. Of course we have not yet proved 
that this rule works, we have only observed that it does in the first seven instances. 


4. We need more insight. It is very awkward that Rule L connects the kth 
hump (from 2* to 2+!) with the k + 2nd hump. The step to the k + 1st hump 
seems more complicated. Let us look at how D,,, comes from D,. Define (for 
n > 3) 


F(n) = A(a(n — 1)) — A(a(n — 2)) 
G(n) = A(n — a(n — 1)) —A(n —- 1- a(n —- 2)) 
so that 
D(n) = F(n) + G(n) 


and F(n),G(n) are zero or +1. We see that the definition of Conway’s sequence 
implies that for all n > 3, 


F(n)G(n) =0 


while the successive nonzero values of F(n) reproduce D(n), and the successive 
nonzero values of G(n) also reproduce D(n). Let us write 


F,(j) = F(2* + j) 
G,(j) = G(2* + j). 


To establish that the regular behavior persists, we need to prove that the succes- 
sive nonzero values of both F, and G, reproduce D,_,. Now look at some data. 
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D= 1 -1 

D,= 1 1-1 -1 

F, = 1-1 

G,= 1 —1 

D;= 1 1 1 -1 1 -1 -1 -!1 

F, = 1 1-1 ~1 

G,= 1 1 -1 -1 

D= 1 1 1 1 -1l 1 1 -1 1 -1 -1 1 -1 -1 -1 -!l 
f= 1 1 1 -1 1 -1 —1 —1 

G,= 1 1 1 —-1 1 —-1 -1 -1 


We see a new regularity: the nonzero sections of F, all end with a —1, while 
the nonzero sections of G, (except for the last one) all end with a +1. In fact it 
seems that D,,, can be obtained from two copies of D, by the following 
operation, which we call Rule C (for both Cut and Conway): 


Rule C. 

(i) cut the first copy of D, after every —1, and cut the second copy after every 
+1. 

(ii) interleave the two copies, starting with the first piece of the second copy. 


Lemma. For k > 1, Rule C generates (the differences of ) Conway’s sequence. 


Proof. We have D, = 1 — 1. We prove by induction that for all k > 1, D,() = 
1, D,(2*) = —1, the sum of the elements of D, is zero, and D,(j) = D(2* + j). 
All this is clearly true for k = 1. Suppose it is true for k. Then according to Rule C 
the first copy of D, is cut into 2* pieces, and the second copy into 2* + 1 pieces. 
So the interleaved sequence D' has D'(1) = 1, and D‘(2**') = —1. Also the sum 
of the elements of D’ is zero, since this is true of both copies of D,. Finally we 
must show that D' and the expanded versions F',G’ of the two copies of D, are 
exactly D,41, Fy4,, and G,,, respectively. We examine four cases: 

F'(j) = 1. Here D' = 1, and since this is not the last element of a section of F’, 
Fj + 1) #0. 

F'(j) = -—1. Now D' = —1, and Fj + 1) = 0. 

Similarly for the two cases with G’ ¥ 0. 

In all cases, D', F', and G’ track D,,,, Fi41, and G,,, exactly. 


At this stage we have proved that a(2“) = 2*—!. Also it is clear, by an inductive 
argument, that the cumulative sums of D, are non-negative, so that the humps of 
A(n) never go negative. However we have not proved that D, is skew-symmetric. It 
is possible to push the following argument through without this, but it is more 
elegant to derive the skew-symmetry from a beautiful and more general result due 
to P. Frankl and J. Kahn, (written communication, Sept. 1, 1988) as follows. Start 
with any vector X in {—1,1}”. Let X* be the negative reversal of X. Note that 
X = X* iff X is skew-symmetric. Cut X after every —1, and cut X%* after every 
+ 1. Now interleave the pieces to give a sequence Y in {—1, 1}*”. (It turns out that 
the numbers of pieces can differ by at most one. If the numbers are equal, we may 
start with either X or X*.) 
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THEOREM. (Frankl and Kahn). Y = Y*. 


The proof is by induction on n. 
Applying this result to D,, we see that D,,, is skew-symmetric. 


5. Why does Rule L work? We are concerned with D,, where 
D, = C“~*(D,) 
with 
D,=1-1 


and C is the rule given above. We want to prove that for k > 3, D,,, = L(D,) 
where L is the substitution rule that we guessed above. Let’s look at more data: 


D,=11111-111-11-1--11-1--1---1----- 
Fs= 1111- 11- 1- - 1- = _ - 
G,=1 1 1 24-1 21 #-1 --1 ---- 


Our L-parsing rules want to view sections of D; as corresponding to the 
elements 3,— 1,1,— 3 of d,. We mark these divisions: 


D,=11111-111-11/-1--/11-1/--1---1- ---- 
F5= 1111- 11- 1/- - /i1- s- = ~ 
G,=1 1 1/ 1 -/1 1/7 -1 --1 ---- 


which also divide F; and G, into sections. Since F; and G; are expanded versions 
of D,, the sections give parsings of D,. To keep the notation specific, we refer to 
D,(F;) and D,(G;). Calculations up to the case k = 8 show that in all these cases, 
the sections of D,_,(F;,,) that are obtained when D, is L-parsed according to d,_, 
can be obtained by the following rule. 


Rule M. 
(i) replace x (in d,_,) by m(x), where 
m(1)=1-1 
m(2)=2 -1 1 
m(3) = 3 -1 2-1 1 
m(4) = 4 -1 3-1 2-1 1 
etc. 
m(-—1) = —2 
m(—2) = —2 
m(-3) = -3 
m(—-4) = -4 
etc. 


(ii) increase the first element by 1. 


Notice that m(1) and m(-—1) are anomalous, as with Rule L. 

The rule for D(G), called rule M, is to replace x by the negative reverse of 
m(—x), and to reduce the last element by 1. 

The M and M rules do not work if we start with an arbitrary skew-symmetric 
vector, though they do seem to for the special ones that are generated from D, 
(but only from k = 5 onwards). We need to show that the M and M rules are 
consistent with Rule L, and also with Rule C (which defines Conway’s sequence). 
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A one-line explanation of how it comes about that the local rule L works, when 
Rule C is not local, is that the vectors F and G carry the non-local information, 
while their reduced forms D(F) and D(G) obey local rules. 


THEOREM. For k > 5, the vectors D,, F,,G, that are generated from D, by 
repeated application of Rule C satisfy the following: 


D;, = L(d,_2) 
D,_\(F;) = M(d,_2) 
D,~\(G,) = M(d,_). 


Proof. Consider how Rule C gets D, from D,_,. Over the range 2 <j < 2* — 1, 
we have 2*~? pieces of the first copy of D,_,, and an equal number of pieces of 
the second copy. We shall show that over this range, the interleaved pieces are 
consistent with the first parts (part (i)) of Rules L, M, and M. Then the second 
parts of the rules will take care of making D,(1) = 1 (corresponding to an extra 
section of G,) and D,(2*) = —1 (corresponding to extending the last piece of G, 
by an additional —1). All we have to do is to check that the sections of D,_,(F,) 
and D,_(G,) that are generated by Rules M and M fit together properly. 

First, for d,_(j) = +1, we have 


d,_»> = +] d,_j) = —] 


inD, 11-1 -1- - 

in F, 1-1 - - 

inG, 1 1 1 - 
while for d,_,(j) =x > 2 the pattern is consistent: 

in D, 1147 -117'-1--- -11 

nF, W- wr- ws = | 

inG, 1 1 1 1 
and for d,_(j) = -y < -2 

inD, - -1-—-?1+:: --? 

in F, — — oe ee 

inG, —-1 —*1s) 


We see that in all cases, D,_ ,(F,,) and D,_,(G,) do agree with what Rules M and 
M require. The only slight complication is that the influence of each section does 
extend a little on either side. Thus for d,_,(j) = 1 we can fill in some adjacent 
values: 


nD, —/11-1/- 
in G, J 1 1/ 
and similarly in the other cases. But this is O.K., since successive elements of d 


have alternating signs so these adjacent entries are always consistent with what is 
needed. @ 


6. Computations. Now that we have an efficient way (Rule L) of extrapolating 
Conway’s sequence, we try to use it to answer his two questions. We are interested 
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in the ratio a(n)/n, or equivalently A(n)/n, so interesting places are the “first 
maxima” that occur at 


n = 11 23 44 92 178 370 719 1487 
A = 3 5 8 14 24 48 79 149  etc., 


the “‘highest foothills” that occur at 


n = 20 82 £335 ~~ 1361 
A = 4 12 39 135 ete., 


in the even humps, and at 


n = 41 £168 684 
A = 7 22 74 ~~ «etc., 


in the odd ones, and also the lesser foothills that are visible in the figures. Let us 
look first at the stretch between the middle of the kth hump and the “first 
maximum,” or equivalently (by symmetry) the stretch between the middle and the 
“last maximum.” Let db, be the second half of d,. This generates the right-hand 
half of the kth hump, from the middle at n = 3-2*~! to the upper zero at 
n = 2*+!. We place an asterisk by the term that gives the “last maximum.” The 
vectors are parsed according to Rule L (we need only part (i)). 

db, 1* —3 

db,  1* -2 1-4 

db; 2-1 1*/-2 1-3 1-5 

dbp 2-1 1*/-2 1-2/ 2-1 1/-2 1-3 1-4 1-6 

db, 3-1 2/-1 1-2/ 2-1 1*/-2 1-3/ 2-1 1/-2 1-3 1 -4/... 

dbs same as db. this far... 

db) 4-1 3-1 2/-1 1-2/ 3-1 2/-1 1-2/7 2-1 1/-2 1-2/ 

3-1 2/-1 1-2/ 2-1 1*/-2 1-3/... 
db 19 same as dba this far... 


We see that the entry in db, that generates the “last maximum” in the kth hump 
propagates to the entry in db,,, that generates the “last maximum” in the 
k + 2nd hump. Also, each even hump behaves exactly like the preceding odd 
hump over this stretch. So all we need look at are the leading parts of these vectors 
for k odd. Put 


k=2m+1 
and call the truncated vectors h,,. Keeping the parse marks for clarity, 
h,=1 
h, = 2 —] 1 
h,=3-1 2/-1 1-1/2-1 1 
h,=4-1 3-1 2/-1 1-2/3-1 2/-1 1-2/ 


2-1 1/-2 1-3/3-1 2/-1 1-2/2 -1 1 


Notice that these vectors are generated by a completely regular rule; we don’t 
have to worry about the anomalous part (ii) of the full L-rule. What we need are 


Lh pCi) 
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which gives the excess of the maximum over A(3 - 27”), and 


Llhm( ADI, 


which gives the position of the last maximum, measured from 3 - 27”. 


7. Another breakthrough. We do not need to capture the detailed structure of 
h,,- All we need is the frequency distribution of its elements. Thus all we need to 
study are these frequency distributions: 


value of element of h,, 


—4 —3 —2 —1 1 2 3 4 5 
fh, 1 
fh, 1 1 1 
fh, 1 3 2 2 1 
fh, 1 4 9 6 5 3 1 
which are propagated by the following consequence of Rule L: 
an element gives rise to (column headings as above) 
with value 
—3 1 1 1 2 
—2 1 1 1 
—1 1 1 1 
1 1 1 1 
2 1 1 1 
3 2 1 1 1 
4 3 1 1 1 1 


etc. We can regard this array as an infinite matrix T, so that fh, ,, = jh,, *T. 
Now some numerology leads to formulas for the entries in these vectors of 
frequencies. If we define 


, Ka" (2k + 
SCI; r) = » (7% ‘| 
then for j > 0 
fh, (1) = 1+ S(2,m — 3) 
fh, (2) = 1+ S(1,m — 3) 
fh, (j + 2) =S(j -1,m—-j - 2), 
and we can verify that these propagate correctly. Hence we get formulas for the 


height and position of the “first maximum” in the 2m + 1st hump, both relative to 
the center value: 


height(m) = S(0,m — 1) — S(2,m — 2) 
position(m) = S(1,m — 1). 
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Before we can translate this into the height of the maximum of A(7) itself, we 
still need to establish the value of A(3 - 27”). So we go through the same kind of 
analysis, using the full vector da, (first half of d,). Now we have to use part (ii) of 
Rule L, but things turn out simply anyway; the frequency distributions are 


-5 -4 -3 -2 -1 1 2 3 45 67 8 9 


in da, 1 1 
da s 1 3 1 1 41 1 
da. 1 4 11 5 4 32 1 1 
day 1 65 160 «42 22 16«11«7 4 2«21 1 


etc. These frequencies turn out to have simple forms: 


m—2 
for 1 <x < 2m — 1, the frequency of x in da,,,,,is ), (°" ; : } 
j=0 

Also there is a single x = 2m + 1. 

2m—-y-1 

2m-y-—1 

for —m < —y < —1, the frequency of —y in da,,,,,is ) | > | 

jJ=m-1 J 
and these propagate correctly. Hence the value of A(3 : 2”) is indeed (27) as we 
guessed above. So finally the “‘first maximum” in the 2m + 1st hump 


occurs at n = 3-27?" — S(1,m — 1) 
and has value A[n] = 1+ SC1,m — 1) + S(O, m — 1). 


It is easy to see that at this “first maximum” (and indeed at the middle (3 - 2?”) 
we have 


A(n)/n = O(m-’?) = O((logn)~”’). 


Calculations show that A(n) at these “first maxima’ decreases in successive 
humps, and is first below .05 in the 32nd hump. We need to check that the 
“foothills” have not grown large enough to make A(n)/n > .05 in this hump. To 
expedite the ensuing calculations, let us calculate the progeny of a term x in some 
d-vector, not at either end. Writing 


An = >t+ 37 
AA= 37-23, 


the same kind of analysis as above (which was for the case x = 1) shows that after 
p stages of expansion the term x becomes a stretch given by 


x >* > 

1 S(0, p) S(2, p — 1) 

2 14+ Sd, p) SB,p—-D 

3 1+ S(0, p) + SQ, p) SQ, p — 1) + S(4,p — D 

4 1+ S(0, p) + SQ, p) + s@, p) S(2, p — 1) + SG, p — 1) + SG, p — 1) 

5 1+ S(0, p)+ SQ, p) + SQ, p) + S(4, p) SQ2,p —1)+ SB, p-1)+ S4,p —- D+ S6,p — 1) 


and so on. For a term —x we simply interchange }* and >~. 
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Some values are given in the table. 


1 2 3 4 
x= An AA An AA An Aa An AA 
p=0 1 1 2 2 3 3 4 4 
1 4 2 6 4 11 7 17 11 
2 14 4 21 9 42 18 70 32 
3 49 9 77 23 161 51 281 99 
4 175 23 287 65 617 155 1112 320 


For example, starting with x = 2, after two expansions using the first part of 
Rule L we have 


4-1 3-1 2/-1 1-2/ 3-1 2 


with An = 21, AA = 9. For x negative, simply change the sign of AA. 

Now it is easy to calculate the progeny of the stretch —1 2 that takes us from 
the foothill to the first maximum in each of the third and fourth humps. We find 
that in the 32nd hump the foothill has A(n)/n > .05, while in the 33rd hump 
A(n)/n stays below .05. 

What follows is an abbreviated account of a calculation to determine the last n 
for which A(n)/n > .05. This was done essentially by hand, except for using the S$ 
language [3] as a glorified desk calculator. Full details are available from the 
author on request. Start with da, = 4 —1 2 —1 and label three points Do, p,, p, 
as shown. After 14 cycles of expansion by Rule L, these correspond to points 
P, P,, P, on the 32nd hump, whose positions can be calculated as above. We find 
that at P,, A > n/20, while at P,, A <n/20. We expand the section between p, 
and p, in da, by Rule L, and look at the corresponding section in da,. Now we 
identify new points p, and p,, compute n and A at the corresponding points P, 
and P, in the 32nd hump by 13 cycles of expansion, and so on. By the time we get 
to da,, we have pinned down the critical value of n to the stretch between P,, at 
n = 6083008361 and P,, at n = 6083010090. Detailed calculation of this stretch of 
da, (by fourfold expansion of the stretch — 4 3 in d,,) enables us to draw FIGURE 
3 and to identify the critical n as 6083008742. 
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Fic. 3. The last time A(n)/n exceeds 1/20 is at n = 6083008742. 
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P2 Po d¢ 


8. An aside. When I first tried to compute Conway’s number, I neglected to 
check the foothills of the 32nd hump and so arrived at a point on the 31st hump, 
with n = 3173375556. In this calculation I took only a few steps of the type 
explained above, and used a Cray computer (not for its speed, but because it has a 
large memory) to compute 1747771 terms of d,, and to search for the correct n. It 
is amusing to observe that all this effort was unnecessary; neither of these large 
numbers is the correct response to Conway’s challenge. Conway and I both forgot 
that we were looking at A(n), not a(n); a(n)/n — 1/2 is last above 1/20 at 
n = 1,489. 


9. Asymptotics. We have 
da,=1—-3 
da; =2-1 1-2 1-3 1-5=L(1-2) 1-5 
and it is easy to show by induction that for k = 2m + 1 
da, = L™~-*(1 — 2)L™-*(1 -4) «++ L'(1 — (2m — 2)) 1 —(Q2m + 1). 


Hence the following points are on the 2m + 1th hump, and the function that 
interpolates them linearly lies everywhere below the other points of that hump: 


ny = 3-22" = middle Ay = (2 | 
_ 2m+1 _{ 2m 
ny = no + ( m 4,= (77, | 
2m +1 2 
mome(RS!) ase ( 
1. 


Ny =Nm-p~t2mt+i1 


m 


As m > &, the points (n,2-°", A,m/2)'/*2~*”) for j = 0,1,...,m converge 
to the curve with parametric definition 


x=2+20(u) 
y=¢(u) -~<u<o 


where ¢,@® are the standard Gaussian density and cumulative functions respec- 
tively. FIGURE 4 shows this curve. The extreme tangent has slope .138015 (corre- 
sponding to u = —.27603) and occurs at x = 2.78253. 
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0 1 2 3 4 
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Fic. 4. The asymptotic shape of the humps in A(n). 


What happens between these “interior” points? FiGURE 5 shows what happens 
if we divide the kth hump (suitably scaled) by the asymptotic function, for 
k = 3,...,12. It seems that the ratio may be tending to unity for all x in (@, 4), but 
I cannot prove this. If this conjecture is correct, an estimate of the maximum value 
of a(n)/n — 1/2 for 2-2?" <n < 4-2” is .09759m 1/7, An estimate of n,, the 
largest n for which A(n)/n > «, is 5.565 - 27” where m is the largest integer in 
(.346 /e)? /7r. 


12 14 1.6 


A /y (standardized) 


0.6 0.8 1.0 


0 1 2 3 4 
x 


Fic. 5. Sections of A(n)/y, standardized. 


We can get a rigorous bound by noting that in the kth hump, A(n)/n cannot 
exceed M,2~* where M, is the maximum of A in the kth hump, which we found 
above. Numerically, 


epsilon bound estimate 
01 7.5 + 101104 1.2 - 1058 
02 8.9 - 10277 2.5 - 10'* 


10. Some further comments. D. R. Hofstadter (private communication, Sept. 2, 
1988) has found some ancient notes in which he studied Conway’s sequence, and 
derived the following rule for its behavior. Let 


f(i) = #{n: a(n) = 3} j=1,2,3,.... 
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Breaking this sequence at powers of 2, we find 


2 
2 
13 
1124 
11121235 
1111211212312346 


and there is a simple (local) rule for generating each section from the previous 
one; in fact Hofstadter found two such rules. 


(i) Reading the previous row backwards, replace any integer k > 1 as a group 
of k — 1 1’s followed by a 2, 
Replace any group of k 1’s by the group3 45 ... k +2. 
Add 1 to the last number you produce. 

(ii) Reading the previous row forwards, replace k by 123 ... k —1k, except 
when k is the rightmost number of its line, in which case replace it by 1 2 3 
. K-LkK+1. 


I do not see how to prove that these rules work, except by an analysis similar to 
mine. (The details for (ii) are rather simpler than for my rules. For (i) we seem to 
need the symmetry property.) 

There is even a formula for a(n), due to G. Phillips, shown to me by Conway on 
Sept. 9, 1988. Consider the following array. 


1 1 

2 1 1 

4 1 2 1 

8 1 3 3 #1 

16 1 4 6 4 1 

32 1 5 10 10 5 1 


etc. Given any n between 2* and 2**!, find the row that starts with 2*, and mark 
as many successive elements as possible without letting the sum of the marked 
elements exceed n. When the next entry in that row would take the sum above n, 
move up one row and continue. At most one element in each column may be 
marked. Stop when the total of the marked elements is n. For example, when 
n = 23 we mark the elements 16, 1, 4 in the fifth row, then the first 1 in the third 
row, and finally the first 1 in the second row. 

Now a(n) can be read off as the sum of the elements that are north-east of each 
of the marked elements. For n = 23, these are 8, 1,3, 1,1, so that a(23) = 14. 


11. Some related sequences, and further comments. Conway’s sequence is 
anomalous at n = 2. Applying his recursion starting at n = 2 (with a(1) = 1) gives 
a very dull sequence. What if we start in other ways? We have to be careful that 
a(n) stays below n, or the sequence will not be well-defined. An interesting class is 
obtained by starting with a string of m 1’s, applying Conway’s recursion for 
n>m + 1. For m = 3, we get a sequence that proceeds in stretches between the 
Fibonacci numbers, and that has structure very similar to that found for Conway’s 
sequence. Here we have to interleave cut copies of D,_, and D,_,. Rule L (with 
a modified part (ii)) generates D, from D,_,. Similarly for all m > 4; we leave 
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the details as an exercise for the reader. (On 1/4/89 R. K. Guy pointed out 
(personal communication) that this is problem E3274 [5]!) 
Another variation is obtained by writing 


b(n) = b(b(n - 1)) + b(n —1- b(n - 1)), 


but this turns out to be a trivial modification of a(n); in fact b(n — 1) =n — a(n). 
In his delightful book [4], D. R. Hofstadter mentions the (nonmonotone) 


sequence generated by 
q(n) =q(n — q(n - 1)) + a(n - g(n - 2)) 


which has bizarre behavior. The first 1000 values exhibit some striking regularities, 
but these do not persist; FIGURE 6 shows the first 20,000 values. There is obvious 
approximate structure, but I have no idea what happens asymptotically. 


g(n) 
0 2000 4000 6000 8000 10000 


0 5000 10000 15000 20000 
n 


Fic. 6. Hofstadter’s q-sequence. 


Finally, a minor change in Conway’s definition gives the monotone sequence 
c(n) =c(c(n — 2)) + c(n — c(n — 2)). 


Now the increments are very irregular; see FiGuRE 7. It is not clear whether 
c(n)/n tends to a limit. 


3c(n) — 2n 
100 200 300 400 500 


0 


0 1000 2000 3000 4000 5000 6000 
n 


Fic. 7. 3c(n) — 2n for 1=n—6144. 
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Notice te authors: Recently the Monthly has not been processing 
papers because of a very high backlog. This situation has now eased . 


somewhat and we are once again considering new manuscripts for 
possible publication. Thank you for your patience.—The Editors. 


LETTERS TO THE EDITOR 
To the Editor: 


In a recent MONTHLY article [3], the authors give a Taylor polynomial-like 
formula to define f(A), where A is a square complex matrix and f(-) is a 
scalar-valued (real or complex) function of a scalar. Although the authors assert 
that their formula (3.1-2) for f(A) “has not been given previously in this form’’, it 
was published by H. Schwerdtfeger in 1938; see Section 6 in [5]. For an application 
of Schwerdtfeger’s formula to calculating (d/dt) fCA(t)), see [1] and Section 6.6 
of [2]. For a discussion of several equivalent definitions of f(A), including 
Schwerdtfeger’s formula, see [4] as well as Sections 6.1-2 of [2]. 


Sincerely yours, 


Roger A. Horn 
Johns Hopkins University 
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To the Editor: 


I would like to correct an important detail in Richard Guy’s review of my book 
The Mathematics of Plato’s Academy, A New Reconstruction (this MONTHLY, May 
1990, 410-413): the book does contain, on pages 356-360, a description of Bill 
Gosper’s algorithm for continued fraction arithmetic, together with a tableau 
illustrating its use in evaluating /2 x /3. The description ends with an implied 
challenge to give a direct proof that, in this case, the output will be periodic; and a 
much more significant contribution would be to go on and see if it is possible to 
use the algorithm to prove that the output from some example like /2 + /3 is, or is 
not, bounded. 


Yours sincerely, 
Daniel Fowles 


University of Warwick 
Coventry, UK 
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UNSOLVED PROBLEMS 


EDITED By RICHARD Guy 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied by 
relevant references (if any are known to the author) and by a brief description of known partial 
results. Manuscripts should be sent to Richard Guy, Department of Mathematics and Statistics, The 
University of Calgary, Alberta, Canada T2N 1N4. 


A DYNAMIC PUZZLE 


MosHE ROSENFELD 
Department of Mathematics and Computer Science, Pacific Lutheran University, 
Tacoma, WA 98447 


A dynamic puzzle: Start with an n X m rectangle partitioned into nm squares. 

Step 0: Select any square and cover it. 

Step k + 1: Cover a set of uncovered squares by a translate of the current 

covered set. 

Objective: | Cover as many squares as possible. 

Observe that each step doubles the number of covered squares, so that after the 
k-th step exactly 2" squares are covered. In FIGURE 1 below we show a covering of 
128 squares of a 14 xX 14 square (a cover of size 128, in the rest of this note the 
term cover will be used only to describe sets covered using the above steps). We 
invite the reader to try to cover as many squares as possible on a 7 X 7 square 
(327), on a 23 X 23 square (512?) or on a 47 X 47 square (20487). 
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Fic. 1. A 14 X 14 square with a cover of size 128. The numbers inside the squares mark the image of 
the first square (0) when the k-th translation is executed. The figure shows a cover of size 64 for the 
7 X 14 rectangle. There are two distinct ways to obtain a cover of size 128. One cover of size 128 is 
obtained by placing a copy of the cover in the 7 X 14 bottom rectangle, the second can be obtained by 
translating the current cover down to the 6-th row. 
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The term puzzle is frequently used to describe a game or problem where one is 
asked to put pieces together to form a given shape. In this sense this puzzle can be 
viewed as a dynamic puzzle, the pieces to be fitted into the initial rectangle change 
dynamically. Every step determines the shape of the subsequent piece to be placed 
in the rectangle. For a given rectangle, we are interested in finding the largest 
number of squares that can be covered and the number of distinct shapes of the 
final covered set. Clearly, one cannot cover more than 128 squares of a 14 x 14 
Square. From FiGurE 1 one can immediately see that there are at least two distinct 
final shapes, in fact, FiGuRE 1 also indicates a cover of size 128 for the 14 x 13 
rectangle. While the rectangle is a “simple” object, it is not so nice “mathemati- 
cally’, not all squares are “created equal’ even if the initial rectangle is a square. 
We can do our dynamic puzzle on a “nicer” object, the torus. One advantage of 
using the torus is the choice of the first square. Since all squares are “created 
equal,” we can choose the first square as we wish. Clearly, if an n X k rectangle 
has a cover of size 2” then so does the n X k torus, the reverse is not true. The 
6 X 6 torus has a cover of size 32 while the 6 X 6 square does not. Two covers are 
distinct if one is not congruent to the other. In FiGURE 2 we show two distinct 
covers of size 32 of the 7 X 7 torus (we invite the reader to determine whether the 
7 X 7 square admits a cover of size 32). 
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Fic. 2. Two distinct covers of size 32 of the 7 X 7 torus (actually cylinder). 


The puzzle on the torus exhibits some unexpected phenomena. Consider for 
example the three tori of sizes 5 X 14,6 X 12 and 5 xX 15. They have 70, 72 and 75 
squares respectively. Figure 3 shows covers of size 64 for the first two (the 
“smaller”) tori. Surprisingly, it is not possible to cover 64 squares of the “larger’’ 
5 x 15 torus (a proof is included in [1]). 


5 


Fic. 3. A cover of size 64 of the 5 < 14 torus 
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Fic. 4. A cover of size 64 of the 6 < 12 torus 


This puzzle has a natural generalization to higher dimensional boxes and tori, 
exhibiting the same “paradoxical” behavior. For example we can show that the 
7X 7X7 torus (with 343 cubes) does not admit a cover of size 256 but the 
“smaller” 3-dimensional 6 X 7 X 8 (only 336 cubes) does admit a cover of size 256. 


While the smaller size problems can be solved by backtracking on a computer, the 
backtracking becomes intractable very quickly. We used backtracking to prove the 
nonexistence of a cover of size 64 for the 5 X 15 torus but only after finding a 
substantial reduction in the number of possibilities. Most of the results known to 
us are for the 2-dimensional puzzle on the torus and the square. The following is a 
sample of our results. If m and k are relatively prime then the n X k torus has a 
cover of size 2!!°82”*] (largest possible cover). If 3*2”-!' <k < 2”t! then it is 
possible to cover 27”*! squares of a k X k torus. We have similar results for the 
k X k square. 

The following is a list of related interesting problems. 

1. The smallest square torus for which we do not know the size of the largest cover 
is the 47 x 47 torus. More generally, for all sizes k > 47, for which 2”V2 <k < 
3%2”—! we do not know whether it is possible to cover g2n+1 squares of the k Xk 
torus (or square). 

2. For any pair of integers nm and m, determine the largest possible cover of the 
n X m torus (or rectangle). For the answer in case gcd(m, n) = 1 see [1]. 

3. For any given sizes, determine the number of distinct final maximal covers. 

4. The higher dimensional puzzle seems much more complicated than the 2-dimen- 
sional. It will be interesting to obtain bounds for such tori (or boxes). 

5. Sherman Stein suggested the following interesting variation of this puzzle: 
instead of starting with a single square in step 0, start with some initial set of 
squares, for example start with three squares in a row or L-shaped and then 
continue to duplicate the covered set. Again the objective is to cover the largest 
possible number of squares. 


REFERENCE 


1 B. Dorner, C. Meyer and M. Rosenfeld, A square puzzle (in final stages of preparation, preprints 
available). 


NOTES 


EDITED BY DENNIS DETuRCK, DAvip J. HALLENBECK, AND RODICA SIMION 


A Continued Fraction and Permutations With Fixed Points 


HENRI DARMON 
Department of Mathematics, Harvard University, Cambridge, MA 02138 


JOHN McKay 
Department of Computer Science, Concordia Univ., Montreal, Canada H3G 1M8 


The set >, of all permutations on ©, = (1, 2,3,..., n} is partitioned into FPF, 
permutations acting without fixed points and FP, acting with at least one fixed 
point. These numbers are related by 


[>| =n!=FPF,+ FP, for n=0, 


with the convention that FPF, = 1 and FP, = 0. We shall index all sequences 
from 0. 

Let V denote the two-dimensional real vector space of sequences {t,} satisfying 
the recurrence 


Ln4+ = n(t,, + tr—1)- 
To convince the reader that V is worthy of study, consider the following: 
THEOREM. The sequences {n!}, {FPF}, and {FP_} belong to V. 


Proof. The proof that {n!} € V is trivial. To show that {FPF} © V we observe 
that any fixed-point-free permutation in >,,,, can be obtained from the identity 
permutation in exactly one of two ways: 


(1) by applying a permutation with a unique fixed point i € &, to the first n 
elements, and then transposing n + 1 and i, or: 

(2) by applying a fixed-point-free permutation to the first n elements, followed 
by a transposition of n + 1 and some i € O0,. 


There are n possible choices for i, FPF,_, possible permutations on 1,...,7 
which leave only i fixed, and FPF, fixed-point-free permutations on the first n 
elements. So FPF,,, = n(FPF,, + FPF,,_,), and hence {FPF,} € V. The last state- 
ment of the theorem follows from the identity FP, + FPF,=n!. & 


The vector space V also contains a sequence which converges to 0, which means 
that the null sequences form a proper (one-dimensional) subspace of V. This is the 
content of the next theorem: 


MaIn THEOREM. The sequence {I} given by 
° nl —Xx 1 nl 
[= e*—1l)e* “a= —t) edt 
J ¢ ) [(-#) 


belongs to V. Moreover, I,=1-—1/e, I, = —1/e, lim,.,... J, = 0, and Wn, 
Z| < 1. 
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Proof. That {I,} satisfies the appropriate recurrence follows from integration 
by parts. It is a trivial exercise to evaluate J, and /,. The limit of {J}, as well as the 
bound, follow from dominance of J, by [se -* dx. @ 


Let us denote by S(a, b) the sequence {t,} © V satisfying the initial conditions 
ty = a, t, = b. The previous results can be summarized as follows: 


S(0,1) = {FP}, 
S(1,0) = {FPF,}, 
S(1,1) = {n!} = (FP, + FPF}, 


1 1 
s(1 _=,- =| = {1}. 
e e 
Choosing an appropriate basis for V, one has 


S(0,1) = [1 ~ ~}8(.0 ~ s(1 - ~,- =|, 


e 
which leads to the following result: 


1 
FP, = [1 ~ = |nt- [otre de. ( *) 
0 


é 


The approximation FP, ~ (1 — 1/e)n! is well known in elementary combinato- 
rial probability, where the result is usually stated as: 


THEOREM. After a random shuffle of a deck of cards, the chances that at least 
one card will remain fixed is very close to 1 — 1/e. 


Formula (*) provides a closed form expression for the error term. 
Our investigation concludes with an evaluation of a continued fraction which is 
closely related to the previous formula: 


THEOREM. 


Proof. The right-hand side converges and is equal to lim, _,.. p,,/q,,, Where {p,}, 
{q,} © V, and py = q, = 1, D1 = dp = 0. So 


1 . (1,0) Ife 1 


=| Mei eg 
2 m™n>='s(0,1) 1—-l/e e-1 


1+ ——— 
ate 


The theory of the recurrence relations determining the p,,q, appears in [1, 
p. 492], [2, p. 15]. An evaluation of the continued fraction is given as an exercise in 
[1, Ch. XXXIV, Ex. 24, p. 576], but it is arrived at quite differently, using a 
classical formula of Gauss which gives continued fraction expansions of ratios of 
hypergeometric series. 
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Smoothness From Finite Points 


VICTOR NEVES 
University of Iowa, Mathematics, Iowa City, IA 52242 
UBI, R. Marqués D’Avila e Bolama, 6200 Covilha, Portugal 


1. Introduction. Like other branches of mathematics, infinitesimal analysis 
allows some informality; but careless use of terms may lead to wrong answers. 

Unfortunately, there is a serious error in the definition of tangent plane given in 
[2]. It is closely related to the well-known “Schwarz Accordion,” which is treated in 
[4, p. 124] (with incorrect algebra). There also seems to be a tendency to blur the 
distinction between “standard” and “finite.” Nevertheless, the main result of [2, p. 
434], can be reclaimed with correct definitions. That theorem is actually proved, if 
one replaces the definition of tangent plane as in section 3, below. Moreover, the 
corrected proof is certainly simpler then the classical one we first learned from [1], 
for it goes more directly to the point. 

Ours is not the only possible correction to Henle’s definition of tangent plane. It 
suffices to require that the three close points that define the nearly tangent planes, 
form a triangle with no infinitesimal angles. 

We use the notation set up in [3] chapters 1 and 1*. R* denotes the nonstandard 
extension of the set of real numbers R. 

We consider R a subset of R*. R”* = (R*)” (nm = 2,3). 

An element r of R* is standard, finite, or infinitesimal if, respectively, r € R, 
Ir| < s for some s in R‘*, or |r| <5 for all s € R*. An element of R”*, (n = 1, 2, 3), 
is standard, finite, or infinitesimal if all its coordinates are, respectively, standard, 
finite, or infinitesimal. We say x is infinitely close to J in R”*, x = y, if x — y Is 
infinitesimal. 

Also, for n = 1, 2,3, if a € R”* is finite and b is the (unique) standard element 
of R”* which is infinitely close to a, we say b is the standard part of a and write 
b = sta. 

Finally, functions and their nonstandard extensions will be denoted with the 
same symbol. 


2. A critique of [2]. Defining closeness of planes with the difference in normal 
unit vectors is dangerous: a plane is not close to itself if one takes opposite 
normals. But let us assume that a choice of unit normals was implicitly assumed. 

According to Henle’s definition [2], a tangent plane to a surface S, at a point u 
of S, is infinitely close to any plane determined by a three-tuple (u,v,w) of 
infinitely close, noncollinear points on the surface. This is simply false. This 
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definition would mean that spheres and cylinders have no tangent planes. For 
example, there are planes containing the normal to a sphere, or a cylinder, which 
intersect the surface in a circle. Three distinct points on a circle (infinitely close or 
not) determine the plane of that circle. In this case three infinitely close, non-col- 
linear points determine a plane perpendicular to the tangent plane. 

Contrary to what Henle states, one cannot characterize differentiability of a 
function, solely with the existence of tangent planes to its graph: when f:R* > R 
is given by f(x,, x,) = xt’, for all (x,, x,) in R?, the graph of f has a (vertical) 
tangent plane at (0,0, 0) and f is not differentiable at (0, 0). 

By avoiding reference to R”* (n = 1,2,3), it is not made clear if the points 
(a, b) in [2] are supposed to be standard or not. The difference is illustrated in 
Theorems 3.1 and 3.2, below. The point of those theorems is that, except for 
functions of class C', derivatives at nonstandard finite points are not necessarily 
infinitesimal approximations to all ratios of infinitesimal increments. If f(x) = 
x? sin(27/x) for x # 0 in R, and f(0) = 0, then for an infinite hyperinteger n, one 
has 


4n+1 on 4/(4n + 1) —1/n 


3. Calculus. The difference between standard and finite. As we still want to 
give a geometrical condition of differentiability in terms of tangent planes, some 
definitions are in order. 

We will always give a plane P contained in R°** by one of its points, a, and a 
unit vector u = (u,,u,,u3) € R°* according to the following equalities 


P = P(u,a) = {x € R*: (x —a)-u=0}. 


We take the plane P((0, 0, 1), (0, 0, 0)) to be “the ground” and say that a plane 
P(u,a) is S-vertical if u, =~ 0. A standard plane is vertical, u, = 0, iff it is 
S-vertical, u, = 0. 

Clearly a unit vector is finite. Also, if a is finite in R°* every finite point in 
P(u, a) is infinitely close to a standard point in P(stu, sta), and this standard plane 
is vertical iff P(u, a) is S-vertical. 

Let f:R? > R be a function and let a = (a,, ay, f(a,, a,)). We say the plane 
P(u,a) in R°* is S-tangent to the graph of f over (a,,a,) if for all (b,, b2) 
infinitely close to (a,, a4), 


b-—a 
|b — all 


Note that a standard plane will be tangent to a graph at the point (a,, a,, f(a,, a,)) 
& R° iff it is S-tangent over (a,, a,). 
From now on denote Af = f(a, + 6, a, + &) — f(a,, ay). 


-u=0 holdswith b = (b,,b), f(b, b>)). 


3.1. THEOREM. Let f:R* > R be a standard function. The following conditions 
are equivalent. 
i) f is differentiable. 
ii) For all standard (a,,a,) € R’, there are standard real numbers a and B, such 
that, when 6 and « are infinitesimal in R* 


Af =aéd + Be + ylI(6, €)||, forsome n = 0. 
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iii) The graph of f has a non-S-vertical, S-tangent plane over all standard (a,,a,) in 
R*. 


This is only a correction to Henle’s remark on differentiability [2, p. 434, line 5]. 
Its proof is indeed an application of standard infinitesimal techniques. 

Consider the following result obtained from Theorem 3.1, by replacing “stan- 
dard” by “finite”: 


3.2. THEOREM. Let f :R? > R be a standard function. The following conditions 
are equivalent. 
i) f is of class C'. 
ii) For all finite (a,, a,) € R**, there are finite numbers a and B, such that when 6 
and « are infinitesimal in R* then 


Af=ad+B+4+yll(6,¢e)|| forsome yn = 0. (1) 


iii) The graph of f has a non-S-vertical, S-tangent plane over all finite (a,,a,) in 
R-*, 


This result is similar to 5.7.9. in [4], with k = 1. 


Proof. i) = ii). Assume i). Let (a,, a2) be finite, 6 and « be infinitesimal. Using 
the «-transfer of the Mean Value Theorem in R* one obtains 


Af = f(a, + 6,a, +e) —f(a,,a, +e) + f(a,,a, + €) — f(a,, a2) 


0 0) 
= Ta, +A6,a,+6)d+ Fa a, + pe)e forsome A, pu & [0,1]. 
Ox, OX 
By continuity of the partial derivatives, it follows that 


af af | 
Af= 5, 641 42) +7, J + |——(4,,4,) + m2 /e with n, = n= 0. 
xX, OX, 


Finally (1) holds with 
t of re) E 
a= ~—_(a4,,a4)), = —_—_(4,,a@)), = .——— + _———, 
ax, 1 2 B ax, 1 2 7) 14 I(5, €) I 12 I(5, €) I 


ii) = i). Assume ii). a = a’ and B = B’ imply 
Af=a'd + B'e + 7'I(6, €)|| with yn’ = Oif 6 =O =e. 
It follows that, when (a,, a,) is standard, we have 
Af = (sta)6 + (stB)e + 7, |I(6, €)|l| with n, = Oif6 =e = 0. 
Thus, according to Theorem 3.1., f is differentiable and 


of of 

jx, 0 a,) = sta, x, 6a a,) = stB, 

when (a,,a,) is standard. Next, by «-transfer of the classical definition of partial 
derivative, when (a,, a,) is finite and 6 is sufficiently small but not zero, by ii) we 
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have 


F) 1 
(aya) ~ 5 l fla + 8,4) ~ f( a1, 42)] 


1 
slad + nial] with n ~ 0 


=a+y7’ withn’ = 0. 
Thus, we may assume 
d of i 
a= jx, (40 42)> B= jx, a0 42) for all finite (a,,a,) € R**. 
We will show that for all finite (a,,a,) € R** and infinitesimal 5 € R* the 
following hold 
a) a(a,+6,a,) = a(a,,a,) = a(a,,a,+68) 
b) B(a, + 6,a,) ~ B(a,,a,) ~ B(a,,a, + 8) 
so that, for 6 = « = 0 
a(a,+6,a,+ 6) =a(a,,a, +6) = a(a,,a,) 
B(a, + 6, a, + €) ~ B(a,,a, + €) ~ B(a,, a5). 
In particular, these conditions hold with (a,,a,) € R? and the partial derivatives 
are continuous. 


We shall only prove a), since b) can be deduced analogously. Let (a,, a.) be 
finite. Expand Af three different ways: 


Af = a(a,,a,)5 + B(a,,a)e + qld, &)Il (1) 

Af =a(a,+6,a,+e)6+ Bla, +6,a,+e)e+7,(I(6, €)l (2) 

Af = f(a, +6, a, +) —f(a,,a, +e) + f(a, + a, + &) — f(a, ay) (3) 
=a(a,,a,+£)6 + 7/6] + B(a,, a) + n,lel. 


By assumption, if 6 = « = 0 then n = n,; ~ 0 (i = 1, 2,3), so combining 1) and 
2), with « = 0, and dividing by 6, a(a, + 6,a,) = a(a,, a,) follows for all finite 
a,, a, and all nonzero infinitesimal 6. 

Next combine 1) and 3) and take 6 = « in order to obtain a(a,,a, + 5) = 
a(a,,a,) for all nonzero infinitesimal 6. 


ii) = iii). Assume ii) and take 
“= (a, B, i 1) 
Ca, B, — II 
P(u,(a,, ay, f(a,, a,)) is S-tangent to the graph of f over (a,,a,) and, of course, 


—1/|a, B, — 1)|| is not infinitesimal, for a and B are finite, so the plane is not 
nearly vertical. 
yn 7 (a, B, — 1) 
iii) = ii). Assume iii). Let i(a.B,- DI be a unit vector which defines a 
a, ? — 


non-S-vertical plane, S-tangent to the graph of f over the point (a,,a,) € R*’. 


1991] NOTES 31 


S-tangency yields 

ad + Be -~Af=ypll(6,¢,Af)l|l, with w = 0. (4) 
Moreover, a and ®# are finite since the plane is not S-vertical. Define p = 
Af/\I(6, €)|l so that |\(6, e, Af Il = V1 + p7IIC6, e)l|. Dividing by ||(, €)]| in equation 
(4) one obtains p = |Ka, B)llcos(@) — u¥1+ p? (with a and B finite and yu 
infinitesimal). The solutions for this equation satisfy p = |(a, B)\lcos(@), so p and 


y1+p?’ are finite. It follows that equation (1) holds with n = uwy1 + p”. 
This ends the proof of 3.2. 
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Anthropomorphic Polygons 


GODFRIED TOUSSAINT 
School of Computer Science, McGill University, Montreal, Canada 


We are concerned with a very special type of polygon in the Euclidean plane FE? 
referred to as a simple (also Jordan) polygon. For any integer n > 3, we define a 
polygon or n-gon in the Euclidean plane E” as the figure P =[x,,x,,...,X,] 
formed by n points x,,x,,...,x, in E” and n line segments [x,, x;,,], i= 
1,2,...,n — 1, and [x,, x,]. The points x, are called the vertices of the polygon 
and the line segments are termed its edges. 


DEFINITION. A polygon P is called a simple polygon provided the only points of 
the plane that belong to two edges of P are the vertices of P. A simple polygon has a 
well defined interior and exterior. We will follow the convention of including the 
interior of a polygon when referring to P. 


DEFINITION (Meisters [Me2]). A vertex x; of P is called a principal vertex 
provided the diagonal | x;_ ,, X; 4] intersects the boundary of P only at x,;_, and x; 4.4. 


DEFINITION (Meisters [Mel1]). A principal vertex x, of a simple polygon P is called 
an ear if the diagonal [x;_,, X;,,) that bridges x; lies entirely in .P. We say that two 
ears x; and x, are nonoverlapping if int[x;_,, x;,*;.,) N int[x,_,, x;,X;4,] = ©. 

The following Two-Ears Theorem was recently proved by Meisters [Mel]. 

THEOREM 1 (the Two-Ears Theorem, Meisters [Mel1]). Except for triangles every 
simple polygon P has at least two nonoverlapping ears. 
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Meisters’ proof by induction is both elegant and concise. However, given that a 
simple polygon can always be triangulated allows a one-sentence proof [O’R]. 
Leaves in the dual-tree of the triangulated polygon correspond to ears and every 
tree of two or more nodes must have at least two leaves. 

This theorem is quite applicable in many situations. For example it establishes 
that there exists a stepwise procedure for ‘“‘shrinking” a polygon P down to a 
triangle by deleting a vertex at each step, say x,, and inserting [x;_,, x;,,] in the 
place of [x,;_,, x;, x;,,] while ensuring that at each step we retain a simple polygon 
P'. In actual fact, of course, we need only a “one ear” theorem to carry out such a 
procedure. The method is evident: locate an ear in P and “‘cut it off,” then locate 
an ear in the remaining polygon of one less vertex and cut it off, and continue this 
process until the remaining polygon is a triangle. It is obvious that such a 
procedure could also be used as an algorithm for computing a triangulation of P. 
However care must be taken in converting this idea into an efficient algorithm. A 
straightforward approach of implementing this notion can result in a very slow 
algorithm. To determine if a vertex is or is not an ear may take O(n) steps and we 
may have to visit O(n) vertices to find and cut off an ear. Therefore, using a 
“brute force” approach we may have to perform O(n’) steps to cut off an ear and 
O(n>) steps to completely triangulate P in this manner, On the other hand, 
algorithms exist for triangulating simple polygons in time O(n log n) [GJPT] and 
O(n log log n) [TV]. Once a triangulation is obtained the dual-tree can be deter- 
mined in O(n) time. Finally, an O()-time tree-traversal can prune off one leaf 
from the dual-tree at each step resulting in the cutting off of one ear from P at 
each step. It remains one of the most outstanding problems in computational 
geometry to determine if an O(n) time algorithm exists for triangulating arbitrary 
simple polygons. 

One question that arises is whether the “inverse” of the previous procedure is 
possible, i.e., does there always exist a stepwise procedure for “inflating” a simple 
polygon P until it is “as fat-as-possible ” by deleting vertices from P one-at-a-time 
so that at each step we retain a simple polygon? We answer this question in the 
affirmative by proving that every non-convex polygon contains at least one mouth, 
but first we must define mouth and make more precise what we mean by 
“‘as-fat-as-possible.”’ 


DEFINITION. A principal vertex x, of a simple polygon P is called a mouth if the 
diagonal |x ;_,, X;4,] is an external diagonal, i.e., the interior of |x;_,, X;4,] lies in 
the exterior of P. 


The convex hull of a simple polygon P will be denoted by CH(P). The 
boundary (bd) of CH(P) is a convex polygon. We now have a precise definition of 
“‘as-fat-as-possible,” i.e., P is inflated until it becomes the convex hull of P. 


THEOREM 2 (the One-Mouth Theorem). Except for convex polygons every simple 
polygon P has at least one mouth. 


Proof. Construct the convex hull CH(P). Since P is non-convex there must exist 
edges on bd(CH(P)) that are not edges of P. Each such edge forms the “lid” of a 
“pocket” of CH(P). (Refer to Fic. 1.) We shall prove that in fact every such 
pocket yields a mouth. Let K;,,; denote the pocket of CH(P) determined by 
vertices x, and x, of P. Clearly K,, =[x,,x;,1,...,x,] U[x;, x;] forms itself a 
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Fic. 1. Illustrating the proof of the One-Mouth Theorem. 


simple polygon. By the Two-Ears Theorem K;,; must have two ears and since they 
are non-overlapping they cannot both occur at x; and x,;. Therefore, at least one 
ear must occur at x, for i < k <j. Obviously such an ear for K;, is a mouth 
for P. Q.E.D. 


While the above stepwise procedure for “inflating” a polygon P by “gobbling- 
up” mouths provides an algorithm for computing the convex hull of P this is not 
the best way to tackle this problem. Triangulating P does not appear to help here 
and a straightforward approach to “gobbling-up” mouths leads to an O(n?) time 
algorithm. On the other hand several O(m) time algorithms for computing the 
convex hull of a simple polygon are known [MA], [GY], [Tol]. 

It is possible for a polygon to have many ears and only one mouth (Fic. 2(a)) 
and also many mouths and only one ear (Fic. 2(b)). Note that care is needed when 
speaking of mouths and ears as well exposed vertices, i.e., vertices of P that are 
also vertices of CH(P). For example, Guggenheimer [Gu] states that a simple 
polygon has two principal vertices that are exposed. This is false and a counterex- 
ample due to Meisters [Me2] is illustrated in Fic. 2(c). This figure also illustrates 
that polygons exist which have precisely one mouth and two ears. In fact, these 
notions suggest some interesting families of simple polygons. Recall that no O(n) 
time algorithm exists for triangulating an arbitrary simple polygon. However 


Fic. 2. (a) A polygon with only one Fic. 2. (b) A polygon with only two 
mouth and many ears. ears and many mouths. 
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‘mouth. 


Fic. 2. (c) This polygon has precisely three principal vertices: two ears and one mouth and yet none of 
them are exposed. 


certain special classes of simple polygons such as star-shaped ones do admit O(n) 
time triangulation [To2]. We now define another such class of polygons. 


DEFINITION. A simple polygon P is called a one-mouth polygon provided it 
contains precisely one mouth. 


DEFINITION. A simple polygon P is called a two-ear polygon provided it contains 
no more than two ears. 


DEFINITION. A simple polygon P is called anthropomorphic provided it contains 
precisely two ears and one mouth. (See Fic. 2(c).) 


These three classes of polygons exhibit a good deal of structure as exemplified 
by the following theorem. 


THEOREM 3. The dual-tree of every triangulation of a two-ear polygon is a chain. 


Proof. Assume the polygon P has two ears and that the dual tree of some 
triangulation of P is not a chain. Then the tree must contain at least three leaves 
which is a contradiction. Q.E.D. 


Theorem 3 allows us to triangulate the interior of a two-ear polygon of n 
vertices in O(n) time as follows. Consider any vertex x; of P. It is an easy matter 
to find another vertex x, such that [x,, x,] is an internal diameter of P in O(n) 
time if indeed such a diagonal exists [Le]. Furthermore if such a diagonal does not 
exist then the diagonal [x;_,, x;,,] is guaranteed to exist [Le]. In either case this 
diagonal partitions the polygon P into two polygons P, and P, each of which can 
be triangulated in O(n) time starting at either [x,, x,] or [x;_,, x; ,]. It suffices to 
realize that each diagonal can be inserted with a constant number of local angle 
tests. 

A similar procedure can be used to triangulate the exterior of a one-mouth 
polygon. First we can use an O(n) time algorithm for finding the convex hull of P 
[Tol]. This will identify the two vertices x; and x, that form the “lid” of the pocket 
K,, of CH(P). One of the two ears of K;; must occur at either x; or x; and can 
then be identified in a constant number of steps (i.e., independent of n). Triangu- 
lation of K;,; can then proceed as in the case of the two-ear polygon. 

We have therefore established the following theorems. 
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THEOREM 4. A one-mouth polygon can be externally triangulated in O(n) time. 
THEOREM 5. A two-ear polygon can be internally triangulated in O(n) time. 


THEOREM 6. An anthropomorphic polygon can be completely triangulated in O(n) 
time. 


One additional computational problem that is of interest here concerns the 
recognition of these types of polygons. For example, whether a simple polygon is 
star-shaped or not can be determined in O(n) time [LP]. By testing every vertex of 
a simple polygon to determine whether it is an ear or a mouth we can recognize 
anthropomorphic polygons in O(n7) time. However, using a more clever procedure 
we can reduce this complexity to O(n) [ST]. 


Note added in proof: Since the writing of this note Bernard Chazelle at Prince- 
ton University has shown that arbitrary simple polygons can indeed be triangulated 
in linear time. 
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Inequalities for Areas Associated with Conics 


W. A. DAY 
Hertford College, Oxford 


Let (x1, y,),(%2, yz), (x3, y3) be three points on the parabola y* = 4ax. The 
area of the triangle formed by the points is 


1 
A = 5 eal ¥2 — y3) + X2(¥3 — ¥1) + ¥3(V1 — Ya) 
1 
= gg 2102 — y3) + y3(y3 —y,) + y3()1 — y2)| 


1 
= [01 ~ yada ~ Yas — Ya) I- 


The tangents to the parabola at the three points are the lines 
2a(x +x,) =y1y, 2a(x +x.) =yoy, 2a(x+x3) =y3y, 
and these intersect at the points 


(42 2 er | (“= | 
4a’ 2 "\ da’? 2 "\ da’ 2 


Thus, the area of the triangle formed by the tangents is 
1 
Al = 76g 217201 — Yo) + Y2¥3(¥2 — ¥3) + ¥3V(¥3 — ¥1) | 


1 
= 6g \O" — yz)(¥2 — y3)(¥3 — y,)| = sA. 


In summary: the area of the triangle formed by three tangents to a parabola is half 
that of the triangle formed by joining their points of contact. 

This conclusion has long been known, and is to be found, for example, in 
Salmon’s Conic Sections [1]. Having set the proof as an undergraduate exercise, I 
was led to ask about the relationships between A’ and A for the ellipse and the 
hyperbola. My purpose is to point out that the area of the triangle formed by three 
tangents to an ellipse is strictly greater than half that of the triangle formed by joining 
their points of contact, and that the area of the triangle formed by three tangents to a 
hyperbola is strictly less than half that of the triangle formed by joining their points of 
contact, provided that all three points of contact lie on the same branch of the 
hyperbola. (It is easily seen that the latter conclusion is generally false if the points 
of contact do not lie on the same branch). 

I shall establish these conclusions by proving somewhat more, namely, that for 
the ellipse 


A’ 


IV 


72 (1) 
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while for the hyperbola 


(2) 


where a and b are the lengths of the semi-major and semi-minor axes. 
The proof depends on two instances of an elementary inequality, which says 
that if p,q, 7r,Uu,U,w are any nonnegative numbers then 


i <(ptu)7(qt+v)7(r4+w)'”. (3) 


(pqr)'/* + (uvw)'7* 


To verify (3) we observe that 
3 3 3 
(( par)'”° + (uvw)'”°) = pqr + 3( p?q?r2uvw)” + 3( pqru?v2w?)'” + uw. 
However, the arithmetic-geometric mean inequality implies the inequalities 


3 
3( p?q?r2uvw) ’ < pqw + qru + rpv, 


3( pqru2v2w?)'” < puw + qwu + ruv. 


Hence, 
3 
((par)'”° + (uvw)’’*) < pgr + pqw + qru + rpv + pow + qwu + ruv + uvw 
=(ptu)(q+ov)(r+w), 


and (3) follows on taking the cube root of each side. 
To obtain the first of the required special cases, let a, B, y be any real numbers 
and let 


= COS’ a, q = cos’ B, r= cos’ y, 
u=sin?a, uv =sin?B, we=sin’y. 
Then (3) tells us that 
2 2 2 1/3 a) 2 a) 1/3 
(cos? a cos? B cos* y)’” + (sin? asin? Bsin* y)“ < 1. (4) 
On the other hand, if we take 
p=q=r=1, 
u = sinh’ a, v = sinh’ B, w = sinh’ y, 
we find the inequality 
4d 2 co 2 \173 5 5 > \1/3 
1 + (sinh? @ sinh? B sinh? y)’~ < (cosh? a cosh? B cosh? y) “”, (5) 


which is the second of the special cases. 
We are now in position to verify (1). To this end we adopt the parametric 
description 


x =acost, y = bsint, (t € [0,27r)) 


of the ellipse. The area of the triangle formed by the points corresponding to the 
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values ¢,,¢,,¢, of the parameter ¢ proves to be 


1 
A= 7 ab sin(t, ~ t,) + sin(t, — t,) + sin(t, — ¢,)| 


1 1 
ab| sin ht — t,)cos maae — t) 


(6) 


1 1 
— sin 5 (hs — t,)cos 5h + ty — 2t;) 


= 2ab 


1 1 1 
sin 5 (hs — t,)sin 5 (he — t,)sin 5 (ts — t,) 


The tangents to the ellipse at the points corresponding to the values 1, t,, t, 
are the lines 


xcost, ysint, 
+ 
a b 


and so forth, and the area of the triangle formed by these lines proves to be 


ab|sin(t, — t,) + sin(t, — t,) + sin(t, — mle 


a 


2 


1 1 1 
8ab| sin Pace — t,)sin 5 (fe — t,)sin 5 (es — t,) 


= ab 


? 


1 1 1 
tan 5h — t,)tan 5 (he — t,)tan 5 (ts —t,) 


and, hence, 


1 1 1 1 
A' = —Alsec 5 (hs — t,)sec 5 (he — t,)sec 5 (ts — t,)}. (7) 


2 
On combining (6) and (7) with the inequality (4), we see that 


A 2/3 A 2/3 
~) 4 {a 1 
ea bed, <* 


and this is equivalent to (1). 
To verify (2) we adopt the parametric description 


x =acosht, y = bsinht, (t € R) 


of a branch of the hyperbola. In this case, the area of the triangle formed by the 
points corresponding to the values t¢,, t,,t, of the parameter ¢ turns out to be 


1 1 1 
A = 2ab|sinh = (t, — t,)sinh >(t, ~ ¢)sinh >(t3 - t1)}, 


1991] NOTES 


while the counterpart to (7) turns out to be 


A! - 
a) 


On appealing to (5), we obtain the inequality 


A 2/3 A 2/3 
14+ {— —_ 
| < fra , 


1 1 1 
sech 5 (fs — t,)sech 5 (te — t;)sech 5 (es 


which is equivalent to (2). 
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1. Introduction. Augustin-Louis Cauchy would be pleased. Each year we intro- 
duce our elementary analysis students to the notion of the derivative essentially as 
he gave it to us in 1823 (for details see, e.g. [4, 7]). But there is another, less well 
known, characterization of the derivative which appears in the last textbook [2] 
written by Constantin Carathéodory (1873-1950). This formulation is not only 
elegant but useful on both theoretical and pedagogical grounds. The proofs of 
many important theorems concerning differentiability become significantly easier 
and, in the process, some of the less enlightening details are rightly submerged. 

Carathéodory’s formulation also gives us a much clearer view of the fact that 
continuity is essential for differentiability; indeed, the definition itself contains the 
necessary continuity. This formulation, which shifts the details from the theory of 
limits to the theory of continuous functions, requires that our students develop a 
clearer understanding of continuity than is typical and it demands that we 
reevaluate this understanding and continually reinforce it. 

We believe that Carathéodory’s insight deserves to be better known and we 
hope that the present article will help in that effort. We also believe, however, that 
since it is somewhat more subtle and less practical from a computational point of 
view than is the standard definition, Carathéodory’s approach should be neither 
the first nor the only one that students see. But we also believe that all elementary 
analysis students can appreciate its power and should be exposed to it; in fact even 
the more perceptive first-year calculus students have something to gain from 
working with it. 


2. Carathéodory’s derivative formulation. After the usual definitions and theo- 
rems about limits and continuity are presented in a standard elementary real 
analysis course, the definition of the derivative, essentially as given to us by 
Cauchy, is given for functions of a single variable. Typically it is presented in both 
the following forms: 


Definition. The value of the derivative of the function f at a is the number 


f(a +h) — f(a) 
‘(a) = lim ——__ , 
f(a) = jim 7 
if the limit exists; equivalently, 
f(x) - f(a) 
'‘(a) = lim —————_- , 
P( ) x—-a Xx—a@ 

if the limit exists. 

Carathéodory realized the importance of the fact that the criterion for continu- 
ity is apparent in the second definition above and he proposed the following 


definition for the derivative at a point; see [2, p. 119] or [8, p. 58]. 
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CARATHEODORY’S DEFINITION OF THE DERIVATIVE. The function f, defined on 
the open interval U, is said to be differentiable at the point a € u if there exists a 
function @, that is continuous at x = a and satisfies the relation f(x) — f(a) = 
db (xx — a) for all x € U. 


We will usually write (x) instead of ¢,(x), since there seems to be little 
chance of confusion, but we must remember that the function @ depends on the 
point a. Geometrically, of course, when x # a, ¢,(x) is the slope of the secant line 
through the points (x, f(x)) and (a, f(a)). This alternative definition emphasizes 
the fact that the slopes of the secant lines, by way of which we initially arrive at the 
tangent line, approach the tangent line in a continuous manner; we rarely state this 
important fact explicitly, but we should. 

Clearly all of the subtlety of the limit in the standard definition of the derivative 
is now crouching just below the surface, hiding in the subtlety of the continuity of 
the function @. 

This definition can be used as is for complex-valued functions of a complex 
variable; in fact, it is in precisely this context that Carathéodory introduced it in 
[2]. It is interesting to note, however, that this formulation does not appear in 
Carathéodory’s 1918 text [3] on real-valued functions. 

As pointed out in [2] and [8], two immediate consequences of this formulation 
are: 

e If f has a derivative at a, then f is continuous at a. 
¢ There is at most one function @ satisfying the definition; so if f’(a) exists, 
f(a) = d(a). 

These two corollaries of the definition are both expected and their proofs are 
trivial, assuming only elementary results about continuous functions. The power of 
Carathéodory’s approach does not become apparent until we apply it to prove 
some deeper theorems. In the following sections we offer proofs of several 
standard, basic theorems which we hope will demonstrate the value of this 
formulation. 


3. An indicative example: the chain rule. Shortly after the introduction of his 
definition of the derivative, Cauchy used it to prove the chain rule (though his 
proof was incomplete). It seems appropriate then that we do the same with 
Carathéodory’s definition. “How incisive Carathéodory’s formulation is may be 
seen by applying it to the proof of the chain rule for the derivative of a composite 
function” [8, p. 59]. Many of our students will appreciate the pithy elegance of this 
proof, which is slightly revised from [8]. 


THEOREM 1. CHAIN RULE. [If f is differentiable at the point a and g is differen- 
tiable at the point b = f(a), then h=g ef is differentiable at a and h(a) = 
g'( fla)) f(a). 


Proof. Since f is differentiable at a there is a function ¢@, continuous at a and 
defined in an open interval V containing a, with f(x) — f(a) = f(x) -—b= 
o(x x — a) for all x © V. Similarly we get a function #%, continuous at b and 
defined in an open interval U containing a, for which g(x) — g(b) = W(x )(x — D) 
for all x € U. 


42 STEPHEN KUHN [January 


Then 
h(x) — h(a) = 8(f(x)) - s(f(4@)) = oF) F(x) - 5] 
= [be f(x)]o(x)(4 - a), 
for all x € V with f(x) € U. Since (Wf) is continuous at a and has the value 
2'(f(a))f'(a) there, the proof is complete. Oo 


4. Additional examples. Carathéodory’s characterization also leads to some 
sharp, concise proofs of other important theorems and we hope that the examples 
below will demonstrate the merit of Carathéodory’s approach. 


THEOREM 2. INVERSE FUNCTION THEOREM. Let f be continuous and Strictly 
monotonic on an open interval I containing c and assume that f'(c) exists and is 
nonzero. Then g =f ' is differentiable at d = f(c) and g'(d) = ['f(c)\"'. 

Proof. (Our proof here is from [8, p. 61].) We have f(x) — f(c) = d(xMx — d) 
for all x in J with ¢ continuous and ¢(x) # 0 for all x in J. Let V be the open 
interval which is the domain of g. Then y — d = f(g(y)) — d = f(g(y)) — flc) = 
élg(y)Ilg(y) — c] for all y in V; thus [g(y) — c] = A/dlg(y) (iy — d) for all y 
in V. Since g is continuous on V, 1/(¢° g) is continuous there as well, and the 
theorem follows. O 


If Carathéodory’s alternative were powerful enough to prove theorems such as 
the two above, but too powerful or cumbersome to handle the earliest calculational 
results of the subject, we would be less likely to introduce it. In fact, it also 
manages these earlier theorems with ease and even some grace. 


THEOREM 3. LINEARITY AND THE PRODUCT AND POWER RULES. If f and g are 
differentiable at a, k is a constant, and n is a natural number, then: 
i) [kf + g\(a) = kf'(a) + g(a). 
ii) (fg)'(a) = fla)ge'(a) + g(a)f'(a). 


iii) the derivative of f(x) = x" at a is f'(a) = na”. 


Proof. We assume that there exists ¢, #, continuous at a and defined on open 
intervals U and V, respectively, containing a, such that f(x) — f(a) = d(x)\(x — a) 
for all x € U and g(x) — g(a) = W(x (x — a) for all x EV. 

i) Then 


[kf + g](x) — [kf + gl(a) =k[ f(x) - f(a)] + [s(%) - 8(4)] 
= kob(x)(x — a) + p(x)(x — a) 
= [kd(x) + u(x)|(x-a), forallx EeUNV. 


ii) For all x EU NV, 
(fe)(x) — (fe)(a) =f(x)a(x) — f(a) a(a) 
= f(x)g(x) — f(x)ga(a) + f(x) 8(a) — f(a) 8(a) 
= f(x)[g(x) — g(a)] + [f(x) - f(a] e(a) 
= f(x)p(x)(x — a) + b(x)(x - a) g(a) 
= [f(x) v(x) + o(%)8(a)](% - a). 
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iii) Since x” — a? =[x"~!4+x%"%-2a + x" 3a? + +++ +xa"~? + a” x - a) 
for all x, we have o(x) =x"7!+x"-*a +x" 3a? + +++ +xa"~? + a"! and 
hence $(a) = na”~?. Oo 


Finally, we easily derive another basic but very useful result. 


THEOREM 4. CRITICAL POINT THEOREM. Let f be defined on an open interval I 
containing a. If f(a) is an extreme value then a is a critical point (i.e., f(a) = 0 or 
f(a) does not exist ). 


Proof. We only prove the theorem for f(a) a maximum and we do this by 
assuming f‘(a) exists and proving that f’(a) = 0. 

For some open subinterval U of J we have a function @, continuous at a, such 
that f(x) — fla) = d(xXx -— a) for all x © U. Let c GU, c <a. Then 
f(c) — f(a) = d(cXc — a), and since f(a) is a maximum in U, f(c) < f(a). So 
f(c) — f(a) < 0 and c — a < 0; hence ¢(c) > 0. Similarly, for d € U, d > a, we 
have f(d) — f(a) < 0 and so d(d) < 0. 

Since o(c) > 0 for all c € U, c < a, and ¢(d) < 0 for all d € U, d> a, the 
continuity of ¢ at a forces us to conclude that ¢(a) = 0. O 


It is surprising that so few elementary analysis texts even mention this alternate 
view of the derivative. Of ten undergraduate /graduate analysis texts we examined 
only Rosenlicht [12], makes use of it, although he does not mention Carathéodory 
by name; he imbeds it in his proof of the chain rule for one variable and brings it 
out in the open for partial differentiation. 

There are several analysis texts, including Protter and Morrey [10], which offer 
f'(a) + r(x; a), where r(x; a) > 0 as x > a, instead of ¢,(x), and use it to good 
advantage in the proofs of several theorems. This approach has the intended 
additional benefit of making transparent the linear approximation of the tangent 
line. 

The simplification of additional proofs using this definition and the extension of 
it to functions of several variables would make interesting projects for enterprising 
analysis students. 


5. Carathéodory’s approach in first-year calculus? It is clear that 
Carathéodory’s formulation demands too much of most first-year calculus students 
to make its presentation worthwhile. Nevertheless, it can be of value, for example, 
for students in honors calculus courses. The chain rule will serve as a good 
representative example of the relative improvements in such courses afforded by 
the proofs given here. 

Many current standard calculus texts ({11] is typical in this regard) relegate the 
more delicate details of the proof to an appendix, which will be read by few 
students. At the extreme, [5] puts the entire proof in the appendix. At the other 
extreme, [9] gives a 14-page “simpler proof that is valid for many functions” and 
then another 12-page complete proof (both proofs are within the same section of 
the text!). Neither of these routes is necessary, however. Using Carathéodory’s 
definition we get a very clean, understandable proof of the chain rule which can be 
presented to perceptive first-year students in its entirety without interrupting the 
flow of the text. 

Of 18 standard calculus texts we examined, only one, Gillett [6], refers to 
Carathéodory’s formulation of the derivative at all (exercise 66, p. 127), but no 
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mention is made of Carathéodory himself. There are a few calculus texts, including 
Boyce and DiPrima [1], which present the variation mentioned at the end of the 
previous section. 

(Note: Young [13] contains interesting biographical information on 
Carathéodory, from the viewpoint of one of his students and strong admirers.) 


Acknowledgement. Thanks to the referee for several helpful suggestions. 
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Positive Definite Matrices and Sylvester’s Criterion 
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Sylvester’s criterion states that a symmetric (more generally, Hermitian) matrix 
iS positive definite if and only if its principal minors are all positive. The theorem is 
especially useful when employing the Second Derivative Test for local extrema of 
functions of several variables. When the proof of the sufficiency of positive 
principal minors is included in a linear algebra text, it is usually based on Gaussian 
elimination [4, pp. 331-332] or on the reduction theory for quadratic forms [2, pp. 
328-329]. In either case, the heart of such proofs is a matrix computation. This 
note presents a proof which, while still very dependent on matrices, makes more 
use of ideas from the theory of vector spaces. In a one semester course in linear 
algebra, one often has the feeling of developing the machinery of abstract vector 
spaces without ever really making use of it. The proof here is short, requires only 
concepts that are standard in beginning linear algebra, and provides an opportu- 
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nity for the student to see the abstract theory applied to a concrete problem. A less 
direct proof in a similar spirit is given in [3]. 

We begin by stating the prerequisite theorems and definitions. The first theo- 
rem we shall need is the following: 


THEOREM 1. Let W, and W, be finite-dimensional subspaces of a vector space V. 
Then 


dim(W, + W,) + dim(W, N W,) = dim W, + dim W,. 


Proof. One simply extends a basis for W, N W, to a basis for W, and to a basis 
for W,. The bases combine to form a basis for W, + W,. See [2, p. 46] or [4, pp. 
199-200] for details. 


Since the determinants of similar matrices are equal, we also have 


PROPOSITION. The determinant of a diagonalizable matrix is the product of its 
eigenvalues. 


The next theorem provides one of many opportunities for nontrivial inductive 
proofs in a linear algebra course. 


THEOREM 2 (Spectral Theorem). Let A be an n Xn real, symmetric matrix. 
Then all eigenvalues of A are real and there exists an orthonormal basis of }” 
consisting of eigenvectors of A. 


Proof. See [2, pp. 312-314] or [4, pp. 295-296] for a proof. 


Having stated the background we require, we proceed to positive definite 
matrices. 


DEFINITION. A real, symmetric matrix is positive definite if for every nonzero 
vector in R”, v‘Av > 0. 


The proof of the next theorem is a routine exercise. 


THEOREM 3. A real, symmetric matrix A is positive definite if and only if all its 
eigenvalues are positive. 


Once we define principal minors, we turn to the proof of Sylvester’s criterion. 


DEFINITION. Let A be an n Xn matrix. For 1<k <n, the kth principal 
submatrix of A is the k X k submatrix formed from the first k rows and first k 
columns of A. Its determinant is the kth principal minor. 


THEOREM (Sylvester’s Criterion). A real, symmetric matrix is positive definite if 
and only if all its principal minors are positive. 


It is easy to prove that positive principal minors are necessary by showing that 
each kth principal submatrix is positive definite, hence has positive determinant by 
Theorem 3 and the Proposition. Our proof that positive principal minors implies a 
matrix is positive definite proceeds through two lemmas, instructive in their own 
rights. At each stage we use vector space methods, although, of course, parallel 
matrix proofs are possible. 
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Lemma 1. Let v,,...,v, be a basis of a vector space V. Suppose W is a 
k-dimensional subspace of V. If m < k, there exists a nonzero vector in W which is a 
linear combination of V,,4.45-++5V 


2 "n° 


Proof. Since dimW + (n -m)=k+n—m>n, W has nontrivial intersec- 
tion with the span of v,,,,,...,Vv, by Theorem 1. 


The difference between our proof and others lies mainly in the next lemma, 
which is nearly a special case of the Courant-Fischer “min-max theorem”’’ {3, 
p. 179]. 


LEMMA 2. Let A be an n Xn real, symmetric matric. If w'Aw> 0 for all 
nonzero vectors w in a k-dimensional subspace W of RR”, then A has at least k 
positive eigenvalues (counting multiplicity ). 


Proof. Let v,,...,v, be an orthonormal basis for R” consisting of eigenvectors 
for A. Let A,,...,A, be the corresponding eigenvalues. Suppose further that the 
first m eigenvalues are positive and the rest are not. If m < k, then Lemma 1 
implies there exists a nonzero vector w in W which may be written 

w= Cm+Vm+1 tos +C,V,- 


We have 
n 
t _ t , — 2 Lee 2 
WAw= Do Cg AW Ve = Ona Amar tt +CZA,, 
J,k=m+1 
since v,,...,V, is an orthonormal set. But then w’Aw < 0, a contradiction. Thus 
m > k, as desired. 


We complete the proof of sufficiency in Sylvester’s criterion by induction. For 
n = 1, the result is trivial. Assume the sufficiency of positive principal minors for 
(n — 1) X (n — 1) real, symmetric matrices. If A is an n Xn real, symmetric 
matrix with positive principal minors, then, by the inductive hypothesis, its (7 — 1)st 
principal submatrix is positive definite. Let W be the (m — 1)-dimensional sub- 
space of IR” consisting of those vectors whose last coordinate is zero. Then for any 
nonzero vector w in W, w‘Aw > 0. Lemma 2 now implies that A has at least n — 1 
positive eigenvalues (counting multiplicity). We apply the Proposition above and 
the fact that det A > 0 to conclude that A has n positive eigenvalues. This 
completes the proof of Sylvester’s criterion. 


Acknowledgements. The author wishes to thank the referee for suggesting several improvements 
and for bringing [3] to his attention. 
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The Gibbs Phenomenon for Piecewise-Linear Approximation 


J. FOSTER AND F. B. RICHARDS 
Department of Mathematics, Weber State College, Ogden, UT 84408 


In 1899 J. W. Gibbs [1] of Yale, in response to a letter in Nature by the 
American physicist A. Michelson [2], presented a result about Fourier series that 
now goes by the name of the Gibbs phenomenon. Michelson had complained 
about an undesired “overshoot” effect that occurred whenever he approximated a 
function having a jump discontinuity by a finite Fourier series. Gibbs was able to 
show that this overshoot does not disappear as the number of terms in the series 
becomes arbitrarily large. Rather, on each side of the discontinuity, it approaches a 
constant g times one-half the size of the jump, where 


g = (2/r) | 


0 Xx 


a sin x 
dx — 1 = 0.17897974 


independent of the function being approximated. 


overshoot ¢ 


Fic. 1. Approximation of a square wave by a 30-term Fourier series. 


This phenomenon is familiar to every student of differential equations or 
Fourier series, but textbooks often give the impression that it is a special quirk of 
Fourier series. The point we wish to make in this note is that, on the contrary, the 
Gibbs phenomenon has little to do with the use of trigonometric polynomials as 
approximating functions but is primarily a consequence of the fact that Fourier 
series are best least squares approximations; and that we should expect to see an 
effect of this nature whenever we employ least squares approximations to a 
jump-discontinuous function, even if the approximating functions are not trigono- 
metric polynomials. 

To illustrate, we will prove an analog of the Gibbs phenomenon when the 
approximating functions are piecewise-linear. The computations are straightfor- 
ward and would make a good classroom exercise or project. 
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Fic. 2. Best least squares approximation of a square wave by a piecewise-linear function with 30 
equally spaced nodes per period. 


Define 


1, 0<x<l 
f(x) ={ 7 -1<x<0 
and extend f by periodicity to the entire real line. The value of f(x) when x is 
integral is immaterial. We will find the unique periodic continuous function y 
which is linear on each interval [k/n,(k + 1)/n], n a fixed positive integer, and 
which best approximates f in the L, norm on [—1,1]. The numbers k/n are 
called the nodes of y”. 
Under the notation y, = y“(k/n) it should be evident from symmetry that 
y_, = —y, and y_, =yo = y, = 9. So it will suffice to find numbers yo, y,,..., y, 
that minimize 


[O@@) _ 1)” dx. 
0 
It is easy to verify that for k/n <x <(k + 1), 


yP(x) = (ne — k) (vi — Ved) + Yee 
If we let w, = y, — 1 we then have 


yx) — 1 = (nx — k) (Weg — We) + Wy 


and 
(k+1)/n n 2 
[EPO (yx) — AY de = (wher + wearte + w2)/3n 
Then 
1 2 n—l1 n—2 
['(v@@) - IF de = (2D we +L wegaey +2 wy — wea] 730. 
° k=1 k=1 


By setting the partial derivatives with respect to w,,...,w,_, equal to zero we 
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obtain the normal equations 
4w,+w,=1 
w,_, + 4w, + Wr4, = 90 (2<k <n-2) 
W,-2 + 4w,_, = 1. 


4) 

The general equation in this system is a second-order linear homogeneous 
difference equation whose general solution is w, = c,rf{ + crx where r, = 
—2+ ¥3 and r, = —2 — y3 are the roots of the characteristic equation r? + 
4r + 1 = 0. The extreme equations enable us to solve for c, and c,: 


= (3 -1/Ct - 7) 
cy = (rt — 1I)/(3 - rf). 
The solution of the system is then 
WwW, = —(rk + r2-ky 71 + rv). 


Since 0 < —r, < 1, it is easy to see that the sequence 
(-1)*'w, = (=r) "(rt + 2/0 + 7) 
is positive for 1 < k <n/2. It is also decreasing, because 
(= 1)" we = ($1) Wea = (or) (rt (rt + I)/(1 + rf) > 0 


for 1<k<n/2. Hence the maximum “overshoot” (that is, the deviation 
of y (x) from f(x)) occurs when k=1 and its value is w, = w{? = 
—(r, + r?~-')/. + r?). To see that this overshoot doesn’t disappear as n gets 
large, we compute 

lim w) = -r, =2—- 73. 

n- 
If we define the Gibbs constant to be the overshoot divided by one-half the 
magnitude of the jump, then we have shown that the Gibbs constant for approxi- 
mation by piecewise-linear functions is 2 — ¥3 = 0.26794919, somewhat larger 
than the classical Gibbs constant. 


Note 1. We leave it to the reader to verify that the Gibbs constant is indepen- 
dent of the function f. 


Note 2. Continuous piecewise-linear functions are referred to in approximation 
theory as splines of degree 1. The second author of this paper has shown that a 
Gibbs phenomenon holds for splines of higher degree [3]. 
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Teaching and Testing Mathematics Reading 


CARL C, COWEN 
Department of Mathematics, Purdue University, West Lafayette, IN 47907 


When we discuss teaching undergraduate mathematics with our colleagues, we 
frequently distinguish two sorts of goals: first, we want students to be able to do 
computations related to the course and, second, we want students to know the 
theory from the course. In a calculus course, we want students to be able to 
integrate polynomials, so in class we tell them how, and on the exam; we give them 
polynomials to integrate. In linear algebra class, we want the students to know the 
theorems concerning determinants, so we prove the theorems from the book, and 
on the exam, we ask them to prove that if A is an invertible matrix, then 
det(A~') = 1/det(A). Our hallway conversations elicit general agreement that 
most of our students do learn to do the calculations we put on the exam and most 
don’t master the theory to the point that they can write a coherent proof of any but 
the most trivial theorems. 

I want to argue in this note that there is another goal, between these in 
accessibility, that should be a conscious part of our teaching effort: 


We should teach our students to read and understand mathematics . 


And to reinforce our effort, we must test their ability to do so. Very few of our 
students will ever, after leaving our courses, have to give a formal proof of a 
theorem and few will have to do something so mundane as “Find the definite 
integral.”” On the other hand, many of them will have to read and understand 
mathematical writing to apply new ideas to the problems of their jobs. Moreover, 
learning to read and understand mathematics will make the further step of 
learning to prove theorems more possible and will have benefits for every course 
that follows. We need to convince our students that, in the long run, understanding 
what the theorems mean will be easier and have a bigger payoff than simply 
memorizing their statements. I do not believe that our current goals should be 
dropped; they are an important part of every mathematics course. On the other 
hand, I do not believe that learning to read can be left simply to chance; that’s 
what we’ve been doing and it hasn’t worked! 

If you need evidence that we have a problem, let one of your “B” students 
explain the statement of a theorem from a homework exercise that they have not 
been able to prove or have them explain the statement and proof of a theorem 
from a section in the book that you have skipped. My students, at least, do not 
have the innate ability to read and understand what they have read. When | ask 
them to read a problem and explain it to me, the majority simply recite the same 
words back again. 

To help them learn to read, we must give them opportunities to practice. In 
class, it is helpful to discuss the meaning of a new theorem before proving it. It is 
especially important to discuss hypotheses. Many students have not learned what 
role the hypothesis of a theorem plays. You might give several examples of objects 
that could satisfy the hypotheses of a theorem and get your class to tell you which 
actually do. Asking them to restate the conclusion in a particular instance is 
frequently a helpful way to deal with the conclusion. Drawing out corollaries that 
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have not yet been stated is another way to deal with the conclusion. For example, 
in an advanced linear algebra course, | prove the following theorem: 


THEOREM. Jf A is an n by n matrix and yw is a number such that 
|All < Tu 
then wl — A is invertible. 


Comment: This theorem is true for any submultiplicative norm on matrices; the 
norm used below is the one-norm: 


n 
|| All = max|  Ia,,|: j = 1,. an} 
i=1 

After writing the statement of the theorem on the blackboard, we can have the 
following (idealized) discussion: 


Instructor: What is the one-norm of the matrix 


_ 7 — 2 9 
A=(% wal 
Student: ||A|| = 1.6 


Instructor: Good! What does this theorem say about the matrix 


: 13 2 
2I1-A= [13 é)? 

Student: The theorem says it’s invertible. 

Instructor: That’s right! The yu in this case is 2 and since ||A|| = 1.6 < 2 = [pl, 
the hypothesis of the theorem is satisfied and the theorem says that 2] — A is 
invertible. In fact, the determinant of 2] — A is .88 #0 and we can find the 
inverse by direct computation. 

Instructor: What does this theorem say about the matrix 


3 2 
/ A=(2, ?,) 

Student 1: The theorem says it’s not invertible. 

Instructor: Is that correct? 

Student 2: No, the theorem doesn’t say anything about this matrix. 

Instructor: Student 2 is right! The pw in this case is 1 and since ||A|| = 1.6 > 1 
= |u|, the hypothesis of the theorem is not satisfied and we cannot tell from this 
theorem if the matrix J — A is invertible or not. In fact, the determinant of J — A 
is — .Q2 so it is invertible. Similarly, for uw = 1.2, the hypothesis of the theorem is 
not satisfied and we cannot tell from this theorem if the matrix 1.2] —A is 
invertible or not. In this case, we find that the determinant of 1.2] — A is 0, so the 
matrix 1.27 — A isn’t invertible. 

Instructor: Now, who can remind us of the definition of eigenvalue and tell 
some things we know about eigenvalues? 

Student: The number A is an eigenvalue of the matrix A if there is a nonzero 
vector v for which Av = Av. We know that A is an eigenvalue if and only if 
Al — A is singular. 

Instructor: What does this theorem say about the eigenvalues of A? 

Student: If the number A satisfies |A| > || All, then it is not an eigenvalue of A. 
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Typically, I would then formally state the corollary just given by the student and 
prove the theorem (by adding up the geometric series) and the corollary. Classes 
need varying amounts of coaxing and help in such a discussion but they require less 
coaxing after they are accustomed to the style, especially if the instructor refuses to 
answer the questions for the class! 

A second opportunity for practice is in assigning homework over material not 
covered in class. Most textbooks include much more material than can be covered 
in a typical course and much of the material that must be skipped is interesting 
and valuable. Asking the students to read a section of the book and do some 
related homework is a reasonable way for them to get practice in reading for 
understanding. Such material needs to be carefully chosen for this purpose to be 
sufficiently theoretical to be challenging yet down to earth enough that students 
can be successful. For example, in some real analysis courses, it might be reason- 
able to have students read a section in the text defining monotone function and 
culminating with the proof of the theorem that a monotone function on a finite 
interval has at most countably many discontinuities. Students could then be asked 
to prove that the sum of two increasing functions is increasing, give an example of 
two monotone functions whose sum is not monotone, and perhaps prove that a 
monotone function on the whole line has at most countably many discontinuities. 

Finally, if we really expect students to take learning to read seriously, then we 
must put it on our tests. Whatever other qualities our students have, they have 
learned to play the student game well. For their part, the students learn to pass 
our tests and, for our part, we faculty have capitulated to that hated question 
“Professor, will this be on the test?” Now we only expect them to learn what we 
are putting on the test! We can use this to our advantage if we test them on their 
reading ability. 

Clearly, on an hour test, we cannot give the students a book chapter to read and 
describe. To test reading mathematics that involves the ideas of the course, I have 
frequently included on an exam a new (to the students) theorem, together with its 
proof, followed by a question that (I hope) can be answered only by those who 
read and understood the theorem and proof. The theorems need not be signifi- 
cant, although it’s nicer if they are. Moreover, since we want it to be possible to 
answer the question only by understanding the proof, the conclusion may not be 
stated in the most elegant or cleanest way. Here are two examples. 

The first example is from a course for mathematics education majors. The aims 
of the course are to develop some understanding of theorems, proofs, and 
mathematical statements in general. The topics include material on sets, number 
theory, real numbers, and polynomial equations. This question comes from the 
first test of the semester which follows the material on number theory; the course 
does not usually include any discussion of Pythagorean triples, solutions of Dio- 
phantine equations, or writing integers as sums of squares. 


THEOREM. If m and n are positive integers such that 
2m =n? + 1, 
then m is the sum of the squares of two integers. 


Proof. Since n? + 1 is even, n” and therefore n must be odd. That is, there is 
an integer k such that n = 2k + 1. This means that 


2m =n? 4+1= (2k +1)? 4+1= (4k2 4+ 4k 41) 4+1= 4k? 4+ 4442 
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It follows that 
m= 2k? +2k+1=k? 4+ (k?74+2k 41) =k? + (k +1)’ 
which expresses m as the sum of the squares of two integers. a 


Problem: 
2 - 3785 = 7570 = 877 +1 
Write 3785 as the sum of the squares of two integers. 


Of course, if we were really going to state and prove this ‘“‘theorem”’’ in a book, 
we would probably spell out how m is written as the sum of two squares, but to do 
so in this context would obviate the need to understand the proof. What I expect 
from this question is that students will see from the proof that for k = (nm — 1)/2, 
we have m = k? + (k + 1)’. To write 3785 as the sum of two squares, I expect 
them to see that in the notation of the theorem, n = 87, so k = 43 and by the 
proof of the theorem 3785 = 437 + 447. If students perform this calculation, I 
infer that they read and understood the proof well enough to do so. 

The second example comes from a linear algebra course for engineers. At the 
time of the test, the students had studied the spectral mapping theorem for 
polynomials which implies that the positive integer powers of a diagonalizable 
matrix are similar to the powers of the diagonal matrix, but had not seen the 
theorem on square roots of positive definite matrices. 


THEOREM. [f A is a diagonalizable matrix all of whose eigenvalues are nonnega- 
tive, then there is a matrix B with nonnegative eigenvalues such that B* = A. 


Proof. Since A is diagonalizable, there is an invertible matrix S such that 
L = §~'AS is diagonal. The diagonal entries A,,A,,...,A, of L (which are the 
eigenvalues of A) are nonnegative by hypothesis. Let uw j= prj be the nonnega- 
tive square roots of the eigenvalues and let M be the diagonal matrix with 
diagonal entries w,, “5,...,,- Clearly, M* = L. Since similar matrices have the 
same eigenvalues, the matrix B = SMS~' has eigenvalues w,, »,...,,, Which 
are nonnegative. Moreover, 


B? = (SMS~')° = SMS~'SMS-! = SM?S-!=SLS-!=A. @ 


Problem: The matrix A = (1° 2) has eigenvalues 1 and 4. Find a matrix S as 
above and use it to find a matrix B with positive eigenvalues such that B? = A. 


In this question, I expect the students to understand from the proof of the 
theorem that the problem is to be solved by diagonalizing A and solving the 
corresponding problem for the diagonal matrix. In particular, I expect them to use 
the given eigenvalues of A to find a basis of eigenvectors, and thereby to construct 
S, L, M, and B. 

I have not done a comparative analysis of my classes to see if trying to teach 
reading makes a difference in how much students learn, but ] am convinced it 
helps. At the very least, if we communicate to the students that we think it is 
reasonable that they read mathematics and understand it and that we expect it of 
them, then they may come to expect it of themselves. We will all be better off if the 
students view the textbook as a source of information, not just as a list of exercises 
interspersed with messages for the instructor. 


I'd like to thank Dick Hunt, J. J. Price, and Bob Zink for their suggestions and support and the 
referee for suggestions concerning this note. 
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bisector of the angle BAC, F is the foot of the perpendicular from A to BC, and 
AG is perpendicular to AE (i.e., AG bisects one of the exterior angles at A). 
Prove that AB - AC = DF : EG. 


E 3418. Proposed by E. T. Parker, University of Illinois at Urbana-Champaign. 


For k = 1,2,...,9 let S, be the set of positive integers n such that the number 
of digits in the decimal expansion of n is a multiple of 10 and such that each of the 
digits 1,..., A occurs in exactly one-tenth of the places. For which values of k does 


yn 


nes, 


converge? 


E 3419. Proposed by Marcin E. Kuczma, University of Warsaw, Poland. 


Let F(n) be the number of ” by n matrices with entries 0 or 1 and row and 
column sums equal to 2. Let f(n) = n'/7F(n)(n!)~*. Prove that lim, _,.,,.f(m) exists 
and has a value between 0 and 1. 


E 3420. Proposed by S. G. Merzlyakov, Mathematical Institute, Ural Branch of 
the Academy of the USSR. 


Find all functions f holomorphic on the square S = {x + iy: -1 <x <1, 
—1<y < 1} in the complex plane for which there exist real-valued functions a 
and B on (—1, 1) satisfying 


| f(x + iy)| = a(x) + BCy) 
throughout S. 


SOLUTIONS OF ELEMENTARY PROBLEMS 


Inverting a Transformation by Equilateral Triangles 


E 3257 [1988, 259]. Proposed by I. A. Sakmar, Istanbul, Turkey. 


Let P,Q, R be the new vertices of equilateral triangles constructed outwardly 
on the edges of a given triangle ABC. (Cf. H. S. M. Coxeter, Introduction to 
Geometry, New York, 1961, p. 22.) 

(a) Show that any triangle POR which can be obtained in this way arises from a 
unique triangle ABC, and give a construction for recovering triangle ABC from 
triangle POR. 

(b) Show that not every triangle POR can be so obtained. 


Solution by J. G. Mauldon, Amherst College, Amherst, Massachusetts. (a) Let 
U,V,W be the new vertices of equilateral triangles constructed inwardly on the 
edges of the triangle POR. Then A, B,C are necessarily the midpoints D, E, F of 
the sides of the triangle UVW. 
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(b) The triangle APOR with sides of length p,q,r can be so obtained if and 
only if 10sin P sin Q sin R > 3 (sin? P + sin? Q + sin? R) or, equivalently, if and 
only if 11(p? + q* + r*)? > 25(p* + q* + r*). In particular, if {p, g, r} = {2, 4, 5} 
or {4,5, 8} the triangle APOR is unobtainable and an isosceles APQR with base 
angle a is obtainable if and only if 1 < v3 tana < 9. 


Proof of (a). We take the plane of discourse to be the complex plane C and we 
write w = e7'"/3 so that 1 + w + w? = 0 = w — 1. By a triangle we shall mean 
an ordered nondegenerate triangle. We identify each point of C with its 
(single, complex) coordinate, so that 0 is the origin O and the triangle AXYZ 
is a positively oriented equilateral triangle if and only if (for some A, w € C) 
(X,Y,Z) = (A + p,A + po, A + pw’) or, equivalently, if and only if X + wY + 
w’Z = 0. 

We assume without loss of generality that AABC is positively oriented, so that 
the requirement that equilateral triangles are constructed outwardly on the sides 
of AABC implies that APCB, AQAC, ARBA are positively oriented equilateral 
triangles. We shall deduce below in (*) that then APQOR is positively oriented, so 
that similarly the requirement that equilateral triangles are constructed inwardly 
on the sides of APOR implies that AUQR, AVRP, AWPQ are positively oriented 
equilateral triangles. Hence all six triangles APCB, AQAC, ARBA, AUQR, AVRP, 
AWP@ are positively oriented equilateral triangles, and consequently we have 


P= -wC-w’B, U=-wQ-o0’*R, D=(V+W)/2, 
O=-wA-w’C, V=-aR-w’P, E=(W+U)/2, 
R= —-wB-0’*A; W= -wP-o’*Q; F=(U+V)/2. 


By substitution we obtain, as required, (D, FE, F) = (A, B,C), completing the 
proof of (a). 


Proof of (b). For any triangle AXYZ we denote by S(AXYZ) the sum of the 
squares of the lengths of its sides and we write T(AXYZ) = (473 (signed area of 
AXYZ), where this area is regarded as positive or negative according as AXYZ is 
positively or negatively oriented. We also write 6,(AXYZ) = |X + w*¥ + w*Z|? 
(k = 1,2), observing that ¢, is unchanged under any translation or rotation (in C) 
of AXYZ so that, if AXYZ = aAX'Y’Z’' (directly congruent), then, for k € {1, 2}, 
,(AXYZ) = 6,(AX'Y'Z’). 

Two interesting general identities are 6,(AXYZ) — 6 {(AXYZ) = T(AXYZ) and 
b (AXYZ) + 6 (AXYZ) = S(AXYZ). In proving these, the above paragraph shows 
that we lose no generality in taking (X,Y, Z) = ¢re®,0, 5), with r, s > 0. Then 


b,( AXYZ) — o,( AXYZ) = (re + ws)(re~® + w’s) — (re® + w’s)(re~"® + ws) 
= (4/3 — 9-4ir/3) p5( gi — 9 i) 
= 2i sin(47/3)rs(2i sin 6) 
= (273 )rs sin 0 = T( AXYZ). 


The other result can be obtained similarly or, for general AXYZ, by direct 
computation. 

Turning to the specifics of the present problem, we find P + wQ + w’R = 
A+oB+o°C and P+w0*Q+wR= —2(A + w’*B + wC), and consequently 
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¢ (APQR) = ¢{AABC) and ¢,(APQR) = 4¢,(AABC). Substituting these results 
in the above two general identities, we find 


2T( APQR) = 5T( AABC) + 3S( AABC), 
8T( AABC) = 5T( APOR) — 3S( APOR). 
The first equation shows, as required in (a) above, that 
T(AABC) >0 = T(APOR) > 0. (*) 


The second equation shows that APQOR is obtainable iff 5T(APQR) > 3S(APOR). 
Then Heron’s formula (op. cit. p. 12, 1.53) for the area of a triangle yields the 
second of the two given conditions, and the first is obtained from 


5(2V3 )PQ - PRsin(Z QPR) > 3(QR? + RP? + PQ”) 
together with OR: RP: PQ::sin P:sin Q:sin R. 


Editorial comment. A closely related question is asked and answered elegantly 
(using motions) in a paper of H. G. Steiner, “Bewegungsgeometrische Lésung 
einer Dreieckskonstruktion,” Math.—Phys. Semesterber. 5 (1956), 132-137, as is 
noted by John E. Wetzel in “Some converses of Napoleon’s Theorem’, a paper 
forthcoming in this MONTHLY. The problem is studied in greater detail in another 
paper of Wetzel, “An elaboration on an example of H. G. Steiner,” Math. 
Semesterber. 37 (1990), 88-95. See also J. G. Mauldon, “Similar triangles,” this 
MontTHLy 39 (1966), 165-174. Most solvers proceeded by constructing outward 
equilateral triangles on the sides of POR, obtaining points X,Y, Z, and proving 
that A, B,C, are the midpoints of PX, OY, RZ, respectively, a construction that 
was given in 1869 by L. Kiepert in “Question 864,” Nouv. Ann. Math. (2)8(1869), 
40-42, as Wetzel points out in his papers. 

The mapping A, B,C — P,Q, R 1s the restriction to positively oriented triples 
ABC of a non-singular linear mapping of C* onto C°. (The determinant of this 
mapping is — 2.) Accordingly both the uniqueness assertion in (a) and the existence 
of the inverse mapping in (b) (to triples of unspecified orientation) are immediate. 
The image of the positively oriented triples ABC under this mapping can be 
described in various ways, including the two given above; a geometric description 
may be found on page 94 of Wetzel’s paper in Math. Semesterber. 


Solved also by R. L. Bishop, J. Dou (Spain), M. Golomb, H. Guggenheimer, J. Heuver (Canada), 
E. Kitchen, V. Konecny, L. Kuipers (Switzerland), O. P. Lossers (The Netherlands), J. H. Meyer, R. D. 
Nelson (England), C. Petalas & T. Vidalis (Greece), R. A. Simon (Chile), S. Stahl, and the proposer. 


The Maximum Value of a Symmetric Function on Points of a Simplex 


E 3263 [1988, 351]. Proposed by Walther Janous, Ursulinengymnasium, Inns- 
bruck, Austria. 


For n > 2, let H, = {(X = (x%),...,x,): Ux; = 1, x; > 0 for all i}. Forl <k <n 
and X € H,, let 


Xi, i 
Si(X) = Lo oe : ’ 


ty LK 


where the sum extends over all k-tuples (i,,...,i,) with 1 <i, < +--+ <i, <n. 
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Put 
M,(n) = sup S,(X). 
XEH, 


It is not hard to see that M,(n) = (n — 1)7”. 
(a) Show that M,(n) = 1. 
(b) Show that M,(4) = 4/27. 
(c)* For what pairs k,n with3 <k <n is it true that M,(n) = (7 )(n — 1)-*? 


Composite solution by J. A. M. Potters, Catholic University Nijmegen, Toer- 
nooiveld, The Netherlands, Woo Jung Min (student), Yonsei University, Seoul, Korea, 
and the editors. The answer to (c) is “for all such pairs’. We examine these 
maximizations in the full simplex H, with boundary, where the variables may 
vanish. We show that the value of S$, is maximized when all n variables are equal 
if k > 2, and where all but 2 variables vanish if k = 2. 


We will argue that we need look for the optimum only among vectors having all 
non-zero values equal. If so, then M,(n) = MAK, <j enl/ (; — 1)-*, where j 
represents the number of non-zero variables at the possible local extrema. For 
k = 2, this function is j;/(2j — 2), which decreases in j; the maximum occurs at 
j = 2. For all other k, the function increases with j, so the maximum occurs at 
j =n and equals (7 )(n — 1)~*; the increasing property follows readily from the 
identity 


j\.. 1) 1 ' 1 TI ' i 
- = —{1+—— - . 
(i) Fat) YE 
Suppose that more than two of the variables are non-zero. If x, and x, are two 
such variables, we can write 


xX, X> _ x4 X4 _ _ 
S,(X) = U(X) + + V(X) + W(X), 
1—x,1-x, l1-x, 1-*x, 
where X = (x3,...,X,). We can use Lagrange multipliers to maximize this func- 


tion of x,,x, over the region where x, +x,=s, 0<s <1. Differentiating 
S,(x1, x) — A(x, +x, — 5) with respect to x, and x, yields the following two 
equations: 


1 X> 
[vu +v]=3 


(1 —x,)’ 1— x, 


1 xy 
[pu +v)=a 


(1 —x,)° 1-x, 


Subtracting these yields 
[x,(1 -—x,) -x,(1 —x,)]U = [(1-x,)°- 1 -»,)']V, 
which is equivalent to 
(x, —X2)(4, +x, — 1)U = (x, — x2) (4, + x2 — 2)V. 
If x, #x,, then this reduces to (1 — s)U = (2 — s)V. Letting p = x,x,, we can 


1991] PROBLEMS AND SOLUTIONS 59 


eliminate U in the expression for S at the maximum to obtain 


ps2 POR) SP Oy SS vy 

(l-—s+p)(1-s) l1—-s+p 1l-s 
This is independent of p and the same value is obtained at the point 
(x, +x>,0,X). This implies that for any maximizing (or minimizing) point with 
more than two non-zero coordinates not all of which are equal, there is another 
point with fewer non-zero coordinates on which the value of S is the same. Hence 
S must be maximized (or minimized) at a point where all non-zero coordinates are 
equal. 

Note that if only one coordinate of X is non-zero, then S,(X) should be 
regarded as 0 if k > 2 and 1 if k = 2. If the variables are not permitted to vanish, 
then for n > 2 and k = 2 the maximum value of 1 can only be approached and not 
attained. 

Because the value of S on points with equal non-zero coordinates is monotonic 
in the number of non-zero coordinates, the argument given yields all extreme 
points, minima as well as maxima. The minima occur when there are fewer than k 
non-vanishing variables if k > 2, and when there are n equal variables if k < 2. 


No other solutions were received, except for a solution of (b) by David Secrest. 


Feuerbach When Ready 


E 3293 [1988, 873]. Proposed by Joseph Keane (student ), Carnegie-Mellon Univer- 
sity, Pittsburgh, and Gregg Patruno, The First Boston Corporation, New York, NY. 


Suppose that the distinct circles C, and C, intersect at P and Q. Suppose that 
the tangent to C, at P intersects C, again at A, the tangent to C, at P intersects 
C’, again at B, and the line AB separates P and Q. Let C, be the circle externally 
tangent to C,, externally tangent to C,, tangent to line AB, and lying on the same 
side of AB as Q. Prove that the circles C, and C, intercept equal segments on one 
of the tangents to C, through P. 


Solution by Richard K. Guy, University of Calgary, Calgary, Alberta, Canada. 
Invert with respect to a circle centered at P. The circles C,, C, invert into straight 
lines Q'B', Q'A’ (Q’ is the inverse of Q, etc.) and the tangents at P into lines 
through P, parallel to Q'B’, Q'A’. The line AB inverts into the circle PA’B’, and 
“AB separating P and Q” is equivalent to “Q’ lying inside circle PA'B'.” So we 
have a parallelogram PA'Q'B’, obtuse-angled at P and Q’. Draw the line through 
P, parallel to A’'B', meeting Q’B', Q’'A’ in R, S. Then P, A’, B' are the midpoints 
of the sides of triangle Q'RS, and RS is the inverse of the line through P on which 
C,,C, intercept equal segments. The circle PA’B' is the nine-point circle of 
triangle Q’'RS; by Feuerbach’s Theorem, it touches the incircle of that triangle. 
But this incircle is the inverse of the circle C, in the original figure, touching 
C,, C, (externally) and the line AB. 


Editorial comment. The proposers and several solvers remarked that the state- 
ment of the problem can be slightly generalized, using the fact that the nine-point 
circle of a triangle is tangent not only to the incircle but also to the three excircles. 
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Feuerbach’s Theorem is proved in §5.6 of H. S. M. Coxeter and S. L. Greitzer, 
Geometry Revisited, New Mathematical Library #19, Mathematical Association of 
America or on pp. 9-10 of D. Pedoe, Circles, Pergamon, New York, 1957. 


Solved also by J. Anglesio (France), J. Dalbec, J. Fukuta (Japan), W. Janous (Austria), O. P. Lossers 
(The Netherlands), A. Nijenhuis, H. Steinacker (Austria), Lamar University Problem Group, and the 
proposers. 


A Ratio on the Newton Line 


E 3299 [1988, 954]. Proposed by Jihad Yamout, Ohio University, Athens. 


Suppose ABCD is a plane quadrilateral with no two sides parallel. Put E = 
AB OACD and F=ADN BC. If M,N,P are the midpoints of AC, BD, EF, 
respectively, and AE = aAB, AF = bAD, where a and b are nonzero real numbers, 
prove that MP= abMN. 


Solution by Hans Kappus, Rodersdorf, Switzerland. With respect to a suitably 
chosen affine coordinate system, we may assume that A = (0,0), B = (1,0), 
C =(x,y), and D=(0,1). Then M = (x/2, y/2), N =(1/2,1/2), E = (a,0), 
F =(0,b), and P = (a/2,b/2). Because of the colinearity of D,C,E and of 
B,C,F respectively, x, y can now be expressed in terms of a,b as x = a(1 — b)/ 
(1 — ab) and y = b(1 — a)/(1 — ab). Consequently, 


SS 


1 
N= 5(-x,1-y) = (1 —a,1—-—b) 


2(1 — ab) 


WP 1 A ab 
g (4 ~*~ ¥) = sy 


(1 -—a,1—b) = abMN. 


Editorial comment. H. Guggenheimer noted that the colinearity of M, N, P was 
proved by Gauss (Works, vol. 4, p. 385-392) using Corollary III to Lemma XXV of 
Newton’s Principia. The line determined by M,N, P is sometimes called the 
Newton line. 


Solved also by J. C. Binz (Switzerland), S. Gaignoux (France), H. Guggenheimer, T. Hermann 
(Hungary), J. Heuver (Canada), P. L. Hon (Canada), W. Janous (Austria), R. H. Jeurissen (The 
Netherlands), P. Khajeh-Khalili, L. Kuipers (Switzerland), H. Lipman, O. P. Lossers (The Netherlands), 
A. Nijenhuis, A. Pedersen (Denmark), I. A. Sakmar (Turkey), A. Sarkar (England), V. Schindler (East 
Germany), M. Vowe (Switzerland), J. B. Wilker (Canada), Lamar University Problem Solving Group, 
the Stanford Ordinal, and the proposer. 


A Characterization of Consecutive Composites 


E 3308 [1989, 154]. Proposed by Ion Cucurezeanu, Liceul 10 Constanta, Roma- 
nia. 


Suppose 7 is a positive integer greater than 4. Prove that the following two 
conditions are equivalent: 

(i) both n and n + 1 are composite, 

(ii) the integer nearest to (n — 1)! /(n* + n) is even. 
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Solution and extension by Jorge-Nuno O. Silva and Owen Brison, Faculdade de 
Ciencias de Lisboa, Lisbon, Portugal. We prove the following more general result. 
If p is an arbitrary prime and n > 2p + 1 (orn > 10 if p = 3), the following are 
equivalent: 

(i) both n and n + 1 are composite, 

(ii) the integer nearest to (n — 1)!/(n? + n) is a multiple of p. 

If n is a positive integer and q is prime, let |n|, denote the largest integer wu 
such that q“|n. We-claim the following: 


Lemma. |m!|, > Irl, for integers m,q,r such that q is prime and max{4, q} < 
m<r<m+dq. Furthermore, the inequality is strict for m > 2q (unless q = 3, in 
which case it is strict for m > 9). 


For the proof of the lemma, let k be the largest integer such that g* < m; this 
implies |m!|, =k, and m > q implies k > 1. Hence we may assume that r is the 
unique multiple of g in (m,m + q], which implies 1 < |r|, < k + 1. If m > 3q, 
then at least three distinct multiples of g appear in the product for m!, including 
q*; this implies |m!|, > k +2> Irl,. If gq <m < 2q, then r = 2q; with m > 4, 
this implies g > 2, which implies |r|, = 1<k <|m!|,. Finally, if 2g <m < 3q, 
then r = 3q. Since q’|m!, we have |m!|, > 2, and |r|, = 1 if q # 3. If q = 3, then 
Ir|, = 2, but then m < 9 and the claim still holds. This completes the proof of the 
lemma. 

We prove the main equivalence by considering three cases for the pair {n, n + 1}, 
always assuming n > 2p + 1 (orn > 10 if p = 3). 

Case I, n and n+ 1 are both composite. It suffices to show that R, = 
(n — 1)!/(n? + n) is an integer divisible by p. We apply the Lemma with m = 
n — 1, q being successively p or the prime factors of m and n + 1, and r being n 
or n+ 1. We find that (n — 1)! has more factors of p than n(n + 1) (since 
gcd(n,n + 1) = 1) and as many factors of any other prime as n(n + 1). Thus R,, 
has the desired property, i.e., is an integer divisible by p. 

Case II, n is prime. It suffices to show that S, = {((n — 1)!!+ n + 1}/(n* +n) is 
an integer not divisible by p. By Wilson’s Theorem, [(m — 1)!+ 1]/n is an integer 
and so {(n — 1)!4+-n + 1}/n is an integer. Note that n+ 1=2-:[(n + 1)/2] 
divides (n — 1)! when n is prime and n > 5. By the Lemma with m = n — 1 and 
r=n +1, we have |m!|, > |r|,; this implies that (n — 1)!/(m + 1) is an integer 
divisible by p. Now we rewrite S, as {(n — 1)!/(m + 1) + 1}/n. The numerator is 
an integer not divisible by p. However, since gcd(n,n + 1) = 1, the numerator is 
still a multiple of n. Thus S, has the desired property, 1e., it is an integer not 
divisible by p. 

Case III, n + 1 is prime. It suffices to show that T, = {((n — 1)!+ n}/(n* +n) is 
an integer not divisible by p. By Wilson’s Theorem, (nm! + 1)/(n + 1) is an integer. 
Since (nm — 1)!+ 1 - (n!+ D/(n + 1) = (Cn — D!4 nh /{n + 1}, it follows that 
[((n — 1)!4+ n]/(n + 1) is also an integer. By the Lemma with m =n — 1 and 
r =n, we have |m!|, > Irl,; this implies that (n — 1)!/n is an integer divisible by 
p. Now we rewrite T, as {(n — 1)!/n + 1}/(m + 1). The numerator is an integer 
not divisible by p. However, since gcd(n,n + 1) = 1, the numerator is still a 
multiple of n + 1. Thus JT, has the desired property, i.e., it is an integer not 
divisible by p. 
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Editorial comment. The above proof simplifies somewhat when p = 2. In fact 
one can use the identities 


R, =(n—-1)!/n+n!/(n + 1) - (n - 1)! (1) 
S,={(n -— 1)!+ 1}/n4+n!/(n + 1) - (n- 1)! (11) 
T, = (n!+1)/(n + 1) + (n - 1)!/n - (n - 1)! (II) 


in the three cases considered above, where each term on the right is an integer 
whose parity can be readily determined. 


Solved also by the proposer and 34 others. 
An Arcsine Inequality 


E 3326 [1989, 445]. Proposed by George Baloglou, University of Kansas, Lawrence. 
If 0 < 6 < m, prove that 


arc sin S + arcsin 3 sin — | < 


4 


6 2. 6 
[seea} <3 


Solution by Jestis Ferrer, Oliva, Valencia, Spain. Set x =0/4, let I= 
{x: 0 <x < 7/4}, and consider the function f(x) = arc sin(2x/3) + 
arc sin(2(sin x)/3) — 4x/3. We will show f(x) < 0 on J. Since f(0) = 0, it suffices 
to show f(x) <0 on J. From f(x) = 29 — 4x7)7 1/7 + (9 + Stan? x) 1/7 — 
2/3], it suffices to show g(x) < 2/3 on J, where g(x) = (9 — 4x7)° 774 (94 
5tan? x)~'/?, Since g(0) = 2/3, it suffices to show g(x) < 0 on I. Since g(x) = 
4x(9 — 4x*)~°/2 — Stan x(1 + tan? x)\(9 + Stan? x)~?/?, it suffices to prove 


(94+ 5tan?x)” = 5(9 — 4x2)” 


tan x(1 + tan? x) 4x 
Now sin(x)/x > sin(ar/4)/(7/4) = 2V2/m7 implies 1 + tan? x = 
(1 — sin? x)~! > w?(ar? — 8x7)7! on I. Hence 
A= {5+4/(1 + tan? x)}(5 + 9/tan? x)” < (99? - 32.x2)°/? /(2V2 32x). 
We need only prove that the last expression is at most B, which is equivalent to 
(Sar? — 32x*)/(9 — 4x7) < (2527 /2)'/7ar. This follows from the fact that the left 


side of this final inequality is an increasing function whose value at x = 7/4 is 
287? /(36 — 2”), which is less than (257 /2)!/3r. 


- Solved also by J. Anglesio (France), J.-M. Becker (France), J. S. Frame, M. E. Kuczma (Poland), 
K.-W. Lau (Hong Kong), H. Lipman, K. McInturff, J.-M. Monier (France), and the proposer (twice 
alone and once with S. Reyner). Six incorrect solutions were received. 


= B. 


Missing Only the kth Powers 


E 3340 [1989, 642]. Proposed by Courtney Moen, U.S. Naval Academy, Annapo- 
lis, MD. 


Suppose k is a given integer > 2. For n € N (the set of positive integers) put 


f(n) = |(n + n¥/*)*| 41, 
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where | | denotes the greatest integer function. What is the range of the func- 
tion f? 


Solution by John Henry Steelman, Indiana University of Pennsylvania, Indiana, 
PA. The range of f consists of all positive integers that are not perfect kth powers. 

First we show that non-kth powers are in the range of f. Since k > 2, the 
binomial theorem implies that s* +5 + 1<(s + 1)‘ forall s > 0. With s =a — 1, 
we have (a — 1)‘ < a* — a. For a,m € N with aé < m < (a + 1)*, we claim that 
m = f(m — a). Let n =m — a, so that a* — a <n < m. Our various inequalities 
now imply 


a® <(m—a)+(a-1) <ntn'/* <(m—-a)+(a41) <(at+1)*. 
Thus f(n) =a + (m—a)=m. 

Next we show that the Ath powers are missed. Note that f is a strictly 
increasing function, so that f(1) = 2 implies 1 is missed. For any larger a*, it 
suffices to present two consecutive values whose images are a“ — 1 and a* + 1. If 
m is not a kth power, we showed above that m is the image of m — |m!'/* |. Hence 
flak +1-—a)=a* + 1and f(a* — a) = f(a* —1-(a—- 1) =a* - 1. 

Solved also by J. Balogh (student, Hungary), J. C. Binz (Switzerland), J. Duemmel, N. J. Fine, R. J. 
Hendel, W. Janous (Austria), H. Ki (South Korea), K. -W. Lau (Hong Kong), M. Leeney (Ireland), 
O. P. Lossers (The Netherlands), J. H. Nieto (Venezuela), A. Nijenhuis, H. Noland, V. V. Pambuccian, 
A. Pedersen (Denmark), B. M. de Weger (The Netherlands), R. N. Will, J. Zhang (China), Arizona 


State Problems Group, National Security Agency Problems Group, and the proposer. There was one 
incorrect solution. 


ADVANCED PROBLEMS 


6646. Proposed by Daniel Goffinet, St. Etienne, France. 
Suppose f is a continuous function from [0,1] x [0,1] to R such that 
1/2 


Lf {fir.) ax) | - [Luan ‘| ie 


Prove that there exist continuous functions u and r from [0,1] to R such that 
giuCy)}? dy = 1, r(x) > 0 for all x in [0,1], and f(x, y) = r(x)u(y) for all x and 
y in (0, 1]. 


6647. Proposed by A. J. Vince, Bristol Polytechnic, Bristol, England. 


Let S, = {1,2,£7,...,¢"7 |} be the set of n-th roots of unity and suppose f is 
any function on S, into the set of complex numbers of absolute value one. For 
every positive integer k less than n/2 prove that there exist integers i and j such 


that | | | | 
Ii - GS — oe Sl FS) -F(L)]. 


6648. Proposed by Walter Rudin, University of Wisconsin, Madison. 


Let () be the region obtained by removing the points 0, 1, from the Riemann 
sphere. Find all nonconstant holomorphic functions defined on (1 which map © 
into itself. 
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SOLUTIONS OF ADVANCED PROBLEMS 
Eigenvalues of Sine and Cosine Matrices 


6591 [1989, 66]. Proposed by Ronald J. Evans, University of California, San 
Diego, and Jerrold R. Griggs, University of South Carolina, Columbia. 


Let n be an integer exceeding 4 and put ¢ = [(n — 1)/2]. Find the eigenvalues 
of the following ¢ by ¢ matrices: 

(i) the matrix S§ in which the element in the jth row and kth column is 
sin(Qajk/n),1<j, k <t. 

(ii) the matrix C in which the element in the jth row and kth column is 
cos(2irjk/n), 1 <j, k <t. 


Solution by David Callan, University of Bridgeport, Bridgeport, CT, and Grace 
Wahba, University of Wisconsin, Madison (jointly). Let A denote yn /2. The 
eigenvalues of S are A, with multiplicity [¢/2], and —A, with multiplicity |t/2]. The 
matrix C has simple eigenvalues —1 and 0 if n = 0 mod4, +1/ v2 ifn=2 
mod 4, and —1/2 if n is odd. If the remaining eigenvalues of C are even in 
number they comprise an equal number of A and —A values; otherwise, there is 
one more A. 

To show this, first calculate S* and C?. Let J denote the ¢ by ¢ identity matrix, 
let J denote the ¢ by ¢ matrix with all entries 1/2,’and let K denote the ¢ by t 
matrix in which the entry in the jth row and kth column is {1 + (—1)/+*}/2. Then 
by elementary trigonometric identities 


S? = (n/4)I 
while 
C*=(n/4)I-J 
for n odd and 
C*=(n/4)I-K 


for n even. The eigenvalues of C’ are now easily found, and are given in the 
second column of the table below (multiplicities in parentheses). This determines 
the desired eigenvalues up to sign, and a determination of the traces of C and S is 
sufficient to settle the matter. These are obtained from the value of 


Qarij* 


3 ex | = Tr(C) +i Tr(S) 
j=l 


which is known from the theory of Gaussian sums. (Cf. E. Landau, Vorlesungen 
liber Zahlentheorie, Satz 211, or T. Nagell, Introduction to Number Theory, Theo- 
rem 99.) Thus the result foHows from the values provided in the third and fourth 
columns of the table below. 
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Eigenvalues of C’ Tr(C) Tr(S) 


0,1, ri “dt — 2)times) 


7 ~( — 1)times) 


— = (twice) AG — 2)times) 


Callan and Wahba also mention that an analogous problem involving a matrix of 
exponentials (known as the finite Fourier transform matrix), namely E = (e,,) 


where 


appeared here as Problem 3824 [1937, 251; 1939, 304]. Griggs remarks that he 
came across matrices of this type while examining certain random walk problems. 


Solved also by the proposers. 


A Property of Two Dimensions 


6593 [1989, 165]. Proposed by Michel Balazard, Limoges, France. 


Suppose EF is a two-dimensional vector space of continuous real-valued func- 
tions on [0, 1] such that for every x € [0,1] there exists g in E with g(x) # 0. 
Suppose S is a nonempty subset of EF such that 


G(x) = sup g(x) < +” 
ges 


for every x € [0,1]. Show that G is continuous and give an example to show that 
the result fails if E is three-dimensional instead of two-dimensional. 


Solution by Christopher P. Grant, University of Utah, Salt Lake City. For a 
counterexample in the three-dimensional case take E to be the span of {1,1 + x, 
1+ Vx} and S to be the set 


{—Mx + Ave: 5 SA < ob. 


A routine calculation gives G = 0 at x = 0 and G = { on the rest of [0, 1]. Note 
that this counterexample also shows that (in the two-dimensional case) the non- 
vanishing condition is appropriate. 

We now prove that, with the stated hypotheses, G is continuous at every x, in 
[0, 1] if E is two-dimensional. Let x,, x,... be a sequence in [0, 1] that converges 
to X). We shall show that along some subsequence G(x,) converges to G(x,). This 
will imply the continuity of G. Choose a basis {f, g} for E with f(x,) = 0 and 
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g(x,) = 1. From the hypotheses such a choice is easily seen to be possible. Hence 
S = {c,f+d,g:a€A} 


for some index set A. Since x, —-x, and g is continuous, by passing to a 
subsequence (specifically, by deleting finitely many terms at the beginning of {x,}, 
if necessary} we can assume that g(x,) > 0 for each k. Note that 


G(x) = sup {d,}. 


If f(x,) = 0 for all k sufficiently large, then for these k we have 
G(x,) = G(x) 8(%,) 7 G(x) 


by the continuity of g, and we are done. If not, by passing to a subsequence and 
possibly changing the sign of f, we can assume that f(x,) > 0 for each k. 
By the definition of G, 


c,f(x*,) + d,g(x,) < G(x,), all ain A. 


Hence 
Caf (x,) + d,8(x,) < [G(x,) -d o(x,) ee 4a g(x;) 
a k a ki = 1 a 1 f(x,) a k 
f( xx) f(*,) 
= CO aay + df eC = 65) 


< a(n) eS + G(x0)] (0 — g(X;) 


ry, 
f(*,) | 


provided that k is so large that the bracketed quantity, which approaches 1 as 
k — o, is nonnegative. A closer look shows that the right side of this inequality has 
the form G(x,) + €, where e, — 0 as k — ~, while the supremum of the left side 
over all pairs (c,, d,) is G(x,). Hence 


G(x,) < G(x)) + €; 
and 
lim sup G(x,) < G(x). 
k 


Since G is the supremum of a set of continuous functions, it is lower semicontinu- 
ous, 1.e. 


lim inf G(x,) = G(xp). 
Hence 
along a subsequence of our original sequence, and the result follows. 
Editorial comment. Adam Riese showed that if we also require 


G(x) = inf g(x) > -© 
ges 
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then both G and G” are continuous under the weaker assumption that E be 
finite-dimensional. 
Solved also by John A. Frohliger, Reiner Martin (West Germany), Adam Riese, Kenneth Schilling, 


John Henry Steelman, and the proposer. 


A Commutative Diagram Involving Free Groups 
6594 [1989, 165]. Proposed by Stephen M. Gersten, University of Utah, Salt Lake 
City. 


Let G, A, and B be groups with A and B both free groups. Let ¢: A ~ G* B 
be a homomorphism and let a: G* A ~ G*B be given by a = (i, d), where i is 
the injection G — G « B; that is 


(i,@): g,4,...8,a, > 8,6(a,)...8,6(4,). 


Consider a commutative diagram 


Gx*A— G«B 


|r. | 


A B 


where the vertical arrows send G to {1} and are the identities on A and B, 
respectively. Prove that if B is injective, then a is also injective. 


Solution announcement. The solution will appear as an application in the 
proposer’s paper “Hierarchies of 2-complexes and equations over groups’, in 
preparation. The proposer’s proof is elaborate, and involves both geometric 
methods and towers of branched coverings. The editors published this problem 
with the hope that one of our readers would find an easier approach that had been 
overlooked, but no such solution was received. 


One and Only One Solution 


6596 [1989, 265]. Proposed by S. D. Chatterji and C. El-Hayek, Ecole Polytech- 
nique Fédérale, Lausanne, Switzerland. 


Prove that the differential equation 


dx 1 


dt 7 t7 +x? 
has a unique solution defined on R satisfying the initial condition x(1) = 1. 


Solution by David V. V. Wend, Department of Mathematics, Montana State 
University, Bozeman. We first prove this with rather general techniques, and then 
show how a more specialized method provides more precise information. The only 
singularity of x’ = x(t) = (t? + x*)7! is at (0,0), and since x’ > 0 all solutions are 
strictly increasing. For such “smooth” initial value problems unique solutions are 
guaranteed to exist by standard theorems and there are simple comparison 
theorems. Here the domain of a solution will be R unless its graph has a vertical 


68 PROBLEMS AND SOLUTIONS [January 


asymptote or has the origin as a limit point. Moreover, if 
f(t,x) <g(t,x) and a<b(a2b), 


then the solution of x’ = f(t, x), x(t)) = a, is bounded above (below) to the right 
(left) of t) by the solution of w’ = g(t,w), w(t,) = b, on their common domain. 
We apply this comparison theorem with f(t, x) = (¢t* + x*)~'. For t>1 and 
x >1, clearly f(t, x) < 5 so the increasing solution of x’ = f(t, x), x(1) = 1 is 
bounded above by w(t) = (t+ 1)/2, the solution of w’ = 5, w1) = 1. Also 
f(t,x) <x7* for x #0, so for ¢ <1 the function x(t) is bounded below by 
w(t) = (3t — 2)!77, the solution of w’ = w~*, w(1) = 1. Thus x(t) has no vertical 
asymptotes and has domain R unless it approaches 0 as t > 0°. 
For 5 <x <1 we have f(t, x) > (t? + 1)! so 
x(t) <w,(t) = arctant + c,, 


where w, solves the corresponding comparison equation with w,(1) = 1 and 
c, = (4 — 1)/4. When 


w, = 3 = arctant, + (4-7) /4, 
then 


and x(t,) < 5=w,{(t,). For 0 <x < $ we have f(t, x) > (t? + +)7! so 
x(t) <w,(t) = 2arctan(2t) +c, 


where w, solves the comparison equation with w,(t,) = ; and 
1 
= 5H 2arctan(2t,) = —0.561.... 


Now w,(t) = 0 when 


Thus x(t) = 0 for some ¢ >t) > 0 and x(t) does not tend to 0 as t > 0. Hence 
x(t) has domain R and the result follows. We add that the method of the first part 
of this solution can be used to show that in general every solution has domain R 
except the two unique solutions that approach the origin tangent to the x-axis 
(their domains are the negative and positive reals respectively). 

For an approach tailored to this specific equation, interchange the roles of the 
dependent and independent variables. This yields the Riccati equation 

had = x74 t*, 


dx 
and the usual change of variable t = —z'/z linearizes it to 
z"+x’7z=0, 


an equation whose general solution (see [3], pp. 145-146) has the form 


z= x7? (CJ) 4(2x7/2) + Cod _14(*7/2)). 
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If the solutions of the linear equation are normalized by the condition z(1) = 1, 
then clearly z(1) = 1, z’(1) = —1, are the appropriate initial conditions for the 
linear equation (t(1) = 1). Thus z = z(x) provides a solution to the Riccati 
equation on intervals about x = 1 where z(x) is nonzero. However, a well-known 
oscillation theorem [1] shows that z(x) is oscillatory on R. Thus we must deal with 
numbers x, and x, (where x, < 1 <.x,), the zeros of z(x) closest to 1 on the left 
and right respectively. Since nontrivial solutions of the linear equation can have 
only simple zeros, z'(x,) > 0 and z'(x,) < 0. Hence t(x) tends to —~ as x > xf 
and to +o as x ~ x5. Thus the range of t(x) is R and we could say that its 
inverse (cf. the comment below) x(t) is the desired solution with domain R if we 
knew that ¢(x) did not pass through (0,0). Let u(x) be the solution of the Riccati 
equation with u(0) = 0. Then (see, e.g., [6], [4] pp. 21-22, C1.14, C2.14 and 
C2.162(10)) 


u(x) = x3 4 x7/2)/I _ 1 ,4(x?/2) 
and from tables of Bessel functions [5] 
u(1) = 0.35023184 < 1 = ¢(1). 


Thus ¢(x) does not pass through the origin and the result follows. It has in fact 
been proved that all solutions of the original equation have domain R except for 
the two solutions that are inverses of u(x) and approach the origin tangent to the 
x-axis. It can be shown [6] that these two solutions have horizontal asymptotes 


x = +2.003147359... 
and that the solution v(t) with v(1) = 0 has horizontal asymptotes at 
x = —2.224..., and x = 0.96979.... 


The solution x(t) is sandwiched between these, and must have domain R since it is 
monotone. This provides yet another variant of the solution. 


Editorial comment. A common approach not involving Bessel functions was to 
(i) show by standard theory that dt/dx = t? + x* has a unique smooth solution 
t = t(x) on some interval, (ii) show (e.g. via comparison techniques) that t(x) 
assumed all real values, and then (iii) invert t = t(x). Various arguments were 
presented to show that (0) # 0, and hence that dt/dx > 0 for all x. One of the 
shortest was Nathaniel Grossman’s. Assume ¢(0) = 0. Since ¢ is a nondecreasing 
function of x and ¢(1) = 1, we have tx) <t +x for 0 <x < 1. Hence, 


[e-*t(x)]lo < fixe dv 


or 3 < e, a contradiction. 
The most explicit solution received was that of Jean Anglesio who showed that 
x(t) was the function inverse to 


J jq(x°/2) — KI _3/4( 7/2) 


Aidan 7K C277) CL) 


where 


J34(3) _ al 


k Jala 
J_34(2) + Jia(d) 


= —(0.517050167.... 
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Anglesio also provided the values 


x(0) = —0.9519678985..., 
lim x(t) = 1.7294440222..., 


[~~ @ 


and 
lim x(t) = —2.127718999.... 


tno -—@ 


M. S. Klamkin pointed out that this same differential equation with x(O) = 0 
appeared as E 1113 [1954, 259; 1954, 714; 1955, 443], and that a closely related 
problem appeared in the seventh Putnam Competition ((2], pp. 235-236). The 
Putnam reference was also provided by the proposers. For another example of a 
differential equation that may be studied in detail either by very general tech- 
niques, or by the use of specific special functions, see 6551 [1987, 694; 1989, 657]. 
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Solved also by K. F. Andersen (Canada), Jean Anglesio (France), Robert Betts, David Borwein 
(Canada), Alan E. Berger, F. J. Flanigan, Nathaniel Grossman, Michael Hurley, A. A. Jagers (The 
Netherlands), Charles Kenney, Murray S. Klamkin (Canada), Thomas McDonald, J. Rausen, Nora S. 
Thornber, and the proposers. Two incomplete solutions were received. 


Too Many Triods 


6598 [1989, 366]. Proposed by Walter Rudin, University of Wisconsin, Madison. 


Let K be a closed circular disc in the plane R?. Suppose that f is a continuous 
map from K into R? which is one-to-one on the interior of K. For each point q in 
f(K), let N(q) denote the cardinality of the set of all points p in K for which 
f(p) = @. 


Prove that N(q) > 3 for at most countably many points q. 


Solution by John Cobb, Mathematics Department, University of Idaho, Moscow. 
Let C denote the boundary circle of K. Since f(C) and f(K \ C) are disjoint, it 
suffices to consider points of f(C). (Here A \ B denotes the set of all points in A 
but not in B). 

For q in f(C), let H(q) denote the convex hull of f~'(q) in K. The H(q)’s are 
pairwise disjoint. For if not, there would be four distinct points a, b, a’, b’ in cyclic 
order around C such that 


f(a) = fla’) # f(b) = f(b’). 
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If A denotes the straight-line segment in K with endpoints a and a’, then 
f(a) = f(a’) implies that f(A) is a simple closed curve, and must separate the 
plane. Let B denote the straight-line segment in K with endpoints b and 5’, and 
let c be the intersection of A and B. Since f(b) = f(b’), it follows that f(B \ c) is 
a connected set in the complement of f(A), and hence lies entirely in one 
complementary domain. But f is locally a homeomorphism at c, and B\c 
contains points locally on each side of A. Hence f(B \c) contains points in each 
complementary domain of f(A), which a connected set cannot do. Thus A and B 
must be disjoint, and so must be any two hulls H(q) and H(q’). 

If N(q) => 3, then H(q) contains at least three non-collinear points, and hence 
contains an open set. If the set of all such g were uncountable, the same would be 
true of the collection of all the corresponding open sets. But (disjointness!) each 
such set can be designated uniquely by an ordered pair of rational numbers. 


Editorial comment. Several solvers noted that this result follows immediately 
from Moore’s theorem that any family of pairwise disjoint triods (a triod is, 
roughly, a homeomorph of the sans serif letter ““T’’) in the plane is at most 
countable. In particular, Mark Meyerson provided a sketch of Moore’s proof and 
the reference R. L. Moore, Concerning triods in the plane and the junction points 
of plane continua, Proc. Nat. Acad. Sci. 141928), 85-88. Dov Aharonov and 
Harold S. Shapiro obtained both the present result and also stronger results with 
stronger smoothness conditions on f (On the topology of certain simply connected 
domains, Technion preprint series no. MT-465, Haifa; see also Dov Aharonov, 
Holder conditions and the topology of simply connected domains, Canad. Math. 
Bull. 26(2) (1983), 189-191). 


Solved also by Dov Aharonov (Israel) & Harold S. Shapiro (Sweden), D. W. Finley, Richard P. 
Jerrard, Ilias G. Kastanas, Marcin E. Kuczma (Poland), Paul MacManus, Mark D. Meyerson, Arlo W. 
Schurle, and the proposer. 
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Lectures on Minimal Surfaces, Vol. 1. By Johannes C. C. Nitsche, Cambridge 
University Press, New York, 1989, xxv + 563 pp. 


ROBERT OssERMAN 
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In 1978, Springer-Verlag published a book by an unknown author, Arthur L. 
Besse, with the unpromising title, Manifolds All of Whose Geodesics are Closed. To 
the uninitiated, it might have seemed yet another book in yet another narrow 
subject, of possible interest to a handful of people. Quite the opposite was true. 
Besse turned out to be a cousin, once removed, of Bourbaki, with ties to the 
highest families of French Geometry, and the subject of the book touches not only 
many parts of geometry, but also other important areas of mathematics. 

_ Three years before the Besse book, Springer published Nitsche’s Vorlesungen 
Uber Minimalfldchen. Here, too, the subject matter seemed terribly constrained. In 
the decade before the book’s appearance the relatively narrow subject of minimal 
surfaces had begun to expand significantly in various directions—to surfaces in 
higher-dimensional euclidean spaces through the contributions of Chern, to 
higher-dimensional hypersurfaces through Bombieri-de Giorgi-Giusti-Miranda, to 
minimal surfaces in spheres with Calabi, Chern, and Lawson, to general- Rieman- 
nian manifolds via Jim Simons, and into geometric measure theory with Federer- 
Fleming and Allard-Almgren. Nitsche resisted all temptation to enlarge the scope 
of his volume, and focused his attention on classical two-dimensional minimal 
surfaces in three-dimensional euclidean space. Furthermore, he did so at a length 
of 775 pages, and to top it off: all in German. 

Like Manifolds All of Whose Geodesics are Closed, Nitsche’s Vorlesungen Uber 
Minimalflachen overcame its apparently limited nature by first of all exhibiting a 
rich theory, with many interesting properties and not a few surprises, and second, 
by strong links with other parts of mathematics, including topology, partial differ- 
ential equations, complex function theory, calculus of variations, and measure 
theory. In addition, as Plateau’s experiments make clear, mathematical physics 
lurks always not far beneath the surface. The classic by C. V. Boys on Soap 
Bubbles, Their Colours, and the Forces which Mold Them, and the 1978 book by 
Cyril Isenberg, The Science of Soap Films and Soap Bubbles, emphasize the physical 
aspects of the subject. 

Since the appearance of Nitsche’s book, the field of minimal surfaces has 
further expanded, if not exploded. In the late seventies, Meeks and Yau demon- 
strated its importance and value in topology, Schoen and Yau in relativity. Leon 
Simon and later Schoen managed a synthesis of PDE and geometric measure 
theory, yielding major new advances. Important contributions were made by 
Harvey and Lawson, including their fundamental new theory of calibrations. All of 
this only further accelerated during the eighties when a whole new generation of 
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geometrical measure theorists—mostly students of Almgren—sprang into action, 
while a school of young topologists also joined the fray. 

But the subject of classical minimal surfaces in R* was not lying dormant. 
Among the noteworthy developments were: first, the global analysis approach to 
Plateau’s problem by BOhme, Tomi, and Tromba; second, the theory of stability, 
initiated by Barbosa/do Carmo, do Carmo/Peng and Fischer-Colbrie /Schoen, 
that later flowered into the currently active subject of the index of minimal 
surfaces, started by Fischer-Colbrie and Gulliver/Lawson; third, the theory of 
complete embedded minimal surfaces in R°, due initially to Jorge-Mecks, and later 
blossoming under Costa, Hoffman, Meeks and others; fourth, the breakthroughs 
on the problem of the Gauss map of complete minimal surfaces, first by Xavier 
and then Fujimoto. 

Meanwhile, behind the scenes, there was Nitsche’s book, the source of a wealth 
of background material for some, and of mainly frustration to others for whom the 
language was an obstacle. At some point it became clear that a translation would 
be a worthwhile project. Arrangements between publishers were made, and Jerry 
Feinberg undertook the literally thankless task of providing an English translation 
of the German text. However, the many advances we have just described (and 
others) made the original text terribly out of date in places. Faced with that, and 
some dissatisfaction with the manner in which many of his carefully formulated 
and sometimes colloquial or colorful phrases had been turned into English, 
Nitsche decided to get actively involved once more. He added 12 pages of new 
Preface, a number of updates inserted in the text, 65 pages of appendices, and over 
350 new items to the bibliography. It soon became clear that the expanded version 
was a bit much for a single volume. What we have so far is the first of two volumes, 
devoted to the translation of Chapters I-V of the original, and including the 
additions already mentioned. 

By far the bulk of the present volume is devoted to the classical Plateau 
problem: finding a minimal surface with a given Jordan curve as boundary. 
Courant’s 1950 book, Dirichlet’s Principle, Conformal Mapping, and Minimal 
Surfaces (reprinted by Springer in 1977) might easily have given the impression 
that the last word had been said on the subject. After describing earlier partial 
results, Courant says: “Complete and satisfactory success was however achieved 
only in 1930 and 1931, independently, by T. Rado and J. Douglas.” Many pages 
later one finds a reference to three “unsolved problems”: boundary behavior, 
uniqueness, and interior regularity. The third of these revolved around the fact 
that the solutions of Douglas and Rad6 were allowed to have certain singularities, 
called branch points, that were in fact never observed in experiments with soap 
films. How Courant could have considered Plateau’s problem “completely and 
satisfactorily” resolved with that gap still remaining becomes clear if one reads 
further and realizes that Courant believed that the solution surfaces really did 
have branch points, so that what remained was not to prove the regularity of the 
surface, but the (perhaps) less serious problem of finding conditions on the 
boundary curve for which branch points did or did not occur. In any case, one now 
knows that the solution surfaces of Douglas and Rado are in fact regularly 
immersed disks, with no branch points or singularities, and Nitsche includes in the 
English edition a good deal of new material elucidating the point. There have been 
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many important contributions to the other two “unresolved problems” mentioned 
by Courant. However, especially regarding uniqueness, there is much still to be 
done. In an appendix to the new edition, Nitsche includes a proof of Tomi’s 
theorem that a regular analytic Jordan curve can bound only a finite number of 
area-minimizing solutions to the Plateau Problem. Whether there can be an 
infinite number of solutions altogether (including non-area-minimizing ones) is still 
unknown, even for real analytic boundary curves. 

The second volume, promised soon, required even greater revisions and addi- 
tions to include the newer results in active areas such as the theory of complete 
minimal surfaces, the subject matter of one of the four remaining chapters. We are 
also informed of a Volume Three “now in preparation,” but with no hint as to its 
content. In the meantime, Nitsche has given us the latest installment—and a very 
substantial one—in the never-ending story of minimal surfaces in old-fashioned 
Euclidean three-space. 

I should like to repeat a comment I made in reviewing the original German 
edition: not only does Nitsche provide us with a great resource in his extensive 
bibliography, but he enhances enormously the usefulness of the bibliography, and 
the book as a whole, by indicating after each item listed, all the places in the text 
where it is mentioned. He is also very good about including precise page refer- 
ences when indicating sources of theorems and proofs. Both of those practices are 
worthy of emulation, rather than the habit of some of our lazier authors who 
content themselves with a vague general reference to a 400-page book. The one 
slightly disconcerting decision Nitsche has made in separating his book into two 
volumes was to leave references to later sections unamplified, even though the 
sections referred to may be in a separate (and not yet existent) volume. 

But not to end on a peevish note, Nitsche’s book has much to offer in a subject 
area that remains eminently accessible, though highly developed; having it avail- 
able in an updated and English-language version should assure a wide and 
enthusiastic audience. 
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Education, S(15-16). How to Use Cooperative 
Learning in the Mathematics Class. Alice F. Artzt, 
Claire M. Newman. NCTM, 1990, v + 73 pp, 
$6.95 (P). (ISBN: 0-87353-293-7] Intended for high 
school mathematics teachers, this monograph dis- 
cusses ways of implementing cooperative learning in 
the classroom. Sample activities and problem sets 
are included. SG 


Education, P*, L. Teaching and Learning Math- 
ematics in the 1990s: 1990 Yearbook. Thomas J. 
Cooney, Christian R. Hirsch. NCTM, 1990, viii + 216 
pp, $18. [ISBN: 0-87353-285-6] The 1990 NCTM 
yearbook is devoted “to the people of mathematics 
education—teachers and students—and their chang- 
ing roles in the face of the calls for reform in the 
1990s.” The 28 articles are grouped into seven parts: 
Part 1 is on new perspectives in teaching; Part 2 dis- 
cusses classroom methods such as small-group work 
and writing; Part 3 deals with new models and ideas 
for assessment; Parts 4 and 5 are concerned with cul- 
tural and contextual factors in teaching and learning; 
Part 6 addresses the role of technology in mathemat- 
ics instruction; and Part 7 discusses the meaning of 
the word “professionalism” as it relates to mathe- 
matics teachers. The authors of the articles repre- 
sent the entire spectrum of teaching, from elemen- 
tary through post-secondary. Every teacher of math- 
ematics will certainly find something of interest and 
importance in this book. MPR 


History, L. Apollonius Conics Books V to VII: The 
Arabic Translation of the Lost Greek Original in the 
Version of the Bani Misa, Volumes I and II. Ed. 
and Trans.: G.J. Toomer. Sources in the Hist. of 
Math. & Physical Sci., V. 9. Springer-Verlag, 1990, 
$119 set [ISBN: 0-387-97216-1]. Volume I: Introduc- 
tion, Tezt, and Translation, xcv + 547 pp, Volume 
IT: Commentary, Figures, and Indezes, 337 pp. Vol- 
ume I contains a 95-page introduction and the text in 
Arabic along with a translation into English. Volume 
IT contains commentary, about 300 nicely rendered 
illustrations, indexes, and a bibliography. JDEK 


History, L*. The Collected Papers of Albert Ein- 
stein, Volume 2: The Swiss Years: Writings, 1900- 
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1909. Eds: John Stachel, et al. Princeton Uni- 
versity Pr, 1989, xxxii + 656 pp, $85. [ISBN: 0- 
691-08526-9] Sixty-two papers, reviews, and related 
documents from Einstein’s most productive period, 
including several of the most influential papers in 
the history of physics (molecular forces, quantum 
hypothesis, Brownian motion, relativity, electrody- 
namics). Extensively annotated with several topical 
essays provided by the editors—all in English. Ein- 
stein’s papers are in German; a separate typescript 
un-annotated translation is under preparation. Sec- 
ond of anticipated 30 volumes. (Volume 1, TR, De- 
cember 1988.) LAS 


History, P*, L**. Joseph Liouville 1809-1882: 
Master of Pure and Applied Mathematics. Jesper 
Litzen. Stud. in History of Math. & Physical Sci., 
V. 15. Springer-Verlag, 1990, xix + 884 pp, $98. 
(ISBN: 0-387-97180-7] A massive, thoroughly re- 
searched biography of Liouville, in two parts: an ac- 
count in six chapters of his scientific and professional 
career—written broadly for non-specialist readers— 
followed by eleven chapters describing his mathemat- 
ical contributions to various fields. Numerous quota- 
tions (translated into English) from Liouville’s exten- 
sive unpublished notebooks enliven Lutzen’s account 
of the academic and political intrigues of nineteenth 
century France. Extensive notes link the text to a 
lengthy bibliography of both published and unpub- 
lished references. LAS 


Logic, T(14-16). First Order Mathematical Logic. 
Angelo Margaris. Dover, 1990, x + 211 pp, $6.95 
(P). (ISBN: 0-486-66269-1] Corrected reprint of the 
1967 Blaisdell original (TR, June-July 1968). Two 
substantive parts: predicate calculus and first order 
theories. Concludes with decidability and a sketch of 
a proof of Godel’s theorem. LAS 


Logic, P, L. Reflections on Kurt Godel. Hao 
Wang. MIT Pr, 1988, xxvi + 336 pp, $12.95 (P). 
[ISBN: 0-262-23127-1] Paperback version of the 
1987 original edition (TR, January 1988). A mixture 
of biography, personal recollections, and intellectual 
self-expression by a famous logician who was a close 
professional associate of Godel. LAS 
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Graph Theory, S(18), P, L. Theory of Finite and 
Infinite Graphs. Dénes Konig. Transl: Richard Mc- 
Coart. Birkhauser Boston, 1990, 426 pp, $75. [ISBN: 
0-8176-3389-8] Reprint of the first book on graph 
theory published in 1936 and containing many clas- 
sical problems. Results updated in commentary by 
W.T. Tutte, plus biographical sketch of D. Konig by 
T. Gallai. Important for history of field. JPH 


Graph Theory, T*(16-17), S, P, L. Distance 
in Graphs. Fred Buckley, Frank Harary. Addison- 
Wesley, 1990, xiii + 335 pp. (ISBN: 0-201-09591- 
2] An introductory textbook based on the classic 
Graph Theory by Frank Harary, updated and uni- 
fied by the concept of distance in graphs. Theorems 
are clearly stated; proofs are carefully explained and 
motivated, usually with accompanying figures to elu- 
cidate and illustrate concepts. Exercises, large bibli- 
ography. LCL 


Combinatorics, P. Coding Theory and Design 
Theory, Part II: Design Theory. Dijen Ray- 
Chaudhuri. IMA, V. 21. Springer-Verlag, 1990, xiii 
+ 378 pp, $39. [ISBN: 0-387-97231-5] Proceedings 
of a workshop on coding theory and design theory 
held May 1-June 10, 1988 during the year in applied 
combinatorics at the IMA, Minneapolis, Minnesota. 
This volume concentrates on design theory. LC 


Discrete Mathematics, S, P, L. Ramsey Theory, 
Second Edition. Ronald L. Graham, Bruce L. Roth- 
schild, Joel H. Spencer. Wiley, 1990, xi + 196 pp, 
$49.95. (ISBN: 0-471-50046-1] The definitive work 
on Ramsey Theory, this edition gives a complete 
treatment of Saharon Shelah’s noninductive proof of 
van der Waerden’s Theorem. Also, new material on 
several more detailed areas, including Graph Ram- 
sey Theory and Euclidean Ramsey Theory. The final 
chapter relates Ramsey Theory to areas other than 
discrete mathematics (e.g., the unprovability results 
of Paris and Harrington). (First Edition, TR, March 
1981.) LCL 


Discrete Mathematics, S, L. Implementing Dis- 
crete Mathematics. Steven Skiena. Addison-Wesley, 
1990, x + 334 pp, $43.25. (ISBN: 0-201-50943- 
1] Reference on combinatorial algorithms and a lab 
manual for experimentation in discrete mathematics. 
Contains over 200 functions written in Mathemat- 
ica. These functions include algorithms for generat- 
ing permutations, partitions, Young Tableaux , and 
creating a variety of graph embeddings. Exercises 
make use of functions described in the book. Useful 
as supplement for a combinatorics or graph theory 
course or as an independent study course. (Does not 
develop the theory behind the functions described.) 
LC 


Number Theory, T(18), S**, P***, L**. Ele- 
mentary and Analytic Theory of Algebraic Numbers, 
Second Edition. Wiladystaw Narkiewicz. Springer- 
Verlag, 1990, xii + 746 pp, $89.50. [ISBN: 0-387- 
51250-0] Updated edition (First Edition, TR, June- 
July 1974) of an important resource. Intended to 
“reconcile the two approaches” to algebraic number 
theory (the classical Dedekind ideal theory and lo- 
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cal methods). Each chapter ends with exercises and 
extensive notes for further reading. Appendix in- 
cludes updated “unsolved problems.” Besides up- 
dating most chapters to account for recent results, 
notes at the end of the chapter were rewritten to in- 
clude literature through 1983, as has the 166(!) page 
bibliography. Assumes knowledge of topology and 
algebra (including some Galois theory). A nice ref- 
erence for expert and nonexpert alike. SB 


Number Theory, T(17-18), S, P**, L**. Divi- 
sor Theory. Harold M. Edwards. Birkhauser Boston, 
1990, xiv + 166 pp, $29.50. [ISBN: 0-8176-3448-7] 
Introduction to Kronecker’s Theory of Divisors, a 
precursor to Dedekind’s Ideal Theory (indeed, divi- 
sors and ideals coincide in algebraic number fields). 
Covers the general theory (including some material 
not treated by Kronecker), and applications to alge- 
braic number theory and algebraic curves (appendix 
contains proof of Riemann-Roch theorem for curves). 
Typeset with TpX using large type, making it even 
more enjoyable to read. No exercises. SB 

Number Theory, T(18), S, P, L. Hilbert Modular 
Forms. Eberhard Freitag. Springer-Verlag, 1990, viii 
+ 250 pp, $49. (ISBN: 0-387-50586-5] Provides “a 
description of the singular cohomology of the Hilbert 
modular group and its Hodge decomposition includ- 
ing explicit formulae.” Each section begins with a 
brief abstract, adding to its readability. Assumes 
knowledge of basic complex analysis and algebra; the 
appendix covers other preliminaries. SB 


Number Theory, S(16-18), P, L. Number Theory 
and Cryptography. Ed: J.H. Loxton. London Math. 
Soc. Lect. Note Ser., V. 154. Cambridge University 
Pr, 1990, xi + 235 pp, $27.95 (P). (ISBN: 0-521- 
39877-0] Twenty-two papers on number-theoretic 
aspects of cryptography, including a lengthy chap- 
ter on sieves. BC 


Number Theory. Elementary Theory of Num- 
bers. William J. LeVeque. Dover, 1990, viii + 
132 pp, $4.95 (P). (ISBN: 0-486-66348-5] A reprint 
of the 1962 edition. Intended as a text for fresh- 
men or sophomore seminars and for teacher refresher 
courses. Covers Euclidean algorithms, congruences, 
continued fractions, Gaussian integers, and Diophan- 
tine equations. Contains relatively few exercises. SG 


Number Theory, T(18), P. A Classical Intro- 
duction to Modern Number Theory, Second Edition. 
Kenneth Ireland, Michael Rosen. Grad. Texts in 
Math., V. 84. Springer-Verlag, 1990, xiv + 389 pp, 
$49.95. [ISBN: 0-387-97329-X] Adds two chapters to 
the 1981 edition (TR, January 1983; Extended Re- 
view, May 1984), one containing an elementary proof 
of the Mordell-Weil Theorem, the other chronicling 
the huge progress made in arithmetic geometry in 
the ’80s. The rest of the book remains essentially 
the same—a wonderful introduction to elementary 
number theory, reciprocity laws, zeta functions of va- 
rieties, number fields, L-functions, diophantine equa- 
tions, and elliptic curves. SG 


Number Theory, S(17). Abstract Analytic Num- 
ber Theory. John Knopfmacher. Dover, 1990, xi + 
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336 pp, $9.95 (P). (ISBN: 0-486-66344-2] A reprint 
with additional bibliography of the 1975 North- 
Holland edition (TR, May 1976). The focus of the 
book is on analytic properties of multiplicative semi- 
groups that arise in number theory. SG 


Linear Algebra, S(16-18), P*, L. Matriz Theory 
and Applications. Ed: Charles R. Johnson. Proc. 
of Symposia in Appl. Math., V. 40. AMS, 1990, x 
+ 260 pp, $57. [ISBN: 0-8218-0154-6] Based on 
the AMS Short Course given at the January 1989 
Phoenix meeting. The purpose: to present a sam- 
ple of the ways in which modern matrix theory is 
stimulated by its interplay with other subjects. The 
“other subjects” represented in these seven papers 
are combinatorics, probability theory, statistics, op- 
erator theory and control theory, algebraic coding 
theory, partial differential equations, and analytic 
function theory. LCL 


Linear Algebra, T(17-18: 2), S, P. Angewandte 
Lineare Algebra. Bertram Huppert. Walter de 
Gruyter, 1990, viii + 646 pp, DM 198. [ISBN: 3-11- 
012107-7] Assumes considerable knowledge of lin- 
ear algebra. Chapters on finite-dimensional Hilbert 
spaces, linear differential and difference equations, 
non-negative matrices, geometric algebra, and spe- 
cial relativity. Many problems. JD-B 


Linear Algebra, T(15-17: 1), S, L. Abstract 
Linear Algebra. Morton L. Curtis. Universitext. 
Springer-Verlag, 1990, x + 168 pp, $23 (P). [ISBN: 
0-387-97263-3] A rather sophisticated introduction, 
“directed to students with mathematical leanings,” 
covering the basic results of linear algebra and cul- 
minating with the Theorem of Hurwitz (that the only 
normed algebras over the real numbers are the real 
numbers, the complex numbers, the quaternions, and 
the octonions). The emphasis is on the beauty of the 
mathematics (e.g., noncommutative algebras form 
the setting for proving the Cayley-Hamilton Theo- 
rem, exterior algebras are used in defining determi- 
nants), and there are no applications. LCL 


Group Theory, T(18: 1), S, P. Oligomorphic Per- 
mutation Groups. Peter J. Cameron. London Math. 
Soc. Lect. Note Ser., V. 152. Cambridge University 
Pr, 1990, viii + 160 pp, $22.95 (P). [ISBN: 0-521- 
38836-8] Based on notes from a 1988 symposium 
on Model Theory and Groups; presents recent work 
on special types of infinite permutation groups. Ref- 
erence, index. JS 


Group Theory, T(18: 1), S, P, L. Representa- 
tions and Characters of Finite Groups. M.J. Collins. 
Stud. in Adv. Math., V. 22. Cambridge Univer- 
sity Pr, 1990, xii + 242 pp, $59.50. (ISBN: 0-521- 
23440-9] Largely self-contained (except for some 
block theory); accessible to advanced undergradu- 
ates. Two foundation chapters are followed by treat- 
ment of Suzuki’s theory of exceptional characters, 
Feit’s theory of coherence, Brauer’s characterization, 
and Reynolds and Robinson on isometries. Ap- 
pendix, exercises, indexes, bibliography. JS 

Group Theory, S(18), P. Induced Modules over 
Group Algebras. Gregory Karpilovsky. Math. Stud., 
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V. 161. Elsevier Science, 1990, xi + 520 pp, $120.50. 
[ISBN: 0-444-88414-9] Intended to present a com- 
prehensive look at the current state of the theory for 
induced modules, including the substance of a num- 
ber of recent developments previously available only 
in research articles. Includes chapters on projective 
summaries, Green theory, restrictions, permutation 
modules, and permutation lattices. Bibliography, in- 
dex. JS 


Algebra, S(18), P. Lecture Notes in Mathematics- 
1430: Commutative Algebra. Eds: W. Bruns, A. 
Simis. Springer-Verlag, 1990, 160 pp, $18 (P). 
(ISBN: 0-387-52745-1] Eight papers containing the 
proceedings of the Workshop in Commutative Alge- 
bra, Salvador, Brazil, 1988. JS 


Algebra, T(15-16: 2), S. Modern Algebra: Two 
Volumes Bound as One. Seth Warner. Dover, 1990, 
xi + 818 pp, $16.95 (P). [ISBN: 0-486-66341-8] An 
unabridged, corrected, one-volume version of the 
original 1965 two-volume work. JS 


Algebra, T(18: 1), S, P. Almost Free Modules: 
Set-theoretic Methods. Paul C. Eklof, Alan H. Mek- 
ler. Math. Lib., V. 46. North-Holland (US Distr: 
Elsevier Science), 1990, xvi + 481 pp, $115.50. 
(ISBN: 0-444-88502-1] A blending of set-theoretic 
techniques and algebraic methods applied to four 
problems concerning almost free modules, the struc- 
ture of Ext, the structure of Hom, and endomor- 
phism rings of groups. Exercises, open problems, 
bibliography, index. JS 


Algebra, T(17: 1), L. Linear Representations of 
Groups. Ernest B. Vinberg. Transl: A. Iacob. Ser. 
of Adv. Textbooks in Math., V. 2. Birkhauser 
Boston, 1989, vi + 146 pp, $29.50. [ISBN: 0-8176- 
2288-8] Fundamentals of linear representation the- 
ory, mainly of finite and compact groups, and the 
elements of Lie groups. Includes a section on the 
Laplace spherical functions and the representations 
of SU2 and SO3. Contains numerous examples, ex- 
ercises, and solutions. LC 


Algebra, T(15-17: 2), S, L. Undergraduate Alge- 
bra, Second Edition. Serge Lang. UTM. Springer- 
Verlag, 1990, xi + 367 pp, $39.80. [ISBN: 0-387- 
97279-X] This edition includes the Sylow theorems 
and additional material on symmetric polynomials, 
principal rings, Jordan normal form, and field the- 
ory. There are several new exercises, and more than 
half a dozen genuine research questions that can be 
formulated and motivated within the language of this 
course. (First Edition, TR, June-July 1987.) LCL 


Algebra, P. Lecture Notes in Mathematics-1425: 
Groups of Self-Equivalences and Related Topics. Ed: 
R.A. Piccinini. Springer-Verlag, 1990, 214 pp, $21.80 
(P). (ISBN: 0-387-52658-7] The proceedings of a 
conference held at the University of Montreal in 
1988. In addition to the research articles, there 
are two special articles: a survey of groups of self- 
equivalences, and a paper presenting seventeen open 
problems. LCL 


Algebra, T(18). Reflection Groups and Cozeter 
Groups. James E. Humphreys. Stud. in Adv. Math., 
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V. 29. Cambridge University Pr, 1990, xii + 204 
pp, $39.50. [ISBN: 0-521-37510-X] Basic theory of 
Coxeter groups. First part deals concretely with 
the finite reflection groups and the affine Weyl 
groups. Includes classification of associated Cox- 
eter graphs, and polynomial invariants of finite re- 
flection groups. Part two develops the general prop- 
erties of Coxeter groups, including Bruhat ordering. 
Work of Kaghadan and Lusztig on representations 
of Hecke algebras associated with Coxeter groups is 
discussed. LC 


Algebra, T(17), L. Notes on Lie Algebras. Hans 
Samelson. Universitext. Springer-Verlag, 1990, xii + 
162 pp, $29.80 (P). [ISBN: 0-387-97264-1] Revision 
of 1969 edition (TR, April 1970). Errors corrected, 
some text rewritten. New material includes the for- 
mulas of Freudenthal and Klimyk for the multiplici- 
ties of weights, Brauer’s algorithm for the splitting of 
tensor products, and the Bose-Patera proof of which 
representations can be brought into orthogonal or 
symplectic form. LC 


Algebra, S(18), P. Symmetric and G-algebras 
With Applications to Group Representations. Gre- 
gory Karpilovsky. Math. & Its Applic., V. 60. 
Kluwer Academic, 1990, xvi + 368 pp, $99. [ISBN: 0- 
7923-0761-5] Theory of symmetric and G-algebras 
with applications to modular group representation 
theory and block theory of group algebras. Essen- 
tially self-contained. LC 


Algebra, P, L. Nathan Jacobson: Collected Math- 
ematical Papers, Volumes 1-3. Nathan Jacobson. 
Birkhauser Boston, 1989, $225 set. Volume 1 (1934- 
1946), xviii + 454 pp, [ISBN: 0-8176-3410-X]; Vol- 
ume 2 (1947-1965), xviii + 556 pp, (ISBN: 0-8176- 
3411-8]; Volume 3 (1965-1988), xviii + 596 pp. 
[ISBN: 0-8176-3446-0] Complete collection of Ja- 
cobson’s papers, both research and expository. Each 
volume begins with an autobiographical account of 
the specified period. KS 


Algebra, P. The Orbit Method in Representation 
Theory. M. Duflo, N.V. Pedersen, M. Vergne. 
Progress in Math., V. 82. Birkhauser Boston, 1990, 
x + 227 pp, $39. [ISBN: 0-8176-3474-6] Ten pa- 
pers from the proceedings of a conference held at the 
University of Copenhagen in 1988. The orbit method 
plays a major role in the representation theory of Lie 
groups and Lie algebras. LCL 


Algebra, S(16-18), P, L. Numbers. H.-D. Ebbing- 
haus, et al. Transl. and Ed.: H.L.S. Orde, J.H. Ew- 
ing. Grad. Texts in Math., V. 123. Springer-Verlag, 
1990, xviii + 393 pp, $59. [ISBN: 0-387-97202-1] A 
must for any serious student of mathematics. Num- 
bers and number systems—natural, integral, real, 
complex, p-adic, quaternion, Cayley, nonstandard, 
and Conway—are described in detail. Many major 
theorems (the Fundamental Theorem of Algebra, the 
Gelfand-Mazur Theorem, the transcendence of pi) 
are proved, and historical notes are sprinkled thickly 
throughout. SG 

Algebra, T*(15-16), S*, L*. 
Joseph Rotman. Universitext. 


Galois Theory. 
Springer-Verlag, 
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1990, xii + 108 pp, $29 (P). [ISBN: 0-387-97305-2] 
A considered and cohesive introduction to the main 
results of Galois theory: the cubic and quartic for- 
mulas, the fundamental theorem, the insolvability of 
the quintic, and the “Great Theorem” that solvabil- 
ity of a polynomial by radicals is equivalent to the 
solvability of its Galois group. The exposition builds 
on an introductory knowledge of linear and abstract 
algebra, leaving advanced topics in field theory out 
altogether if not directly needed. Especially suitable 
for a senior or first-year graduate seminar or for self- 
study. LCL 


Calculus, T(13), L. Precalculus: Functions and 
Graphs, Second Edition. Bernard Kolman, Arnold 
Shapiro. Harcourt Brace Jovanovich, 1990, xvi + 645 
pp, $31. (ISBN: 0-15-571063-X] This edition adds 
chapters on maxima and minima and root finding. 
(First Edition, TR, January 1985.) OJ 


Calculus, P, L. Priming the Calculus Pump: 
Innovations and Resources. Ed: Thomas W. 
Tucker. MAA Notes No. 17. MAA, 1990, 321 pp, 
$20 (P). (ISBN: 0-88385-067-2] In-depth report on 
twelve innovative calculus projects—complete with 
samples of problem sets, tests, and student work— 
together with extensive collateral information (over 
60 abstracts of other projects, list of NSF awards, 
database of software for college mathematics) on the 
calculus reform movement. Third in MAA’s cal- 
culus reform series, following Towards a Lean and 
Lively Calculus (1986) and Calculus for a New Cen- 
tury (1987), this volume provides a wealth of detail 
from real classrooms with real students. You will 
be amazed at what calculus students can accomplish 
with the right tools. LAS 


Calculus, S*(13), P. Calculus Activities for 
Graphic Calculators. Dennis Pence. PWS-Kent, 
1990, x + 266 pp, $15 (P). [ISBN: 0-534-92431-X] 
Clear and concise instructions and examples (includ- 
ing display output) for Casio, Sharp, and HP graphic 
scientific calculators; Chapters 1-4 introduce basic 
calculator operations and graphing; remaining chap- 
ters illustrate graphical and numerical approaches to 
first-year calculus topics. Exercises. JNC 

Real Analysis, T(16-17: 1, 2), L. Analysis on 
Manifolds. James R. Munkres. Addison-Wesley, 
1991, xi + 366 pp, $48.50. [ISBN: 0-201-51035-9] 
Orderly, clearly written presentation of the theory 
of derivatives and the Riemann integral for multi- 
variable functions, and of the theory of integration 
of forms on manifolds in Euclidean space. Like the 
author’s other texts, this book should be a good one 
from which to learn a body of standard material. JO 


Complex Analysis, T(15-16), L. Complez Vart- 
ables for Scientists and Engineers, Second Edition. 
John D. Paliouras, Douglas S. Meadows. Macmillan, 
1990, xiii + 586 pp. (ISBN: 0-02-390561-1] Fairly 
standard, applications-oriented text appropriate for 
a first course aimed at engineering majors. Many 
examples illustrating theorems and many exercises 
of varying degrees of difficulty. One complaint: it 
takes over 200 pages to get to the Cauchy integral 
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formula, and almost 250 pages to get to series. (First 
Edition, TR, November 1975.) MPR 


Complex Analysis, P. Lecture Notes in Mathemat- 
ics-1422: Complez Geometry and Analysis. Ed: V. 
Villani. Springer-Verlag, 1990, v + 109 pp, $14.70 
(P). [ISBN: 0-387-52434-7] Proceedings from a 
symposium held in Pisa during May 1988. Ten pa- 
pers intended to survey current research in complex 
geometry, including essays on hyperkahler and C'R- 
manifolds and complex vector bundles. JO 


Partial Differential Equations, P. Lattice Gas 
Methods for Partial Differential Equations. Eds: 
Gary D. Doolen, et al. Addison-Wesley, 1990, xix 
+ 554 pp, $47.50; $26.95 (P). [ISBN: 0-201-15679- 
2] Twenty-nine papers on lattice gas methods, an 
up-and-coming numerical approach to solving par- 
tial differential equations, especially for hydrody- 
namics. BC 


Partial Differential Equations, P. Solitons in 
Physics, Mathematics, and Nonlinear Optics. Eds: 
Peter J. Olver, David H. Sattinger. IMA, V. 25. 
Springer-Verlag, 1990, xiii + 215 pp, $29. [ISBN: 
0-387-97309-5] Solitons, or solitary waves, are so- 
lutions of certain partial differential equations that 
have many surprising, particle-like properties. This 
book contains eleven papers from a 1988 workshop 
at the Institute for Mathematics and Its Applica- 
tions. BC 


Numerical Analysis, T(16-17: 1), P. Eléments 
D’Analyse Numérique. Marc Atteia, Michel Pradel. 
Cepadues-Editions, 1990, viii + 154 pp, 90F (P). 
[ISBN: 285428-229-9] Concise, demanding intro- 
ductory text. Treats polynomial approximation and 
interpolation, initial-value problems, linear systems. 
Problems. JD-B 


Numerical Analysis, T(15). Numerical Analy- 
sts, An Introduction. Lars Eldén, Linde Wittmeyer- 
Koch. Academic Pr, 1990, x + 347 pp, $39.95. 
[ISBN: 0-12-236430-9] Intended as a text for a stan- 
dard undergraduate course, this is a revision of a 
book originally published in Swedish. It is written 
from the viewpoint that students will best under- 
stand algorithms if they have worked through them 
using a hand-held calculator, and exercises and ex- 
amples are constructed accordingly. AWR 
Functional Analysis, T(17-18: 2, 3), S, L. 
A Course in Functional Analysis, Second Edition. 
John B. Conway. Grad. Texts in Math., V. 96. 
Springer-Verlag, 1990, xvi + 399 pp, $49.50. (ISBN: 
0-387-97245-5] Most significant change in this edi- 
tion is that the chapter on Fredholm Theory has been 
completely rewritten and simplified. Bibliograph- 
ical notes and (more) exercises have been added. 
(First Edition, TR, December 1985; Extended Re- 
view, November 1986.) KS 

Functional Analysis, T(18: 2), P. Volterra Inte- 
graland Functional Equations. G. Gripenberg, S.-O. 
Londen, O. Staffans. Encyclop. of Math. & Its Ap- 
plic., V. 34. Cambridge University Pr, 1990, xxii 
+ 701 pp, $99.50. [ISBN: 0-521-37289-5] A new 
book intended as a text (there are problem sets) 
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and as a book for researchers in the field (there are 
extensive references, and an effort has been made 
to bring the reader up to current research). Ef- 
forts have been made to cover aspects of analysis 
needed but not easily found elsewhere (the reader 
is assumed to be familiar with the contents of, say, 
Rudin’s books), and care is given to including an 
extensive table of symbols and their meaning; it is 
nicely produced. The three parts cover linear the- 
ory, quasi-linear and existence problems for nonlinear 
equations, and frequency domain methods in nonlin- 
ear equations. AWR 


Functional Analysis, P. Lecture Notes in Mathe- 
matics-1421: A Nonlinear Theory of Generalized 
Functions. Hebe A. Biagioni. Springer-Verlag, 1990, 
xii + 214 pp, $21.80 (P). [ISBN: 0-387-52408-8] 
Presents a theory (due to J.F. Colombeau) for in- 
corporating distributions into a differential algebra, 
which Laurent Schwartz proved impossible in 1954. 
(Schwartz wasn’t wrong; Colombeau weakens cer- 
tain properties.) Includes applications to numeri- 
cal methods in fluid dynamics, elastoplasticity, and 
acoustics. BC 


Functional Analysis, P. Uniform Frechet Alge- 
bras. Helmut Goldmann. Math. Stud., V. 162. Else- 
vier Science, 1990, viii + 355 pp, $102.50. [ISBN: 0- 
444-88488-2] A study of algebras of complex-valued 
continuous functions on hemicompact spaces which 
are complete in the compact open topology. Com- 
pares with the theory of uniform (Banach) alge- 
bras. MLR 


Functional Analysis, T(16-17: 1-3), L. Introduc- 
tion to Hilbert Spaces with Applications. Lokenath 
Debnath, Piotr Mikusinski. Academic Pr, 1990, xiv 
+ 509 pp, $49.95. [ISBN: 0-12-208435-7] First half 
of this friendly text contains basics of Hilbert space 
theory and of linear operators on Hilbert spaces, in- 
cluding simple introduction (without concept of mea- 
sure) to Lebesgue integration. Later, Hilbert space 
formalism is used to develop foundations of quantum 
mechanics, and many applications of Hilbert space 
methods are given. Contains wide selection of exam- 
ples and exercises. KS 


Functional Analysis, T(16-17: 2, 3), S, L. 
Functional Analysis. Frigyes Riesz, Béla Sz.-Nagy. 
Transl: Leo F. Boron. Dover, 1990, xii + 504 
pp, $11.95 (P). [ISBN: 0-486-66289-6] Paperback 
reprint of classic text. Part One takes up differen- 
tiation, the Lebesgue integral, and the Stieltjes in- 
tegral and its generalizations. Part Two addresses 
integral equations, Hilbert and Banach spaces, trans- 
formations of Hilbert space, self-adjoint transforma- 
tions, groups and semigroups of transformations, and 
spectral theories for linear transformations of general 
type. KS 


Analysis, P, L. Basic Hypergeometric Series. 
George Gasper, Mizan Rahman. Encyclop. of Math. 
& Its Applic., V. 35. Cambridge University Pr, 1990, 
xx + 287 pp, $59.50. (ISBN: 0-521-35049-2] Valu- 
able text and reference book on the significant re- 
sults of basic hypergeometric series from the past 
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200 years. Chapter topics include introduction of 
definitions and notations of hypergeometric and ba- 
sic hypergeometric series and the early work of Heine, 
Jackson, and Bailey; summation, transformation and 
expansion formulas; basic contour integrals; bibasic 
series; the Askey-Wilson g-beta integral and appli- 
cations to orthogonal polynomials. Prerequisites are 
a good background in analysis. Exercises provide 
additional formulas and practice with proof tech- 
niques. LC 


Analysis, T(17-18: 2), P, L. Chebyshev Polyno- 
mials: From Approzimation Theory to Algebra and 
Number Theory, Second Edition. Theodore J. Rivlin. 
Wiley, 1990, xiii + 249 pp, $49.95. [ISBN: 0-471- 
62896-4] Aims to give a survey of important prop- 
erties of the Chebyshev polynomials and introduce 
some interesting areas of mathematics (e.g., interpo- 
lation theory, numerical integration, ergodic theory) 
where the Chebyshev polynomials serve as an exam- 
ple. Revisions include new applications, a new chap- 
ter on algebraic and number theoretic properties of 
the Chebyshev polynomials, and new exercises (First 
Edition, TR, May 1975). LC 

Analysis, P. Asymptotic and Computational Analy- 
sis: Conference in Honor of Frank W.J. Olver’s 65th 
Birthday. Ed: R. Wong. Lect. Notes in Pure & 
Appl. Math., V. 124. Marcel Dekker, 1990, xii + 
755 pp, $115 (P). [ISBN: 0-8247-8347-6] Proceed- 
ings of the International Symposium on Asymptotic 
and Computational Analysis held June 1989 in Win- 
nipeg, Canada. LC 


Analysis, T(17-18), P. Unitary Representations 
and Harmonic Analysis: An Introduction, Second 
Edition. Mitsuo Sugiura. Math. Lib., V. 44. El- 
sevier Science, 1990, xv + 452 pp, $112.75. (ISBN: 
0-444-88593-5] Introduction to harmonic analysis 
and theory of unitary representations of (Type I) Lie 
groups. This edition includes a section on irreducible 
unitary representations of SZ(2,R). Other changes 
include a handful of rewritten proofs and a bibliogra- 
phy tripled in length. (First Edition, TR, May 1976.) 
KS 


Analysis, P, L. Iterative Functional Equations. 
Marek Kuczma, Bogdan Choczewski, Roman Ger. 
Encyclop. of Math. & Its Applic., V. 32. Cambridge 
University Pr, 1990, xix + 552 pp, $99.50. (ISBN: 0- 
521-35561-3] Topics covered include systems of lin- 
ear and nonlinear equations of finite and infinite or- 
der in various function classes, conjugate and com- 
mutable functions, linearization, iterative roots of 
functions, and special functional equations. Many 
results have appeared before only in research litera- 
ture. Contains an extensive bibliography. KS 

Analysis, S(18), P. Quasiconformal Mappings and 
Sobolev Spaces. V.M. Gol’dshtein, Yu. G. Reshet- 
nyak. Math. & Its Applic., V. 54. Kluwer Academic, 
1990, xix + 371 pp, $133. [ISBN: 0-7923-0543-4] A 
relatively self-contained introduction to basic meth- 
ods and recent results on Sobolev spaces, various ge- 
ometric classes of mappings (including quasiconfor- 
mal and quasiisometric mappings), and nonlinear ca- 
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pacity. The first three chapters comprise a readable 
(occasional quirks of translation aside), inviting, and 
fairly leisurely introduction to modern analysis: inte-: 
gration, differentiation, distribution theory, Sobolev 
spaces, and other function spaces. Note price. PZ 


Analysis, S, P, L*. Problems in Mathematical 
Analysis. Piotr Biler, Alfred Witkowski. Pure & 
Appl. Math., V. 132. Marcel Dekker, 1990, v + 227 
pp, $49.75. (ISBN: 0-8247-8312-3] More than 1300 
problems in analysis, most of them neither standard 
nor easy (at the level of the Monthly Problems Sec- 
tion and beyond), covering the usual topics in analy- 
sis and including measure and integration, functional 
analysis, Fourier analysis, functions of several vari- 
ables, analytic functions. The answers and hints are 
extremely laconic; many are references to the prob- 
lems literature (especially journals). LCL 


Analysis, S(17-18), P. Operator Theory, Operator 
Algebras and Applications. Eds: William B. Arve- 
son, Ronald G. Douglas. Proc. of Sympos. in Pure 
Math., V. 51. AMS, 1990, $174 set [ISBN: 0-8218- 
1486-9]. Part 1, xii + 640 pp; Part 2, xii + 385 
pp. Proceedings of a 1988 Summer Institute held in 
Durham, New Hampshire. KS 


Algebraic Geometry, P. Lectures on the Mordelil- 
Weil Theorem. Jean-Pierre Serre. Transl: Mar- 
tin Brown, Michel Waldschmidt. Aspects of Math., 
V. E15. Friedr Vieweg (US Distr: GLP Intern. 
Vieweg), 1990, x + 218 pp. [ISBN: 3-528-18968-1] 
Notes from a pre-Faltings course. Topics include the 
Mordell-Weil Theorem, Mordell’s conjecture, inte- 
gral points on curves via Siegel’s method and Baker’s 
method, Hilbert’s irreducibility theorem, construc- 
tion of Galois extension, and large sieves. SG 


Differential Geometry, P. Symplectic Geometry. 
A.T. Fomenko. Trans: R.S. Wadhwa. Adv. Stud. in 
Contemp. Math., V. 5. Gordon & Breach, 1988, xiii 
+ 387 pp, $180. (ISBN: 2-88124-657-5] Basic the- 
ory and some recent results related to the structure 
of symplectic manifolds, the embedding of symplectic 
and Lagrangian manifolds, and the complete integra- 
bility of Hamiltonian systems. Too compact to make 
a good textbook, but contains some nice reference 
sections, including tight over views of matrix groups, 
symplectic structures on manifolds, and Hamiltonian 
systems with symmetries. Note the price. JO 


Differential Geometry, T(18: 2), S, P, L. Lie 
Groups and Algebraic Groups. A.L. Onishchik, E.B. 
Vinberg. Transl: D.A. Leites. Springer-Verlag, 1990, 
xix + 328 pp, $79. [ISBN: 0-387-50614-4] “The 
only book where the theory of semisimple Lie groups 
is based systematically on the technique of algebraic 
groups.” Theory is developed as a sequence of prob- 
lems (with hints), leading eventually to classification 
of real and complex semisimple Lie groups and Levi- 
Malcev theorems for Lie and algebraic groups. Ta- 
bles, exercises, bibliography, index. JS 

Geometry, T, S*(13-15), P, L*. Connections: 
The Geometric Bridge between Art and Science. Jay 
Kappraff. McGraw-Hill, 1991, xxi + 471 pp, $19.50 
(P); $39.50. [ISBN: 0-07-034251-2; 0-07-034250-4] 
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A spectacular presentation of design science—“the 
grammar of space”—that explores with rich details 
instances of similarity, proportion, tilings, graphs, 
lattices, polyhedra, isometries, and symmetry in art, 
architecture, engineering, and science. Punctuated 
with exercises and problems (thus making the mono- 
graph useful as a course or seminar text); illus- 
trated with over 200 figures; supported by an ex- 
tensive multi-disciplinary bibliography that is well- 
referenced to the text. A superb option for interdis- 
ciplinary seminars. LAS 


Topology, P. General Topology and Applications: 
Proceedings of the 1988 Northeast Conference. Ed: 
R.M. Shortt. Lect. Notes in Pure & Appl. Math., 
V. 123. Marcel Dekker, 1990, xiv + 291 pp, $99.75 
(P). [ISBN: 0-8247-8349-2] More than twenty pa- 
pers covering a wide range of research topics repre- 
senting the topological interests of Melvin Henriksen, 
for whom this conference was organized, on the oc- 
casion of his sixtieth birthday. LCL 


Dynamical Systems, P, L*. Dynamical Chaos. 
Eds: M.V. Berry, I.C. Percival, N.O. Weiss. Prince- 
ton University Pr, 1987, 199 pp, $19.50 (P). [ISBN: 
0-691-02423-5] Proceedings of a 1987 discussion 
meeting of the Royal Society of London, reprinted 
from their Proceedings. Introduced by E.C. Zeeman; 
contains survey papers with various applications, in- 
cluding cardiac arrhythmias, biological populations, 
convection, solar system, Hamiltonian systems, and 
renormalization. LAS 


Dynamical Systems, T(18: 2), L. Introduction 
to Applied Nonlinear Dynamical Systems and Chaos. 
Stephen Wiggins. Texts in Appl. Math., V. 2. 
Springer-Verlag, 1990, xiv + 672 pp, $49.95. [ISBN: 
0-387-97003-7] Intended as a graduate text with 
sufficient completeness to prepare students for re- 
search in the field. An informal expositional style and 
constant attention to the development of the reader’s 
intuition make this a most readable text. JO 


Dynamical Systems, S(17-18), P. Hamiltonian 
Systems: Chaos and Quantization. Alfredo M. Ozo- 
rio de Almeida. Mono. on Math. Physics. Cam- 
bridge University Pr, 1990, ix + 238 pp, $24.95 (P). 
(ISBN: 0-521-34531-6] A physics-oriented introduc- 
tion to chaos in Hamiltonian systems, focusing on 
neighborhoods of periodic orbits, and implications 
for quantum dynamics. (1988 hardcover text, TR, 
November 1989.) BC 


Dynamical Systems, P. Geometry in the Neigh- 
borhood of Invariant Manifolds of Maps and Flows 
and Linearization. U. Kirchgraber, K.J. Palmer. Pit- 
man Res. Notes in Math. Ser., V. 233. Longman Sci- 
entific & Technical (US Distr: Wiley), 1990, 89 pp, 
$32 (P). [ISBN: 0-470-21657-3] Presents detailed 
proofs of generalized Grobman-Hartman lineariza- 
tion theorems for discrete and continuous dynamical 
systems. Studies the geometric properties of invari- 
ant manifolds and foliations. OJ 

Control Theory, P. Signal Processing, Part 
II: Control Theory and Applications. Ed: F.A. 
Grinbaum, J.W. Helton, P. Khargonekar. IMA, V. 
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23. Springer-Verlag, 1990, xiii + 411 pp, $39. (ISBN: 
0-387-97230-7] ‘Twenty papers from a program at 
the Institute for Mathematics and Its Applications. 
Applications include tomography, x-ray problems, 
and speech recognition. BC 


Control Theory, P. Lecture Notes in Control and 
Information Sciences-140: Singularly Perturbed and 
Weakly Coupled Linear Control Systems: A Recur- 
sive Approach. Z. Gaji¢é, D. Petkovski, X. Shen. 
Springer-Verlag, 1990, vii + 202 pp, $32.90 (P). 
(ISBN: 0-387-52333-2] Singularly perturbed and 
weakly coupled linear systems of ordinary differential 
equations are traditionally studied via non-recursive, 
and hence computationally expensive, power series 
methods. Here the authors develop recursive algo- 
rithms for solving the associated Lyapunov-Ricatti 
equations. Most chapters include applications to 
real-world case studies. PZ 


Probability, T(15-17: 2), L. The Theory of Prob- 
ability and the Elements of Statistics. B.V. Gne- 
denko. Transl: B.D. Seckler. Chelsea, 1989, 529 
pp, $23.95. [ISBN: 0-8284-1132-9] Fifth Edition 
of a classic, mathematical probability text; includes 
a new chapter on mathematical statistics with em- 
pirical distribution functions and Glivenko-Cantelli, 
Kolmogorov-Smirnov theorems. Testing and estima- 
tion are dealt with cursorily. Remainder of book cov- 
ers advanced concepts in probability: CLTs, LLNs, 
characteristic functions, infinite divisibility, stochas- 
tic processes, etc. No measure theory is required, 
although some is discussed. (1967 Chelsea edition, 
TR, February 1968; 1976 MIR edition, TR, August- 
September 1976; 1978 MIR edition, TR, November 
1981.) MK 


Stochastic Processes, T(17: 1), P, L. Mathe- 
matical Methods for Construction of Queueing Mod- 
els. V.V. Kalashnikov, S.T. Rachev. Oper. Res. Ser. 
Wadsworth, 1990, xvi + 431 pp, $65.95. [ISBN: 0- 
534-132545] This work is concerned with modelling 
extant queueing systems. While some consideration 
is given to the theory of the usual well-known queue- 
ing models, the emphasis here is on matching dis- 
tribution parameters to empirical data, choosing a 
class of model appropriate to the real-life system, 
and determining the effects of the inevitable mathe- 
matical simplifications and approximations found in 
any mathematical model. No exercises. SM 


Stochastic Processes, T(18). Stochastic Pro- 
cesses with Applications. Rabi N. Bhattacharya, Ed- 
ward C. Waymire. Ser. in Prob. & Math. Stat. Wi- 
ley, 1990, xiii + 672 pp, $69.96. [ISBN: 0-471-84272- 
9] Textbook designed for science and engineering 
(and mathematics) graduate students. Presentation 
is technically simple with emphasis on computation 
and examples. Applications from physics, computer 
science, economics, and engineering. MLR 


Stochastic Processes, P. Stochastic Equations and 
Differential Geometry. Ya. I. Belopolskaya, Yu. L. 
Dalecky. Math. & Its Applic., V. 30. Kluwer Aca- 
demic, 1990, xv + 260 pp, $99. [ISBN: 90-277-2807- 
0] Presents theory of stochastic equations in Ba- 


82 


nach space and on manifolds, leading to discussions 
of Kolmogorov equations and diffusion processes on 
Lie groups and principal fibre bundles. JO 
Elementary Statistics, T(13-16: 1), S, L*. 
Statistics for Lawyers. Michael O. Finkelstein, Bruce 
Levin. Texts in Stat. Springer-Verlag, 1990, xxii + 
608 pp, $39 (P). [ISBN: 0-387-97136-X] A series of 
several dozen case studies (e.g., Dalkon shield, au- 
tomobile emissions, death penalty, draft lotteries) 
arranged by relevant statistical methodology (e.g., 
descriptive statistics, counting, statistical inference, 
sampling, regression), interspersed with brief syn- 
opses of the relevant theoretical issues and commen- 
tary on actual court decisions. Numerous exercises 
require both calculation (full solutions at the end of 
the volume) and interpretation. Of interest both to 
future lawyers and to future statisticians; excellent 
source of “relevant” examples. LAS 


Computational Statistics, P, L. Advances in Sta- 
tistical Methods for Genetic Improvement of Live- 
stock. Eds: Daniel Gianola, Keith Hammond. Adv. 
Ser. in Agric. Sci., V. 18. Springer-Verlag, 1990, xx + 
534 pp, $79. (ISBN: 0-387-50809-0] Edited version 
of an international symposium which took place in 
Armidale, Australia, February 16-20, 1987 concern- 
ing recent developments in statistics and comput- 
ing and their application to identifying genetically 
superior livestock and estimating genetic trends in 
breeding programs. Methods range from likelihood- 
inference, Bayesian and empirical Bayesian tech- 
niques for nonlinear models to generalized linear 
models and design of experiments and breeding pro- 
grams. MK 


Statistics, T(16-17: 1, 2), C, L. Robust Esti- 
mation and Testing. Robert G. Staudte, Simon J. 
Sheather. Ser. in Prob. & Math. Stat. Wiley, 1990, 
xix + 351 pp, $54.95. (ISBN: 0-471-85547-2] Well- 
motivated, practical presentation of robust estima- 
tion and testing procedures illustrated with several 
small data sets. Examines the robustness of these 
procedures primarily with respect to outliers and fat- 
tailed distributions. Solid treatment of robust meth- 
ods of inference in regression. Includes disk contain- 
ing data sets and Minitab macros to calculate robust 
estimates and their standard errors. RWJ 


Statistics, P. Spatial Statistics: Past, Present, and 
Future. Ed: Daniel A. Griffith. Mono., No. 12. In- 
stitute of Math Geography (IMaGe, 2790 Briarcliff, 
Ann Arbor, MI 48105), 1990, xvi + 398 pp, (P). 
(ISBN: 1-877751-42-1] Contains spring 1989 Syra- 
cuse University symposium proceedings. The thir- 
teen papers summarize recent developments in point 
and regional processes, and explore various mod- 
elling, estimation, testing, and data analysis ques- 
tions. RWJ 


Statistics, P. Statistical Design and Analysis of In- 
dustrial Ecpertments. Ed: Subir Ghosh. Stat.: Text- 
books & Mono., V. 109. Marcel Dekker, 1990, xi + 
533 pp, $55. (ISBN: 0-8247-8251-8] Contains eigh- 
teen papers on methods of experimental design and 
quality control, with some emphasis on factorial de- 
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signs. Applications papers include analysis of health 
studies data. RWJ 


Statistics, T(17-18: 1, 2), S, P, L. Regression Es- 
timators: A Comparative Study. Marvin H.J. Gru- 
ber. Statistical Modeling & Decision Sci. Academic 
Pr, 1990, xi + 347 pp, $49.95. [ISBN: 0-12-304752- 
8] Derives and examines the relation, efficiencies, 
and robustness of least squares, ridge, minimax, and 
Bayes estimates of the parameters of a linear regres- 
sion model. Applications to the Kalman filter, and 
one- and two-way ANOVA. RWJ 


Statistics, P, L. A Statistical Model: Frederick 
Mosteller’s Contributions to Statistics, Science, and 
Public Policy. Eds: S.E. Fienberg, et al. Ser. in 
Stat. Springer-Verlag, 1990, xviii + 283 pp, $39. 
[ISBN: 0-387-97223-4] Belatedly honors Mosteller 
on his seventieth birthday. Includes a brief biogra- 
phy by J.W. Tukey, comprehensive bibliography of 
Mosteller’s books, papers, etc., and reviews of his 
books. Friends, collaborators, colleagues, and stu- 
dents contribute discussions of Mosteller’s contribu- 
tions to mathematical statistics, scientific general- 
ism, methodology and applications, as well as bib- 
liographical anecdotes on Mosteller as an educator 
and his years at Harvard. Includes numerous pho- 
tographs. MK 


Statistics, P, L. Contributions to Econometric The- 
ory and Application: Essays in Honour of A.L. 
Nagar. Eds: R.A.L. Carter, J. Dutta, A. Ullah. 
Springer-Verlag, 1990, xiii + 366 pp, $49. [ISBN: 0- 
387-97285-4] Collection of papers in honor of A.L. 
Nagar’s sixtieth birthday. ‘Topics range from the 
role of exogenous variables in structural models, to 
measurement of the inequality in the distribution 
of income in developing countries, to dynamic, si- 
multaneous, linear equation models and resampling 
schemes in linear regression and errors-in-variables 


models. MK 


Statistics, S(18), P, L. Nonlinear Estimation. 
Gavin J.S. Ross. Ser. in Stat. Springer-Verlag, 
1990, viii + 189 pp, $39. (ISBN: 0-387-97278-1] 
Advanced text concerns formulating, fitting, and in- 
terpreting non-standard, non-linear models. Mathe- 
matical exposition terse, but covers broad spectrum; 
inference, stable transformations, geometry of non- 
linear inference, etc. One chapter treats compu- 
tational methods for non-linear modelling, includ- 
ing computer programs, libraries, and algorithms for 
non-linear optimization; another contains brief ex- 
planations of applications. Many references make for 
a nice, encyclopedic companion. MK 


Statistics, T(17-18: 2, 3), L. Statistical Inference. 
George Casella, Roger L. Berger. Stat. & Prob. Ser. 
Wadsworth, 1990, xviii + 650 pp, $54.95. (ISBN: 
0-534-11958-1] Develops theoretical statistics from 
first principles of probability theory. Discusses how 
to evaluate, as well as find, the appropriate statis- 
tical techniques for estimation and hypothesis test- 
ing problems. Presents use of sufficiency, likelihood, 
and invariance principles in chapter on data reduc- 
tion. Other topics: ANOVA, decision theory and 
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Stein estimation, and linear regression. Nearly 600 
exercises. RWJ 


Statistics, P*. The Collected Works of John 
W. Tukey, Volume VI: More Mathematical: 1938- 
1984. Ed: Colin L. Mallows. Stat. & Prob. Ser. 
Wadsworth, 1990, lxxii + 661 pp, $59.95. [ISBN: 
0-534-05103-0] Sixth volume of a series containing 
Tukey’s many and diverse works. Previous volumes 
covered his contributions in the areas of time series 
(TR’s, May 1985 and February 1986), data analysis 
(TR, June-July 1987), and graphics (TR, December 
1988). This volume contains “six early mathemati- 
cal works, four papers on fiducial inference, five on 
transformations, and twenty-seven on a miscellany of 
topics in mathematical statistics,” again with com- 
ments by the editor on each of the papers. RSK 


Computer Literacy, T, S*(13-16), L. Computer 
Ethics: Cautionary Tales and Ethical Dilemmas in 
Computing. ‘Tom Forester, Perry Morrison. MIT 
Pr, 1990, viii + 193 pp, $19.95. (ISBN: 0-262- 
06131-7] An unrelenting tale of mistakes, misman- 
agement, and malfeasance in the production and use 
of computer systems, laced with thought-provoking 
questions and—at each chapter’s end—hypothetical 
scenarios intended to provoke role-playing debate 
among students. Covers computer crime, software 
theft, hacking, reliability, privacy, artificial intelli- 
gence, and workplace issues. Raises many ethical 
questions, but does not examine them from a formal 
philosophical perspective. Intended merely to sensi- 
tize computer science students to ethical issues, not 
to lay a foundation for a new branch of ethics. LAS 


Elementary Computer Science, T(13: 1). Prob- 
lem Solving and Structured Programming in FOR- 
TRAN 77, Fourth Edition. Elliot B. Koffman, Frank 
L. Friedman. Addison-Wesley, 1990, xvi + 652 pp, 
(P). [ISBN: 0-201-51216-5] A text for a first course 
in computer programming. High school algebra 
presents a sufficient mathematical preparation. Con- 
tains a large number of applications from business, 
engineering, and the physical sciences. Nonstandard 
topics include elementary computer-aided design and 
numerical methods. Exercises and short program- 
ming projects appear at the end of each chapter, 
along with examples of common programming errors. 
(First Edition, TR, June-July 1981; Second Edition, 
TR, January 1986.) SM 


Programming, T(14), S. An Introduction to 
Object-Oriented Programming and C++. Richard S. 
Wiener, Lewis J. Pinson. Addison-Wesley, 1988, xii 
+ 273 pp, (P). [ISBN: 0-201-15413-7] C++ is an 
object-oriented extension of the system programming 
language C. The extensions allow a user to define a 
hierarchy of classes, and to create objects with the 
properties of inheritance and polymorphism. This 
text is an introduction to the principles of object- 
oriented software development, and the syntax and 
semantics of the C++ programming language. GMS 


Programming, S(13), L. The C++ Programming 
Language. Bjarne Stroustrup. Addison-Wesley, 1987, 
villi + 328 pp, (P). [ISBN: 0-201-12078-X] Readable 
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introduction to C++ with careful attention to the 
needs of programmers already fluent in C. Although 
it is a complete reference, this book concentrates 


most on those features and data types which are not 
part of C. JO 


Programming, S. Advanced Prolog: Techniques 
and Ezamples. Peter Ross. Intern. Ser. in Logic 
Programming. Addison-Wesley, 1989, x + 294 pp. 
(ISBN: 0-201-17527-4] Prolog is an acronym for 
Programming with logic, and it is a language based 
on the principles of the first-order predicate logic. 
This text assumes that the reader has had a brief in- 
troduction to the language but is not a sophisticated 
user. It shows how to use the advanced features of 
the language to design and build complex software 
systems. GMS 


Programming, T(13-14: 1), L. Program Design 
with Modula-2. Susan Eisenbach, Chris Sadler. In- 
tern. Comput. Sci. Ser. Addison-Wesley, 1989, xiii 
+ 285 pp, (P). (ISBN: 0-201-17567-3] Based on 
courses given at the Polytechnic of North London. 
Introduction to programming in Modula-2 with em- 
phasis on program design rather than language syn- 
tax. LC 


Languages, T(16-18: 1), S, L. Logic Program- 
ming: Systematic Program Development. Yves Dev- 
ille. Intern. Ser. in Logic Programming. Addison- 
Wesley, 1990, xvii + 338 pp. [ISBN: 0-201-17576- 
2] Text is divided into three parts which reflect the 
problem solving cycle of a Prolog programmer: elab- 
orate a specification for the problem; construct a 
logic description of the problem; derive a correct and 
efficient Prolog program. Assumes familiarity with 
Prolog. Chapter exercises, background sections, and 
summaries. Appendices; bibliography. RJA 


Computer Systems, S(18), P. Computers in 
Mathematics. Eds: David V. Chudnovsky, Richard 
D. Jenks. Lect. Notes in Pure & Appl. Math., 
V. 125. Marcel Dekker, 1990, vii + 410 pp, $89.75 
(P). [ISBN: 0-8247-8341-7] Fifteen papers from the 
July 1986 International Conference on Computers 
and Mathematics held at Stanford. Topics include 
something for everyone: papers range from modern 
applications of computer algebra (e.g., Grobner ba- 
sis methods for solving systems of polynomial equa- 
tions) to history of computation (George Dantzig 
on linear programming and its effects on comput- 
ing) to applications of computing to mathematical 
physics. PZ 


Computer Systems, T(13-14:1). Calculator En- 
hancement for Single- Variable Calculus: A Manual 
of Applications Using the HP-28S Calculator, Pre- 
liminary Edition. James H. Nicholson. Consultant: 
J.W. Kenelly. Harcourt Brace’ Jovanovich, 1991, ix 
+ 99 pp, (P), [ISBN: 0-15-505676-X]; Calculator En- 
hancement for Linear Algebra. D.R. LaTorre, 1991, 
ix + 130 pp, (P), [ISBN: 0-15-505674-3]; Calcula- 
tor Enhancement for Multivariable Calculus. J.A. 
Reneke. Ed: D.R. LaTorre. 1991, vii + 94 pp, (P), 
(ISBN: 0-15-505675-1]; Calculator Enhancement for 
Differential Equations. T.G. Proctor. Ed: D.R. 
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LaTorre. 1991, xiii + 105 pp, (P). (ISBN: 0-15- 
505673-5] Four slim volumes on using the HP-28S 


(and, soon, the HP-48S) calculator for mathematical 
work in calculus, multivariable calculus, linear alge- 
bra, and differential equations. In addition to ideas 
and exercises on a variety of topics in each course 
area, each volume contains many examples of useful 
programs. Taken together they considerably extend 
the power and ease of use of the calculator for an 
impressive variety of mathematical operations. Stu- 
dents who use these supplements diligently will in- 
deed “enhance” their understanding both of the cal- 
culator and of the underlying mathematics. PZ 


Computer Graphics, T(16-17), P. Curves and 
Surfaces for Computer Aided Geometric Design: A 
Practical Guide, Second Edition. Gerald Farin. 
Computer Sci. & Scientific Computing. Academic 
Pr, 1990, xvii + 444 pp, $39.95. (ISBN: 0-12-249051- 
7| This edition adds programs in C to implement 
basic methods. (First Edition, TR, June-July 1989.) 
OJ 


Computer Graphics, P. Computation of Curves 
and Surfaces. Eds: Wolfgang Dahmen, Mariano 
Gasca, Charles A. Micchelli. NATO ASI Ser. C, V. 
307. Kluwer Academic, 1990, ix + 536 pp, $149. 
[ISBN: 0-7923-0724-0] Proceedings of a NATO Ad- 
vanced Study Institute at Puerto de la Cruz, Tener- 
ife, Spain in July 1989. Fifteen papers in the areas 
of discrete methods for curve and surface represen- 
tation, interpolation and data fitting, multivariate 
splines and applications, and algebraic and differen- 
tial geometric techniques for machine representation 
of curves and surfaces. OJ 


Theory of Computation, P. Lecture Notes in 
Computer Science-429: Design and Implementation 
of Symbolic Computation Systems. Ed: A. Miola. 
Springer-Verlag, 1990, xii + 284 pp, $24.70 (P). 
(ISBN: 0-387-52531-9] About 40 brief papers on 
aspects of design and implementation of symbolic 
computation—theory, languages, software environ- 
ments, and architectures—comprise the proceedings 
of the DISCO ’90 symposium held in April 1990 at 
Capri, Italy. PZ 

Artificial Intelligence, P. A Computational Model 
of Metaphor Interpretation. James H. Martin. Per- 
spectives in Artif. Intellig., V. 8. Academic Pr, 
1990, xxiii + 229 pp, $34.95. (ISBN: 0-12-474730- 
2] Presents a theory of metaphor acquisition, inter- 
pretation, and denotation, and describes in detail a 
computer program applying this theory to natural 
language interpretation systems. JO 


_ Artificial Intelligence, S(16-18), P, L. The 
Handbook of Artificial Intelligence, Volume IV. Eds: 
Avron Barr, Paul R. Cohen, Edward A. Feigenbaum. 
Addison-Wesley, 1989, xv + 699 pp, (P). (ISBN: 0- 
201-51731-0] Eight chapters, each by different au- 
thors. Each chapter is on an independent topic and 
stresses issues, principles, theory, and the relation- 
ship to artificial intelligence. Some topics are re- 
peats from previous volumes but all represent new, 
fresh views of the field as it is today. Topics: black- 
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board systems, cooperative distributed problem solv- 
ing, fundamentals of expert systems, natural lan- 
guage understanding, knowledge-based software en- 
gineering, qualitative physics, knowledge-based sim- 
ulation, computer vision. Bibliography; name and 
subject indexes. RJA 


Artificial Intelligence, P. Intelligence as Adaptive 
Behavior: An Ezperiment in Computational Neu- 
roethology. Randall D. Beer. Perspectives in Artif. 
Intellig., V. 6. Academic Pr, 1990, xxiii + 213 pp, 
$29.95. (ISBN: 0-12-084730-2] Criticizes the tra- 
ditional artificial intelligence notion that intelligent 
computer systems should be modeled after human 
reasoning, and argues that animal adaptation to en- 
vironments provides a better model. To back up this 
argument, the author describes the development of 
an artificial insect and discusses its behavior. JO 


Artificial Intelligence, S. Common LISP Pro- 
gramming for Artificial Intelligence. Tony Hasemer, 
John Domingue. Intern. Comput. Sci. Ser. Addison- 
Wesley, 1989, xiii + 444 pp, (P). [ISBN: 0-201-17579- 
7| Lisp is a language based on the functional pro- 
gramming model. It is widely used to solve prob- 
lems in artificial intelligence such as theorem prov- 
ing, natural language understanding, and expert sys- 
tems. The dialect called Common Lisp is the most 
widely accepted and used version of the language. 
This text teaches that language and shows how it 
can be used tq.solve problems in the area of artificial 
intelligence. GMS 


Artificial Intelligence, P. Machine Vision for 
Three-Dimenstonal Scenes. Ed: Herbert Freeman. 
Academic Pr, 1990, xi + 419 pp, $59.95. [ISBN: 
0-12-266722-0] Proceedings of a April 1989 confer- 
ence sponsored by the Center for Computer Aids for 
Industrial Productivity (CAIP) of Rutgers Univer- 
sity, New Brunswick, New Jersey. Fourteen chap- 
ters include twelve papers on various approaches to 
machine recognition of 3-D objects, and two sum- 
maries of panel discussions on the present state of 
the field. OJ 


Artificial Intelligence, $(18), P, L. Neural Net- 
work Models in Artificial Intelligence. Matthew Zei- 
denberg. Ser. in Artif. Intellig. Ellis Horwood (US 
Distr: Wiley), 1990, 268 pp. [ISBN: 0-13-612185- 
3] Overview of the current state of neural networks 
research. Exposition is clear enough to make this 
book a good starting point for learning about the 
field. Chapters on applications to learning, expert 
systems, knowledge representation, speech synthe- 
sis, visual perception, and language understanding. 
Good sized bibliography. JO 


Artificial Intelligence, T, S. The Computing Neu- 
ron. Eds: Richard Durbin, Christopher Miall, 
Graeme Mitchison. Computat. & Neural Systems 
Ser. Addison-Wesley, 1989, xiv + 417 pp, $44.25. 
(ISBN: 0-201-18348-X] A collection of twenty-one re- 
search papers presented at a meeting entitled “The 
Neuron as a Computational Unit” held at Cambridge 
University in 1988. The papers describe recent re- 
search work in artificial intelligence, cognitive psy- 
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chology, and connectionist modelling. GMS 


Artificial Intelligence, P. The Organization of 
Learning. C.R. Gallistel. MIT Pr, 1990, xi + 648 
pp, $45. (ISBN: 0-262-07113-4] Long discussion of 
what sorts of spacial and temporal data animals can 
represent in their nervous systems, and what sorts 
of computational methods they use to interpret the 
data. This book is dense with examples of animal 
behavior and makes fascinating reading. The math- 
ematics in the book is spread out and sometimes hard 
to find. JO 


Computer Science, P. Formal Semantics and 
Pragmatics for Natural Language Querying. James 
Clifford. Tracts in Theoretical Comput. Sci., V. 8. 
Cambridge University Pr, 1990, xv + 194 pp, $29.95. 
(ISBN: 0-521-35433-1] The author argues that ex- 
isting database theories ignore the passage of time 
and the corresponding changes in data, so that 
databases “forget” what they once knew, and are 
unable to answer simple time-related questions. This 
book develops the notion of a historical database to 
address these limitations. JO 


Computer Science, T(16-17: 2), S. Software 
Engineering: Methods and Management. Anneliese 
von Mayrhauser. Academic Pr, 1990, xxvili + 864 
pp, $49.95. [ISBN: 0-12-727320-4] Part I: meth- 
ods and techniques for developing software (prob- 
lem definition, requirements, specifications, design, 
coding, testing, operation, and maintenance). Part 
II; introduction to the management of software engi- 
neering projects (management by metrics, feasibility, 
planning, personnel, leadership, development guide- 
lines). LCL 


Applications, P, L. Fractals in the Natural Sci- 
ences. Eds: M. Fleischmann, D.J. Tildesley, R.C. 
Ball. Princeton University Pr, 1990, 200 pp, $19.50 
(P); $50. [ISBN: 0-691-02438-3; 0-691-08561-7] A 
collection of papers, each followed by brief questions 
and answers from a 1988 Royal Society “discussion 
meeting.” Opening paper is by B. Mandelbrot on 
fractal geometry; others are on applications to phys- 
ical or life sciences. First published in 1989 by the 
Royal Society of London. LAS 


Applications (Biological Science), P, L. Mathe- 
matical Evolutionary Theory. Ed: Marcus W. Feld- 
man. Princeton University Pr, 1989, x + 341 pp, 
$19.95 (P). [ISBN: 0-691-08503-X] A collection of 
fourteen papers written by students or colleagues of 
Samuel Karlin to commemorate Karlin’s numerous 
contributions to mathematical evolution theory. Ge- 
netics and randomness play central roles in all these 
papers. The book is divided into two sections. The 
first section contains papers concerned with general 
genetic theory, while the second section contains ap- 
plications of the general theory to specific popula- 
tions. SM 

Applications (Biological Science), S$(16-17), P, 
L. Fundamentals of Mathematical Evolutionary Ge- 
netics. Yuri M. Svirezhev, Vladimir P. Passekov. 
Math. & Its Applic., V. 22. Kluwer Academic, 1990, 
xvi + 395 pp, $129. [ISBN: 90-277-2772-4] Deter- 
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ministic and stochastic models of selection, muta- 
tion, migration, and other factors in evolutionary ge- 
netics. Note the price! BC 


Applications (Biological Science), P, L. Lec- 
ture Notes in Biomathematics-81: Mathematical A p- 
proaches to Problems in Resource Management and 
Epidemiology. Eds: C. Castillo-Chavez, S.A. Levin, 
C.A. Shoemaker. Springer-Verlag, 1989, vii + 327 
pp, $33 (P). (ISBN: 0-387-51820-7] Proceedings 
from a conference held at the Center of Applied 
Mathematics at Cornell University in Fall 1987. Ex- 
cellent survey of the current status of applying quan- 
titative methods, some stochastic, to problems con- 
cerning population growth, from individual cells to 
large vertebrates. MK 


Applications (Biological Science), S(16-17), P, 
L. Population Harvesting: Demographic Models of 
Fish, Forest, and Animal Resources. Wayne M. 
Getz, Robert G. Haight. Mono. in Population Biol- 
ogy, V. 27. Princeton University Pr, 1989, xv + 391 
pp, $15.95 (P); $45. [ISBN: 0-691-08516-1; 0-691- 
08515-3] Fish, animal, and forest resources share 
the quality of reproducing seasonally, creating large 
numbers of individuals of a specific age. This age 
class structure of important populations leads to im- 
portant similarities in mathematical models describ- 
ing these populations. This work explores these sim- 
ilarities, and also introduces the reader to nonlinear 
models. SM 


Applications (Economics), S, P, L*. The In- 
visible Hand: Economic Equilibrium in the History 
of Science. Bruna Ingrao, Giorgio Israel. Transl: 
Ian McGilvray. MIT Pr, 1990, xiii + 491 pp, $47.50. 
(ISBN: 0-262-09028-7] Anexploration of the history 
of efforts to mathematicize economics, rooted in the 
eighteenth century efforts to imitate Newtonian me- 
chanics, centered on the nineteenth century work of 
Léon Walras which established the idea of equilib- 
rium as the central issue in economic theory, and 
moving through the twentieth century work of Ar- 
row, Debreau, and Smale. Focuses throughout on the 
“invariant paradigmatic core” of ezistence (is there 
a state of compatibility in a competitive market?), 
uniqueness (is there one such state?), and stability 
(do market forces lead to this state?). The answers 
finally emerged in the last two decades: usually, sel- 
dom, and hardly ever. LAS 


Applications (Economics), S(17-18), P, L*. 
Toward a Formal Science of Economics: The Az- 
tomatic Method in Economics and Econometrics. 
Brent P. Stigum. MIT Pr, 1990, xiv + 1033 pp, 
$45. (ISBN: 0-262-19284-5] Encyclopedia dealing 
with foundations for a formal methodological basis 
of econometric theory. Begins with axiomatic meth- 
ods and mathematical logic. Proceeds to economet- 
ric theory of consumer choice, probability theory, 
nonstandard analysis, and epistemological concerns. 
Concludes with applications to empirical analysis of 
econometric theories; determinism, uncertainty, and 
the utility hypothesis; prediction, distributed lags, 
and stochastic difference equations. MK 
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Applications (Engineering), T(14: 1). Solving 
Problems in Structures, Volume 2. P.C.L. Croxton, 
L.H. Martin. Longman Scientific & Technical (US 
Distr: Wiley), 1990, vii + 282 pp, $34.95 (P). (ISBN: 
0-582-02355-6] Volume 2 of a two-volume set which 
is part of a series on the solution of problems for en- 
gineering students in universities, polytechnics, and 
colleges. Informal treatment (no proofs) of relevant 
theory. Worked out and outlines of solutions of prob- 
lems dealing with the elastic analysis of a hyperstatic 
structure. Volume 1 dealt with statics and statically 
determinate structures. JDEK 


Applications (Fluid Dynamics), P. Fluid Dy- 
namics of Viscoelastic Liquids. Daniel D. Joseph. 
Appl. Math. Sci., V. 84. Springer-Verlag, 1990, xvii 
+ 755 pp, $59.95. (ISBN: 0-387-97155-6] A study 
of the elasticity of liquids and of asymptotic theo- 
ries of constitutive models. Intended as a reference 
for special topics which do not appear in any other 
reference. JO 


Applications (Fluid Dynamics), P. Two Phase 
Flows and Waves. Eds: Daniel D. Joseph, David 
G. Schaeffer. IMA, V. 26. Springer-Verlag, 1990, xi 
+ 164 pp, $25. (ISBN: 0-387-97293-5] Eleven pa- 
pers based on a symposium held at the Institute for 
Mathematics and Its Applications at the University 
of Minnesota in January 1989. Focus is on the prop- 
erties of materials which consist of many small solid 
particles or grains, and on the development of waves 
in flowing composites. JDEK 

Applications (Physical Science), S(17-18), P. 
Computational Mechanics of Nonlinear Response of 
Shells. Eds: W.B. Kratzig, E. Onate. Ser. in Com- 
putational Mechanics. Springer-Verlag, 1990, viii + 
405 pp, $93. (ISBN: 0-387-52035-X] Unlike most 
collections of scientific papers, this one represents 
a careful selection, providing a rigorous introduc- 
tion based on continuum mechanics and then several 
advanced papers concerned with new finite element 
derivations for nonlinear shell problems. MU 


Applications (Physical Science), P. Mathemat- 
ical Models of Chemical Reactions: Theory and Ap- 
plications of Deterministic and Stochastic Models. P. 
Erdi, J. Toth. Princeton University Pr, 1989, xxiv 
+ 259 pp, $69.50. [ISBN: 0-691-08532-3] Written 
jointly by a chemist and a mathematician, this vol- 
ume surveys mathematical models, both determin- 
istic and stochastic, of chemical kinetics. The last 
chapter is on applications. Should appeal to chemists 
with a mathematical inclination. KS 

Applications (Physics), T(14: 2), S, L. A 
Course in Mathematics for Students of Physics: 2. 
Paul Bamberg, Shlomo Sternberg. Cambridge Uni- 
versity Pr, 1990, xvii + 443 pp, $54.50. (ISBN: 0-521- 
33245-1] In this second of two volumes, the authors 
present some relatively advanced mathematical top- 
ics in natural physical contexts. For example, the be- 
ginnings of algebraic topology and the calculus of ex- 
terior differential forms appear in discussions of elec- 
trical networks and electrostatics respectively. Very 
interesting insights are available to the physically- 
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minded mathematician, but this course is not for the 
average or the lazy undergraduate. JO 


Applications (Physics), P. Convezity Methods in 
Hamiltonian Mechanics. Ivar Ekeland. Ser. of Mod- 
ern Surveys in Math., Band 19. Springer-Verlag, 
1990, x + 247 pp, $79.50. [ISBN: 0-387-50613-6] A 
thorough treatment of nonlinear systems with Hamil- 
tonian H(t, p,q) convex in the p,q variables. The 
major focus is on nonlinear analysis, critical point 
theory, and properties of the index of periodic solu- 
tions. The author admits that “many, if not most, 
important Hamiltonians are nonconvex” (and many 
that are can be treated with other methods), but 
still feels that the “unity and simplicity” of this ap- 
proach “more than offsets the loss of scope.” You 
decide. MPR 


Applications (Physics), S, L*. The Mathemat- 
ics of Projectiles in Sport. Neville de Mestre. Aus- 
tralian Math. Soc. Lect. Ser., V. 6. Cambridge Uni- 
versity Pr, 1990, xi + 175 pp, $22.95 (P). [ISBN: 0- 
521-39857-6] Everything you ever wanted to know 
about projectiles—angles, distances, time of flight, 
envelopes, drag, spin, Magnus effect, wind—derived 
from military references but applied here to sports 
projectiles (golf, cricket, basketball, javelin, etc.). 
Uses vector calculus and dimensionless equations. 
Numerous exercises; extensive references. LAS 


Applications (Physics), S(18), P. Symplectic 
Techniques in Physics. Victor Guillemin, Shlomo 
Sternberg. Cambridge University Pr, 1990, xi + 468 
pp, $29.95 (P). (ISBN: 0-521-38990-9] Paperback, 
corrected edition of 1984 text (TR, January 1985). 
Introduces symplectic geometry in the historical con- 
text of the study of optics. This introduction is fol- 
lowed by a chapter laying out the basic theoretical 
results, and three independent chapters on applica- 
tions to Yang-Mills equations, completely integrable 
systems, and symplectic homogeneous spaces. JO 


Applications (Physics), S(18), P. Stochastics, 
Algebra and Analysts in Classical and Quantum Dy- 
namics. Eds: S. Albeverio, Ph. Blanchard, D. Tes- 
tard. Math. & Its Applic., V. 59. Kluwer Aca- 
demic, 1990, xv + 247 pp, $88. (ISBN: 0-7923- 
0637-6] This volume collected the Proceedings of 
the Fourth French-German Encounter on Mathemat- 
ics and Physics which was held in Marseille, France 
from February 28 to March 4, 1988. MU 


Reviewers 


RJA: Richard J. Allen, St. Olaf; SB: Steve Benson, St. Olaf; 
JNC: Judith N. Cederberg, St. Olaf; LC: Laura Chihara, 
St. Olaf; BC: Barry Cipra, St. Olaf; SG: Steven Galovich, Car- 
leton; JPH: Joan P. Hutchinson, Macalester; OJ: Ockle John- 
son, St. Olaf; RWJ: Roger W. Johnson, Carleton; MK: Michael 
Kahn, St. Olaf; RSK: Richard S. Kleber, St. Olaf; JDEK: 
Joseph D.E. Konhauser, Macalester; LCL: Loren C. Larson, 
St. Olaf; SM: Steve McKelvey, St. Olaf; JO: Jeff Ondich, 
St. Olaf; MLR: Margaret L. Reese, St. Olaf; MPR: Matthew 
P. Richey, St. Olaf; AWR: A. Wayne Roberts, Macalester; KS: 
Karen Saxe, St. Olaf; GMS: G. Michael Schneider, Macalester; 
JS: John Schue, Macalester; LAS: Lynn Arthur Steen, St. Olaf; 
MU: Milton Ulmer, Carleton; PZ: Paul Zorn, St. Olaf. 
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In appreciation ... 


At this time of the year we would like to thank those who have refereed papers for the Monthly during the past year. 
Without their diligent efforts we could not have functioned. — The editors. 


Harvey Abbott, Asuman Aksoy, Stanley Alama, Felix Albrecht, Stephanie Alexander, J. Ralph Alexander, 
William Allard, Steve Althoen, George E. Andrews, Larry Andrews, Thomas Angell, Tom Apostol, Kenneth Appel, 
Tom Armstrong, Mark Ashbaugh, Richard Askey, Paul Atela, Kendall Atkinson, Tom Banchoff, Steven Bank, Robert 
Barnhill, Michael Barr, Michael Beeson, Katalin Bencsath, Edward Bender, John Benedetto, Arthur Benjamin, 
Jon Bentley, Sterling Berberian, David Berg, Vitaly Bergelson, Bruce Berndt, Ted Bick, Patrick Billingsley, David 
Bindschadler, Richard Bishop, Tibor Bisztriczky, R. Boas, Joseph Bonin, Robert Borden, Nigel Boston, Michael 
Botsko, William Boyce, John Brillhart, Duane Broline, Ezra Brown, Herbert Brown, Andrew Bruckner, Robert 
Burckel, Frank Burk, Donald Burkholder, Lynne Butler, Eugenio Calabi, C. David Callan, Rodney Canfield, G. 
Cargo, Bille Carlson, George Carrier, Robert Carroll, Jesus Castillo, Frank Cater, Thomas Cecil, Gulbank Chakerian, 
Roger Chalkley, Jal Choksi, Joseph Cima, Courtney Coleman, Matthew Cook, Don Coppersmith, Paul Corazza, 
Robert Craggs, George Cross, Robert Currier 

Everett Dade, John D’Angelo, Martin Davis, Phillip Davis, John Dawson, Jane Day, Mahlon Day, Carl DeBoor, 
N. G. DeBruijn, William Derrick, John Dettman, Luc Devroye, M. Dickson, Peter Dixon, J. Doob, Larry Dornhoff, 
David Doster, Underwood Dudley, Hugh Edgar, H. Edwards, Paul Eggermont, Richard Elderkin, Jim Epperson, Paul 
Erdos, Richard Escobales, Ronald Evans, Charles Eynden, Kenneth Falconer, Michael Filaseta, Nathan Fine, Stephen 
Fisher, Harley Flanders, Gerald B. Folland, David Foulis, Michael Frame, George Francis, Eduardo Friedman, 
Wolfgang Fuchs, Stephen Gagola, Jr., David Gale, Fred Galvin, Adriano Garsia, Ira Gessel, Richard Gibbs, Leonard 
Gillman, Clark Givens, Alan Gluchoff, Irving Gluck, Richard Goldberg, Herman Gollwitzer, Adolph Goodman, Sandy 
Grabiner, Judith Grabiner, Ronald Graham, James Graham-Eagle, Alfred Gray, Daniel Grayson, Leon Greenberg, 
Curtis Greene, Andreas Griewank, Jerrold Griggs, Branko Grunbaum, H. N. Gupta 

L. R. Haff, Heine Halberstam, Mary-Elizabeth Hamstrom, David Harbater, Hiroshi Haruki, Maurice Heins, 
Lester Helms, Michael Henle, Douglas Hensley, Ralph Henstock, David Herron, Edwin Hewitt, Nigel Higson, Adolph 
Hildebrand, Aimo Hinkkanen, Kevin Hockett, Alan Hoffman, Robert Hogg, Roger Horn, Paul Humke, I. Martin 
Isaacs, J. Isbell, Arieh Iserles, Howard Jacobowitz, Robert James, Gerald Janusz, Richard Jerrard, Carl Jockusch, 
Bruce Johnson, Charles Johnson, David Johnson, David Joyce, Dan Kalman, Wilfred Kaplan, Irving Katz, Louis 
Kauffman, Donald Keedwell, Linda Keen, Arnfried Kemnitz, Clark Kimberling, Roger Kirchner, Victor Klee, Daniel 
Kleitman, Ronald Knill, Fred Kochman, Paul Koosis, Ralph Kopperman, David Kurtz 

Jeffery Lagarias, Tsit-Yuen Lam, Ray Langebartel, Peter Lax, Derrick Lehmer, Emma Lehmer, Joseph Lehner, 
Calvin Long, William Lucas, Frederick Luttman, Jr., Mark Lynch, Russell Lyons, Thomas MacGregor, Hosam Mah- 
moud, John Mallet, Walter Mallory Maynard Mansfield, Marvin Marcus, David Marker, Mario Martelli, George 
Martin, Jerold Mathews, R. Daniel Mauldin, Leon McCulloch, Richard McGehee, Catherine McGeoch, Fred Mc- 
Morris, Christian Meyer, Kenneth Meyer, Paul Meyers, Ernst Michael, Joseph Miles, Anthony. Miller, David Minda, 
Peter Monk, David S. Moore, John Morgan, John Morrison, B. C. Mortimer, David Muller, James Munkres, Kent 
Nagle, Zuhair Nashed, Richard Nau, Morris Newman, W. Nicholson, Albert Nijenhuis, Zbigniew Nitecki, Ivan Niven, 
Richard O’Neil, Karl Norton, Joseph O’Rourke, Beresford Parlett, G. W. Peck, N. Tenney Peck, Richard Pfiefer, 
Alden Pixley, Esteban Poffald, Carl Pomerance, Horacio Porta, Stephen Portnoy, Josef Przytycki, Bruce Reznick, 
Billy Rhoades, Arthur Robinson, Derek Robinson, Dave Roeder, C. A. Rogers, S. Roman, Joseph Rosenblatt, Ira 
Rosenholtz, Richard Roth, Bruce Rothschild, Joseph Rotman, Walter Rudin 

Bill Sands, Donald Sarason, Jonathan Schaer, Ken Schilling, James Schmerl, Frank Schmidt, Wlater Schneider, 
Glenn Schober, Lowell Schoenfeld, James Schupp, Richard Scott, David Secrest, John Selfridge, Marjorie Senechal, 
Jeffrey Shallit, Daniel Shanks, Louis Shapiro, Donald Sherbert, Don Shimamoto, Herb Silverman, Evelyn Silvia, 
Benjamin Sims, Michael Singer, David Singer, Walter Sizer, Charles Small, Don Small, Lance Small, F. Smith, 
David Smith, Ernst Snapper, J. Laurie Snell, Jerome Spanier, Richard Stanley, John Steinig, Richard Stoneham, 
Gilbert Strang, Karl Stromburg, David Styer, Ted Suffridge, Sheila Sundaram, Lajos Takacs, Carsten Thomassen, 
Michael Townsend, William Trench, Dan Ullman, Stephen Ullom, Jeff Vaaler, Rudolf Vydorny, Sam Wagstaff, Jr., 
Walter Wallis, John Walter, Edward Wang, Larry Washington, David Watkins, Paul Weichsel, Elliott Weinberg, 
Howard Weiner, Gregory Wene, John Wetzel, John Wierman, Tom Wieting, Robert Wijsman, Donald Wilken, K. 5S. 
Williams, Robert Woodrow, Patrick Wotus, Mary Wright, D. B. Zagier, Doron Zeilberger, Paul Zorn 
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For adoption 
consideration, request 
examination packages 
from your Houghton 
Mifflis regional office. 


3400 Midway Rd.. 
Dallas, DX 75244-5165 
1900S. Batavia Ave.. 
Geneva, IL 60134 
925... Meadow Dr.. 
Palo Alto. CA 94303 
0) Campus Dr.. 
Princeton. NJ 08540 


New 

Basic Mathematics with Business Applications 
Joanne S. Lockwood, Plymouth State College 

Richard N. Aufmann and Vernon C, Barker 

Both of Palomar College 

About 200 perforated pages * paperback ¢ Just published 


New 

Essential Mathematics with Applications, Third Edition 
Vernon C, Barker and Richard N. Aufmann 

About 300 perforated pages * paperback * Just published 


New 

Prealgebra: Mathematics for a Variable World 
Daniel R. Bach and Patricia Leitner 

Both of Diablo Valley College 

About 550 pages * paperback * Just published 


Elementary Algebra with Basic Mathematics 
Richard N. Aufmann, Vernon C. Barker, and Joanne S. Lockwood 
462 pages * paperback * 1989 


Beginning Algebra 

Robert G. Marcucci, San Francisco State University 
Harold L. Schoen, The University of lowa 

478 pages * hardcover * 1990 


Intermediate Algebra 
Norman L. Siever, Los Angeles Valley College 
636 pages * hardcover * 1990 


New 

Algebra with Trigonometry for College Students 

Richard N. Aufmann, Vernon C. Barker, and Joanne S. Lockwood 
About 700 pages * hardcover * Just published 


College Algebra and Trigonometry 
768 pages * hardcover * 1990 


College Algebra 
481 pages * hardcover * 1990 


College Trigonometry 
303 pages * hardcover * 1990 


New 
Precalculus 
About 700 pages * hardcover * Just published 


All by Richard N. Aufmann, Vernon C. Barker, and 
Richard D. Nation, Jr., Palomar College 


Precalculus 


Dennis Carrie, Golden West College 
686 pages * hardcover * 1990 


4, Houghton Mifflin 


Building the skills of 


3/4 million students . . . 
and counting 


Basic College Mathematics: 
An Applied Approach, Fourth Edition 


About 576 perforated pages « paperback e Instruc- 

tor's Annotated Edition e Instructor's Resource Manual/ 

Testing Program e Test Bank e Computerized Test Gener- 

ator e Computer Tutor™ e Solutions Manual e Math ACE: 

Additional Computer Exercises e Videos e Transparencies ' a 
Just published \ 


Introductory Algebra: An Applied Approach 
Third Edition 


About 576 perforated pages e paperback e Ancillaries as 
above e Just published 


Also available in a hardcover edition. 


Intermediate Algebra: An Applied Approach 
Third Edition 


About 656 perforated pages e paperback e Ancillaries as above 
Just published 


Also available in a hardcover edition. 


All by Richard N. Aufmann and Vernon C. Barker 
Both of Palomar College 


The program that has helped over three-quarter million 

students perfect their basic math and algebra skills 

continues to provide the most comprehensive ap- 

proach to learning with: 

e An interactive approach replete with examples 
and practice problems within the text 

e Abundant real-life applications 

e An objective-specific format that coordi- 
nates text elements and ancillary items. 


Now, the best method is even better with: 

e New use of full color to highlight 
important text features 

e 100 percent new word problems 

e An expanded and improved 
ancillary package. 


For adoption consideration, request 
examination packages from your 
Houghton Mifflin regional sales office. 


3, Houghton Mifflin 


13400 Midway Rd., Dallas, TX 75244-5165 

1900 S. Batavia Ave., Geneva, IL 60134 
925 E. Meadow Dr., Palo Alto, CA 94303 
101 Campus Dr., Princeton, NJ 08540 


Developmental Mathematics 
Essential Mathematics 1997 
Introductory Algebra 3/e 1997 
Intermediate Algebra 3/e 1991 


all by Linda R. Pulsinelli and Patricia |. Hooper 
New! 
Basic Mathematics 1997 


Beginning Algebra with Geometry 1997 


Intermediate Algebra 1997 
A Short Course in Geometry 1997 
all by Patricia A. Juelg 
New! 
Newt Prealgebra 19917 
Developmental Mathematics 1997 
Essential Mathematics 1991 
Basic College Mathematics 3/e 1990 
Elementary Algebra 3/e 1990 


Intermediate Algebra 3/e 1990 
all by Ignacio Bello 


Precalculus Mathematics 

Trigonometry: An Analytic Approach 6/e 1997 
by Irving Drooyan and Charles C. Carico 
New! 
(a Algebra and Trigonometry for College Students 1997 
College Algebra with Review 1991 
College Algebra 2/e 1990 

Algebra and Trigonometry 2/e 1990 

Trigonometry 2/e 1990 


Precalculus 2/e 1990 
all by Michael Sullivan 


Applied Mathematics 
Applied Mathematics for Business, Economics, 
New! Life Sciences, and Social Sciences 4/e 1991 
Applied Calculus for Business, Economics, 
Life Sciences, and Social Sciences 4/e 1997 
Finite Mathematics for Business, Economics, Life Sciences, and Social Sciences 5/e 1990 
College Mathematics for Business, Economics, Life Sciences, and Social Sciences 5/e 1990 
Calculus for Business, Economics, Life Sciences, and Social Sciences 5/e 1990 
Essentials of College Mathematics: For Business, Economics, 
Life Sciences, and Social Sciences 1989 
all by Raymond A. Barnett and Michael R. Ziegler 


And Keep in Mind... 


New! Supplemental Applications and Topics to Accompany 
aim =the Barnett/Ziegler College Mathematics Series 1991 
by Jon E. Baum 


Add it up with Macmillan and Dellen! 
Contact your local representative for more information. 


Where Two Plus Two Equals More! 


Mathematics for Teacher. 


Mathematics for Elementary Teachers 2/e 1997 
by Gary L. Musser and William F. Burger 


free 


Problem Solving with Elementary Mathematics 1997 
by David Gay 


Discrete Mathematic 


Discrete Mathematics 2/e 1990 
by Richard Johnsonbaugh 


Discrete Mathematics 1990 
by Jerrold W. Grossman 


Linear Algebra 


Elementary Linear Algebra 5/e 1997 
by Bernard Kolman New: 


Linear Algebra with Applications 3/e 1990 
by Steven J. Leon 


Simulation/Operations Researc 


A Course in Simulation 1997 
by Sheldon M. Ross 


free 


Calculus with the Hewlett-Packard Symbol Manipulating Calculators: 


fre 


Student Supplement 


HP-28S and HP-48SX 1997 
by Lynn E. Garner 


Student’s Guide to Using Casio Graphics Calculators 1997 
Student’s Guide to Using Hewlett-Packard Graphics Calculators 1991 
Student’s Guide ta Using Texas Instrument Calculators 1997 
all by Joan Ebert Girard 


SOFTWARE! 


We offer two different testing systems, which are available for many of our 
mathematics textbooks: DellenTest, acomputer-generated random testing 
system for the IBM or Apple II, and Macmillan Testing Software, for the 
IBM or Macintosh. These systems allow you to create and generate 
flawlessly structured tests, and best of all, they are both exceedingly easy 
to use. To find out which system is appropriate for your text, please contact 
your local representative. 


MACMILLAN PUBLISHING COMPAN 


College Divisione866 Third AvenuesNew York, NY 10022 
COLLIER MACMILLAN CANADA, INC.*1200 Eglinton Avenue East, Suite 200*Don Mills, Ontario M3C 3N1 


SUBSCRIBE TO 


UME 
TRENDS 


News and reports on 
Undergraduate 
Mathematics 
Education 


Keep up with what's happening in 
Undergraduate Mathematics 
Education. 


UME TRENDS is conducting a 
subscription drive for its third 
volume beginning March 1991. 


Whether you are receiving it now or 
not, you must subscribe in order to 
keep your issues coming. 


We must receive a minimum 
number of subscriptions 1n order to 
keep publishing UME TRENDS. 


SUBSCRIBE NOW! 


Subscriptions are 
$12 per year for six 
issues. 

Copy or clip the adjoining form and 


mail it to arrive by February 8, 
1991. 
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New From 


Steven G. Krantz 


COMPLEX ANALYSIS: 
THE GEOMETRIC VIEWPOINT 


Steven G. Krantz 


Geometric methods have been used in 
complex analysis since the 1930s when 
Lars Ahlfors discovered that they give a nice 
way to look at the Schwarz lemma. Since 
that time they have become a central part of 
the research activities of complex analysis. 
However these important techniques have 
never found their way into a text accessible 
to a broad audience. 


Steven G. Krantz, a leading worker in com- 
plex analysis and a well-known mathemati- 
cal expositor, has written the first book ex- 
plaining how complex analysis can be stud- 
ied using methods of geometry. Assum- 
ing no background in Riemannian geometry, 
and only one semester of complex analysis, 
Krantz explains the role of Hermitian met- 
rics and of curvature in understanding the 
Schwarz lemma, normal families, Picard’s 
theorems, conformal mappings, and many 
other topics. A minimum of geometric for- 
malism is used to gain a maximum of ge- 
ometric and analytic insight. The climax of 
the book is an introduction to several com- 
plex variables fromthe geometric viewpoint. 
Poincaré’s theorem, that the ball and bidisc 
are biholomorphically inequivalent, is dis- 
cussed and proved. 


COMPLEX 


THE GE OME 
TRIC 
VIEWPOINT 


STEVEN G KRANT? 
phTiGay 
@ 
Oy 
* saat? 


Th 
€ Carus Mathematical Monograph 
: S 
Number 23 


Except for the minimal background require- 
ments, the book is self-contained. A review 
of relevant topics in the classical theory of 
one complex variable is provided. The style 
is light and inviting. The book is a must for 
anyone with an interest in complex analysis. 
Take a glance at the main chapter headings 
and order your copy today 

Mm Principal Ideas of Classical Function 
Theory, 

Basic Notions of Differential Geometry, 
Curvature and Applications, 

Some New Invariant Metrics, 

A Glimpse of Several Complex Vari- 
ables 


210 pp., 1990, Hardbound, 
ISBN 0-88385-026-5 


List $22.00 MAA Member $18.50 
Catalog Number CAM-23 


ORDER FROM 


2) Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D. C. 20036 


New From 


Forman S. Acton 


NUMERICAL METHODS THAT WORK 


Forman Acton 


Numerical Methods That Work, originally 
published in 1970, has been reissued by 
the MAA with a new preface and some 
additional problems. Acton deals with a 
common sense approach to numerical al- 
gorithms for the solution of equations: alge- 
braic, transcendental, and differential. It as- 
sumes that a computer is available for per- 
forming the bulk of the arithmetic. The book 
is divided into two parts, either of which 
could form the basis of a one-semester 
course in numerical methods. Part! dis- 
cusses most of the standard techniques; 
roots of transcendental equations, roots of 
polynomials, eigenvalues of symmetric ma- 
trices, and so on. Part Il cuts across the ba- 
sic tools, stressing such common problems 
as instabilities in extrapolation, removal of 
singularities, and loss of significant figures. 
The book is written with clarity and preci- 
sion, intended for practical rather than the- 
oretical use. 


An eminently readable and very well 
movitivated introductory text. 
Mathematics of Computation 


A first-rate book which can be used ei- 
ther as a text or reference. 
Choice 


560 pp., Paperbound, 1990 
ISBN 0-88385-450-3 


List $16.50 MAA Member $13.00 
Catalog Number NMTW 


ORDER FROM 


” I Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D. C. 20036 


University of Arizona 
GRADUATE FELLOWSHIPS 
IN THE 
MATHEMATICAL SCIENCES 


Up to 12 fellowships for outstanding new graduate students in the mathematical 
sciences will be available in 1991-92. Fellowship applicants should be seeking the Ph.D. and 
planning careers in teaching and/or fundamental research. Anticipated stipends are $12,000 
for 12 months with both in-state and non-resident tuition waived. 

These fellowships are restricted to citizens, permanent residents, or individuals who 
have established intent to become citizens or permanent residents of the United States. 
Applications from U.S. women and students belonging to U.S. minority groups are particularly 
invited. Currently one-fourth of the U.S. graduate students in pure and applied mathematics 
at Arizona are women. 

The University of Arizona has excellent programs in traditional pure and applied 
mathematics, and is a leading institution in interdisciplinary applied mathematics. This 
presents a wealth of opportunities for graduate study encompassing such areas as dynamical 
systems, number theory, computational science, geometry, nonlinear partial differential 
equations, mathematical physics, probability, and applications of mathematics in the physical, 
biological, social, and engineering sciences. In addition, outstanding computational 
facilities for graduate study and research are available to the over 170 graduate students in 
the mathematical sciences at the University of Arizona. 

Fellowship applicants of superior quality will be among the students invited to the 
Fifth Annual Workshop for Advanced Undergraduates on Current Ideas in Nonlinear Science, March 
2-5, 1991. Limited support is available for attendees. (The deadline for Workshop 
applications is February 1, 1991.) The workshop is designed to communicate topics in current 
active research in three areas: (i) Geometry in Analysis and Number Theory, (ii) Application 
of Computers to Mathematical Research, and (iii) Mathematical Applications in the Life 
Sciences. 

For information and application materials contact: 
W. M. Greenlee or T. W. Secomb 
Department of Mathematics/Program in Applied Mathematics 
University of Arizona - Tucson, AZ 85721 - (602) 621-2068 
The University of Arizona is an Equal Opportunity Employer 


Writing Mathematics Well, by Leonard Gillman 
64 pp., 1987, ISBN-0-88385-443-0 
Catalog Number - WMW List: $6.50 MAA Member: $5.00 


Good writing conveys more than the forceful writing. It does this both by pre- 


author originally had in mind, while poor 
writing conveys less. Well-written papers 
are more quickly accepted and put into 
print and more widely read and appreci- 
ated than poorly written ones—and for 
notes, monographs, and books the qual- 
ity of writing is more important than it is 
for papers. 

In Writing Mathematics Well, Leo- 
nard Gillman tells his readers how to 
develop a clear and effective style. All 
aspects of mathematical writing are cov- 
ered, from general organization and 
choice of title, to the presentation of 
results, to fine points on using words and 
symbols, to revision, and finally, to the 
mechanics of putting your manuscript into 
print. No book can by itself make you a 
better writer, but this one will alert you to 
the opportunities for better and more 


cept and by example. 

A book to be read for its sharpness 
and wit as well as for enlightenment, Writ- 
ing Mathematics Well should be on the 
shelf of anyone who writes or intends to 
write mathematics. It will amuse and 
delight the already careful writer and it will 
help reform and refine the sensibilities of 
those who may be somewhat careless 
about their writing. 


SN 
c 


NS 
S 


WS 


\\ 


4, 


Order from 

The Mathematical Association 
of America 

1529 Eighteenth St. NW 
Washington, D.C. 20036 


Revised 
and 
Updated 


THE LAST PROBLEM 


E. T. Bell 
Revised and updated by Underwood Dudley 


What Eric Temple Bell calls the last prob- 
lem is the problem of showing that Pierre 
Fermat was not mistaken when he wrote 
in the margin of a book, almost 350 years 
ago, that 2” + y” = z” has no solution in 
positive integers when n > 3. The orig- 
inal text of THE LAST PROBLEM traced 
the problem from Babylonia in 2000 B.C. 
to seventeenth-century France. Along the 
way we learn quite a bit about history, and 
just as much about mathematics. Under- 
wood Dudley’s notes bring us up-to-date on 
recent attempts to solve the problem. 


The book is unique in that it is a biogra- 
phy of a famous problem. The book fits 
no categories. It is not a book of mathe- 
matics. Pages go by without an equation 
appearing. It is not a history of number the- 
ory because it includes too much about the 
history of the western world, and it is not 
a history of western civilization because its 
focus is on mathematics. It is too entertain- 
ing to be scholarly and contains too much 
mathematics to be widely popular. It is an 
unusual book. 


What T.A.A. Broadbent said about Bell's 
work applies to THE LAST PROBLEM. 


TPR ae 


tle] ly: ( 


Ue Li ip pad: 


Nernvoag Dudley: 


HHL iit et iMFI) 
a ee 


ily Wii] Mh in 


dassiainal Hy 


oT 

“A. hay 
an } 
tf | 


as 


———. 
=—- = 


— 
=-—_— 
= 


iH 


a iat 


= i 


S— 


His style is clear and exuberant, his 
opinions, whether we agree with them 
or not, are expressed forcefully, often 
with humor and a little gentle malice. 
He was no uncritical hero-worshipper, 
being as quick to mark the opportunity 
lost as the ground gained, so that from 
his books we get a vision of mathemat- 
ics as a high activity of the questing 
human mind, often fallible, but always 
pressing on the neverending search for 
mathematical truth. 


This is a rich and varied, wide-ranging book, 
written with force and vigor by someone with 
a distinctive style and point of view. It will 
provide hours of enjoyable reading for any- 
one interested in mathematics. 


328 pp., Paperbound, 1990 
ISBN-0-88385-451 -1 


List: $17.50 MAA Member: $13.50 
Catalog Number TLP 


ORDER FROM 


Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D. C. 20036 


HBJ Brings you Range, 
Depth, and Innovation in 1991 


INTRODUCTORY ALGEBRA 
and 
INTERMEDIATE ALGEBRA 


RONALD HATTON and GENE R. SELLERS 


both of Sacramento City College 


Now in Hardcover! 
ELEMENTARY ALGEBRA 
Second Edition 

and 
INTERMEDIATE ALGEBRA 
Second Edition 
YOSHIKO YAMATO and 
MARY JANE CORDON 
both of Pasadena City College 


FINITE MATHEMATICS 
WILLIAM OWEN and FRED CUTLIP 
both of Central Washington University 


ALGEBRA FOR COLLEGE STUDENTS 
Third Edition 

BERNARD KOLMAN, Drexel University 
and ARNOLD SHAPIRO 


ELEMENTARY LINEAR ALGEBRA 
WITH APPLICATIONS 

Second Edition 

RICHARD O. HILL, JR., 
Michigan State University 


CALCULATOR ENHANCEMENT FOR A 
COURSE IN LINEAR ALGEBRA 
DON LA TORRE, Clemson University 


CALCULUS 

One and Several Variables 
ROBERT ELLIS and DENNY GULICK 
both of University of Maryland 

Just $29.95 (Net) 


SINGLE-VARIABLE CALCULUS 
WITH DISCRETE MATHEMATICS 
JOHN A. FEROE and 

CHARLES STEINHORN 

both of Vassar College 


CALCULATOR ENHANCEMENT FOR A 
COURSE IN SINGLE VARIABLE CALCULUS 
JOHN KENNELLY and 

JAMES H. NICHOLSON 

both of Clemson University 


CALCULATOR ENHANCEMENT FOR A 
COURSE IN MULTIVARIABLE CALCULUS 
JAMES A. RENEKE, Clemson University 


CALCULATOR ENHANCEMENT FOR A 
COURSE IN DIFFERENTIAL EQUATIONS 
THOMAS G. PROCTOR, 

Clemson University 


CALCULATOR ENHANCEMENT FOR A 
COURSE IN INTRODUCTORY STATISTICS 
IRIS FETTA, Clemson University 


ABSTRACT ALGEBRA 
DENNIS KLETZING, Stetson University 


Harcourt Brace Jovanovich, Inc. 

College Sales Office 

7555 Caldwell Avenue, Chicago, IL 60648 
(708) 647-8822 


Two Books © 
Mathematical 


Ingenuity By 
Ross Honsberge 


r 


Ross Honsberger is the author of seven 
books in the Dolciani Mathematical Expo- 
sition series, each of which presents prob- 
lems from algebra, arithmetic, number the- 
ory, probability, and geometry, and provides 
ingenious solutions and/or intriguing results. 
His most recent addition to the series is 
MORE MATHEMATICAL MORSELS which 
is a continuation of the earlier Mathematical 
Morsels volume published in 1979. Now is 
your opportunity to own both of these ex- 
cellent books. The problems presented are 
meant to be enjoyed, rather than instruct, al- 
though instruction is almost always the au- 
tomatic by-product. 


Honsberger says in the Preface of his first 
volume of morsels, “Mathematics abounds 
in bright ideas. No matter how long and 
hard one pursues her, mathematics never 
seems to run out of exciting surprises. And 
by no means are these gems to be found 
only in difficult work at an advanced level. 
All kinds of simple notions are full of ingenu- 
ity. The present volume discusses scores of 
elementary problems and contains dozens 
of marvellous ideas.” 


All of the problems are accessible to any- 
one with a knowledge of freshman mathe- 
matics. Buy both of these excellent books 
today, and share them with your students. 


This is an excellent text..lt exposes 
gifted undergraduate students to a set 
of elementary problems...whose proofs 
are ingenious, usually containing beau- 
tiful ideas...lts lucid expository style 
and stimulating mathematical content 
make its reading a rewarding experi- 
ence for mathematicians at all levels. 
(about Mathematical Morsels) 


M.S. Cheema in Mathematical Reviews 


SB ay fe cir Dart By he dom 
Perera re ee 


More Mathematical Morsels 


MATHEMATICAL ASSOCIATION OF AMERI 
ws Dolciant Mathematical Exposkions Nol se 


2 
alee wet 


...even the more advanced theo ¥. an 
will find many of the problems fa "i!" 


ing... Ua 
D.T. Swift-Hook in Physics Educatic 2° 
(about Mathematical Morsels) sr 


© mae! 


rit 


MORE MATHEMATICAL MORSELS 


Ross Honsberger 
315 pp., Paperbound, 1990, 


List: $16.00 MAA Member: $13.50 
Catalog Number DOL-10 


MATHEMATICAL MORSELS 
Ross Honsberger 


249 pp., Hardbound, 1979 
ISBN 0-88385-303-5 


List: $ 28.00 MAA Member: $ 21.00 
Catalog Number DOL-03 


I Er eI 


Both volumes purchased now 
List: $40.00 MAA Member: $29.00 
Catalog Number DOL-310 


ORDER FROM 


Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D. C. 20036 


THE AMERICAN 
MATHEMATICAL MONTHLY 


Volume 98. Number 2 February 


| Contents (ISSN 0002- 
ARTICLES 
Carlyle Circles and the Lemoine Simplicity 
of Polygon Constructions .............. 2... eee eee eee DUANE W. DETEMPLE 
Part Metric and Hyperbolic Metric ............. 2... cee ee ee ee ee H. S. BEAR 
" NOTES 
Arcs with Positive Measure and 
; a Space-Filling Curve ............... TIMOTHY LANCE ano EDWARD THOMAS 
} 
7 Unions and Common Complements 
of Subspaces ............ K. P. S. BHASKARA RAO ano A. RAMACHANDRA RAO 
‘ The Generation of All Rational 
Orthogonal Matrices ............... HANS LIEBECK ano ANTHONY OSBORNE 
Hypocycloids, Continued Fractions, 
: and Distribution Modulo One ...............2 000 e eee NORMAN RICHERT 
| The Perimeter of aRoSe . 0.0.0.0... ccc cece cece es JAMES P. BUTLER 
THE TEACHING OF MATHEMATICS 
wl A Tree Among the Stepping Stones .................. 2-2 ee eee A. R. MEIJER 
An Analytic Technique to Prove Borel’s Strong Law - 
of Large Numbers ............. cece ee ee ee eee eee eens LIU WEN 
A Topological Mean Value Theorem for the Plane............ IRA ROSENHOLTZ 
Disks and Shells Revisited .............. 0. ce eee eee eee eee WALTER CARLIP 
i. L’H6pital’s Rule Via Integration ................. 02 ee eee DONALD HARTIG 
PROBLEMS AND SOLUTIONS 
Elementary Problems and Solutions ............... 2... eee ee eee ee eee eens 
Advanced Problems and Solutions .......... 20... 0. cc cece ee eee eee eens 
REVIEWS 
The Mathematics of Projectiles in Sport. 
By Neville de Mestre ............... 2.02.2 eee VICTOR G. SZEBEHELY 
A Transition to Advanced Mathematics, by Douglas Smith, Maurice Eggen, 
and Richard St. Andre. Foundations for Advanced Mathematics, 
by Carol Avelsgaard. Introduction to Advanced Mathematics, 
by William Barnier and Norman Feldman ................ JUDITH PALAGALLO 
TELEGRAPHIC REVIEWS ............ 0. ec cc eee eee ene eee ee nea 


1991 


9890) 


109 


124 


127 


131 


133 
139 


144 


146 
149 
154 
156 


158 
168 


178 


179 


NOTICE TO AUTHORS 


See statement of editorial policy (volume 89, p. 3). Follow the format in current issues. Put your full mailing 
address in a line at the head of the paper, following the title and the author’s name. Send three copies of the 
manuscript, legibly typewritten on only one side of the paper, double-spaced with wide margins, and keep one as 
protection against loss. Prepare illustrations carefully, on separate sheets of paper in black ink, the original 
without lettering and two copies with lettering added. Rough copies of the illustrations should be included in the 
manuscript at the appropriate places. Supply the full five-symbol Mathematics Subject Classification number, as 
described in Mathematical Reviews, 1985 and later. (This is necessary for indexing purposes.) 

All manuscripts whose length is more than 7 manuscript pages should be sent to HERBERT S. WILF, 
Department of Mathematics, University of Pennsylvania, Philadelphia, PA 19104. Shorter articles about the 
teaching of mathematics should be sent to either MELVIN HENRIKSEN (linear algebra, algebra, differential 
equations), Department of Mathematics, Harvey Mudd College, Claremont, CA 91711, or STAN WAGON 
(calculus, analysis, number theory, discrete mathematics, statistics), Department of Mathematics, Macalester 
College, St. Paul, MN 55105. Other shorter articles should be submitted as follows: in algebra, discrete mathe- 
matics, or probability, to RODICA SIMION, Department of Mathematics, George Washington University, 
Washington, DC 20052; in geometry or topology to DENNIS DETURCK, Department of Mathematics, Univer- 
sity of Pennsylvania, Philadelphia, PA 19104; in analysis to DAVID J. HALLENBECK, Department of Mathe- 
matical Sciences, University of Delaware, Newark, DE 19716. Papers in other fields may be sent to any of the 
editors; however, do not send a paper to more than one editor. Proposed problems or solutions should be sent to 
PAUL T. BATEMAN, MontTu ty Problems, Department of Mathematics, University of Illinois, 1409 West Green 
Street, Urbana, IL 61801; unsolved problems to RICHARD GUY, Department of Mathematics and Statistics, 
University of Calgary, Alberta, Canada T2N 1N4. 


EDITOR 
HERBERT S. WILF 


ASSOCIATE EDITORS 
PAUL T. BATEMAN DAVID J. HALLENBECK RODICA SIMION 
DONNA BEERS PAUL R. HALMOS ANITA E. SOLOW 
DENNIS DETURCK MELVIN HENRIKSEN JOEL SPENCER 
HAROLD G. DIAMOND JOAN P. HUTCHINSON LYNN A. STEEN 
JOHN DIXON WILLIAM M. KANTOR KENNETH B. STOLARSKY 
J. H. EWING JOSEPH KONHAUSER STAN WAGON 
RICHARD GUY RICHARD LIBERA DOUGLAS B. WEST 
LEE A. RUBEL 


Reprint Permission: Marcia P. SwWARD, Executive Director, Mathematical Association of America, 1529 
Eighteenth Street, N.W., Washington, DC 20036-1385. 

Advertising Correspondence: Ms. ELAINE PEDREIRA, Advertising Manager, Mathematical Association of 
America, 1529 Eighteenth Street, N.W., Washington, DC 20036. 

Subscription correspondence, changes of address, and inquiries about nondelivered or defective issues: 
Membership /Subscriptions Department, Mathematical Association of America, 1529 Eighteenth Street, N.W., 
Washington, DC 20036. 

Microfilm Editions: University Microfilms International, Serial Bid Coordinator, 300 North Zeeb Road, Ann 
Arbor, MI 48106. 


The AMERICAN MATHEMATICAL MONTHLY (ISSN 0002-9890) is published monthly except bimonthly June-July and 
August-September by the Mathematical Association of America at 1529 Eighteenth Street, N.W., Washington, DC 20036 
and Montpelier, VT. 

The annual subscription price of $32 for the AMERICAN MATHEMATICAL MONTHLY to an individual member of the 
Association is included as part of the annual dues. (Annual dues for regular members, exclusive of annual subscription 
prices for MAA journals, are $60. Student and unemployed members receive a 66% dues discount; emeritus members 
receive a 50% dues discount; and new members receive a 40% dues discount for the first two years of membership.) The 
nonmember /library subscription price is $128 per year. 

Copyright © by the Mathematical Association of America (Incorporated), 1991, including rights to this journal issue 
as a whole and, except where otherwise noted, rights to each individual contribution. General permission is granted to 
Institutional Members of the MAA for noncommercial reproduction in limited quantities of individual articles (in whole 
or in part) provided a complete reference is made to the source. 

Second class postage paid at Washington, DC, and additional mailing offices. 

Postmaster: Send address changes to the AMERICAN MATHEMATICAL MONTHLY, Membership/Subscriptions Depart- 
ment, Mathematical Association of America, 1529 Eighteenth Street, N.W., Washington, DC 20036-1385. 

PRINTED IN THE UNITED STATES OF AMERICA 


Carlyle Circles and the Lemoine Simplicity of Polygon Constructions 


DuANE W. DeTempe_e, Washington State University 


DUANE W. DETEmMpLeE: I received my Ph.D. from Stanford University in 
1970, where I wrote my dissertation on bounded univalent functions under 
the supervision of M. M. Schiffer. I have been at Washington State 
University since then, with the exception of the 1981-82 year when I was a 
visiting associate professor at Claremont Graduate School. My current 
research areas include combinatorial geometry and graph theory. 


1. Historical Comments and Introduction. Until very late in the eighteenth 
century the only regular polygons known to have Euclidean straightedge and 
compass constructions were precisely the ones shown in Book IV of Euclid’s 
Elements. This situation changed abruptly on March 30, 1796, when one month 
before his nineteenth birthday Carl Friedrich Gauss made the first entry in his 
notebook [7]: 


Principia quibus innititur sectio circuli, ac divisibilitas geometrica in septemdecim 
partes etc. 


Mart. 30 Bruns [igae]. 


Principle of the circle’s division, and how one geometrically divides the circle 
into seventeen parts, and so forth. 


March 30 [ Braunschweig | 


Gauss had discovered that besides the regular polygons of 2” - 3, 2” - 4, 2” -5 
and 2” - 15 sides, there were a number of other constructible polygons, including 
the 17-gon. Gauss secured priority to his discovery by publishing an announcement 
on June 1, 1796, which appeared in the “Intelligenzblatt der Allgenieinen Liter- 
aturzeitung,” the first and only time he published in a journal of advance notices. 
In his short note he wrote that ‘‘This discovery is really only a corollary of a theory 
with greater content, which is not complete yet, but which will be published as 
soon as it is complete” ({1], [7]). The full meaning of Gauss’ pronouncement came 
in 1801 with the publication of his monumental Disquisitiones Arithmeticae [6]. Its 
final two sections (articles 365 and 366) discuss the issue of polygon constructibil- 
ity. Recasting the problem in terms of constructing the N vertices of the polygon 
on a given circle, Gauss stated his result as follows. 


In general therefore in order to be able to divide the circle geometrically into 
N parts, N must be 2 or a higher power of 2, or a prime number of the form 
2™ + 1, or the product of several prime numbers of this form, or the product 
of one or several such primes into 2 or a higher power of 2. 


The sufficiency of the condition follows readily from Gauss’ analysis. The neces- 
sity, however, is not obvious and Gauss never published a proof of this assertion. 
The first proof is credited to Pierre L. Wantzel (1814-48) [15]. 
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Primes of the form 2” + 1, and therefore necessarily of the form F, = 27 +1, 
are the Fermat primes. Gauss knew that F, = 3, F, =5, F, = 17, PF; = 257 and 
F', = 65537 were prime, and he knew that Euler had shown F; to be composite. 
Even today no other Fermat primes have been found, although the smallest 
unsettled cases are F,,, F,,, and F,,. (The primality status of Fermat numbers as 
of 1983 can be found in Keller [8], and a shorter but more recent table is included 
in Young and Buell [17]). 

Beyond the brilliant theoretical breakthrough of Gauss there is still an intrigu- 
ing puzzle: how can one devise a sequence of steps with straightedge and compass 
which constructs a regular N-gon? Gauss expressed interest in this problem, 
although he did not offer explicit geometric constructions. Throughout the last 
century, and even into present times, numerous constructions have been contrived 
for the 17-gon (see [1], [2] for historical comments). J. F. Richelot’s construction of 
the 257-gon required a total of 194 pages, published in four parts in Crelle’s 
journal in 1832. A Professor O. Hermes labored for ten years on the construction 
and associated algebra of the 65537-gon. The work filled a trunk which was 
donated to the Mathematical Institute at Gottingen. Nearly a century later it 
remains stored in an attic there, in all likelihood having never been read. 

In what follows we show that a remarkably uniform procedure to construct the 
regular F,-gons is afforded by consistent use of an idea attributed to the Scottish 
historian Thomas Carlyle (1795-1881). Before turning his attention to the literary 
arts, the young Carlyle taught mathematics, translated Legendre’s influential 
Elements de Géométrie into English, and, important for our purposes, devised an 
elegant geometrical solution to quadratic equations. His idea is based on what we 
will call the Carlyle circle construction, which we describe in the next section. In 
the remaining sections we will demonstrate constructions of the 5-, 17- and 
257-gon which employ the Carlyle circle method, and, at least in a general way, we 
also discuss the method’s applicability to the 65537-gon. 

Our unified approach has two nice features: first, the correctness of the 
constructions is easy to verify, and second, the constructions are highly efficient. 
For the pentagon the procedure is similar to the popular construction from 
antiquity found in Ptolemy’s Almagest, but with a slight twist at the end. For the 
17-gon we rediscover a method of Smith [12] which appeared in 1920, but again we 
can offer a small improvement. The 257-gon is fairly difficult at the concrete level 
of actually using the Euclidean tools with sufficient accuracy to produce a convinc- 
ing construction, but at the conceptual level the procedure is remarkably straight- 
forward in its use of 24 Carlyle circles. Indeed, one becomes convinced that the 
style of construction needs no change to handle the 65537-gon. 

A quantitative measure of the simplicity of a geometric construction was devised 
in 1907 by Emile Lemoine (1840-1912), and is described in Eves [5] this way. 
Consider the following five operations. 

S,: to make the straightedge pass through one given point. 

S,: to rule a straight line. 

C’,: to make one compass leg coincide with any point. 

C,: to make one compass leg coincide with any point of a given locus. 

C,: to describe a circle. 

If these operations are performed m,,™m,,1,,n>5,n3 times respectively in a con- 
struction then m,S, + m,S, + n,C, + n,C, + nC, 1s the symbol of the construc- 
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tion. The total number of operations m,+m,+n,+n,+ Hn, Is called the 
simplicity of the construction, where we note that a low value of simplicity 
corresponds to an efficient construction. In what follows we will assume that we 
use a single noncollapsing compass!. Simplicity is improved by having circles with a 
common radius, or a common center, described in succession since the compass 
would not have to be reset, or one of its legs would not require repositioning. 

As an example, the construction of a circle and two lines which cross orthogo- 
nally at the circle’s center (if we begin with the circle, C,, then draw a line through 
the circle’s center, S$, + S,, and finally erect the perpendicular bisector, 2C, + 
2C, + 2S, + S,) has symbol 3S, + 28, + 2C, + 0C, + 3C, and simplicity 10. If 
the first two steps in the construction just described are reversed (begin with a line, 
S,, and then draw a circle centered on the line, C, + C,) then the symbol is 
28, + 28, + 2C, + C, + 3C;, but the simplicity is still 10. 

The constructions we will describe for regular polygons have lower simplicity 
measures than their competitors and are perhaps nearly optimal. An interesting, 
but apparently open, problem is whether or not it can be proved that a given 
construction has optimal Lemoine simplicity. 


2. Carlyle Circles. A variety of methods to construct the roots of a quadratic 
equation are known, but the one of Carlyle we now describe is especially attractive. 
It appeared in Leslie’s Elements of Geometry with the remark “The solution of 
this important problem now inserted in the text, was suggested to me by Mr. 
Thomas Carlyle, an ingenious young mathematician, and formerly my pupil” (see 
[5]). 

Beginning with the usual Cartesian x, y-axes and a unit distance, our intention 
is to construct the roots of the quadratic equation x* — sx + p = 0, where s and p 
are given signed lengths. To this end, plot the points A(0,1) and B(s, p). The 
circle which has the segment AB as a diameter will be called the Carlyle circle C, > 
of the given quadratic equation. The center of C,, is at M(s/2,(1 + p)/2), which 
can be constructed as the midpoint of AB. It will later be found useful to notice 
that M is also the midpoint of S(s,0) and Y(0O, 1 + p), where the advantage derives 
from being able to locate S and Y on the x- and y-axes, respectively. 

If we suppose C, , crosses the x-axis at H,(x,,0) and H,(x,,0), with x, > x, 
we have the two cases shown in Fic. 1. For p of either sign we observe that 
x, +x, = 5s. For p < 0 the intersecting chords theorem for FiGure 1(a) shows 
OH, : OH, = OA - OC; that is, (x,)-(—x,) = (1): (—p). For p > 0, the inter- 
secting secants theorem for Fic. 1(b) shows x, : x, = p. In all cases, then, 


(x —x,)(x —x,) =x? -— sx +p, S=X,+X), DP =X 1X, 
and so we conclude the following theorem: 


If the Carlyle circle C, , intersects the x-axis at x, and x», these are the roots of 


x*—sxtp=0. 


An alternate proof can be based on the Pythagorean theorem. 


'The modern noncollapsing compass permits one to draw with ease a circle centered at a given 
point whose radius is the distance between two other points. If one only has a collapsing compass, the 
type assumed by Euclid, then it is surprisingly cumbersome to transfer lengths and consequently the 
simplicity numbers of most constructions is greatly increased. 
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Y(0,1 + p) 
C(O, p) 


(a) Case p < 0 (b) Case p > 0 


Fic. 1 


The quadratic equations we will consider later all have real roots, but it should 
be pointed out that complex roots are also related to the Carlyle circle. Suppose 
the y-coordinate (1 + p)/2 of the center of C, , exceeds the circle’s radius 


(are 


a condition which reduces to having a negative discriminant A = s* — 4p. The 
circle C, , does not intersect the x-axis, which reflects the fact that the quadratic 
equation has a pair of complex conjugate roots z,,z, =s/2 + ivV— A. We will 
now verify that any circle centered on the x-axis and orthogonal to C, ,, will meet 
the vertical line through the center of C, ,, at 
s 1 S 
5. 5v-a and 5. _ = 
2 2 2 2 
To see this, consider the circle centered at (é,0) which is orthogonal to C, p- Its 
equation is 


2 


S 2 1+p 
2 2 2 
and setting x = s/2 we find 
l+py\’ 
y?= 5 —r?= —iA 


Therefore any such orthogonal circle can be used to construct z,, and z,, with the 
y-axis now interpreted as the imaginary axis of the complex plane. 


3. Construction of the Regular Pentagon. We assume that we have already 
constructed the unit circle and the x, y-axes. Following Gauss we view the problem 
as the construction of the roots of z> — 1 = 0. The root z, = 1, corresponding to 
the point P,(1,0), can be factored out to show that the remaining points 
P,, P,, P3, Py correspond to the roots of the quartic equation z* +z? + z* + 
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z+1=0. Letting « = e*7'/ these roots are «', €”, €°, e*, which we observe sum 
to —1. 
Two other sums of roots are of special interest, namely, 


Ny) =e + €* = 2c0s(27/5) 
n, =e* + €° = 2c08(47/5). 
It is simple to check that 


(1) 


NM tm = —I, N° m= —i, N) > O0> m4. (2) 
From (2) it follows that 7, and 7, are, respectively, the larger and smaller root 
of the quadratic equation x* + x — 1 = 0, and therefore they can be constructed 
by means of the Carlyle circle C_,_, which has center M(—1/2,0). M is 
constructed as the midpoint of OO, where O(—1,0). The Carlyle circle centered at 
M and passing through A(0, 1) then intersects the x-axis at H,(7,, 0) and H,(7;, 0). 
In view of (1), circles of unit radius centered at H, and H, will intersect the 
original unit circle at P,, P,,P3,P,, thereby forming the desired pentagon 
P) P,P,P3P, as shown in FiGureE 2. 


Fic. 2 


Beginning with the unit circle and the x, y-axes as given, the construction just 
described has symbol 2S, + S, + 8C, + 0C, + 4C,, for a Lemoine simplicity of 
15. The popular construction of Ptolemy (see [4], [15]) proceeds in the same way 
through the construction of H,, but then takes AH, as the distance between 
successive vertices of the pentagon, resulting in a simplicity of 16 to locate all five 
vertices. 


4. Construction of the Regular Heptadecagon (17-gon). Letting « = e77'/1", 


our task here is to construct the roots ¢«!,¢7,...,¢!° of the equation z'° + 
zi +--+ +z+1=0. Gauss’ idea was to arrange these roots in a cycle in which 


each root is a gth power of its predecessor: 
2 
el e® 68 ek e® = e!, (3) 


Since ¢” = ¢™ (41) We can see that corresponding to the prime p = 17 there 
must be found a g for which 


g’#1(modp), 1l<r<p-—2 and g?~!'=1(modp). (4) 
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A positive integer g which satisfies the conditions of (4) for p is said to be a 
primitive root of the prime p. For any Fermat prime p > 3, g = 3 1s always a 
primitive root. This follows from the fact that 3 is a quadratic nonresidue by the 
quadratic reciprocity theorem, and any nonresidue is a primitive root because the 
size of the multiplicative group is a power of 2. With this choice the cycle (3) 
becomes 


1 2.3 2.9 210 213 2.5 215 211 2.16 214 £8 27 24 212 2.2 26 (5) 


Next, Gauss considered the sums of the terms of various subcycles of (5), which 
are called periods. A typical example is the period ny). = e' +e? te + +++ +27. 
For notational convenience a period will be represented by an J/-tuple of the 
exponents of ¢ which occur in the sum, and so the two periods of length eight are 
expressed as 


No.2 = (1,9, 13, 15, 16, 8, 4, 2) 
1,2 = (3,10,5, 11, 14,7, 12,6). (6) 
There are four periods of length four, namely, 
No.4 = (1, 13, 16, 4), No.4 = (9, 15,8, 2) 
m4 = (3,5,14,12), 3,4 = (10, 11,7, 6). 


There are eight periods of length two, but only two are needed for the construc- 
tion, 


(7) 


No,g = (1, 16) = 2 cos(27/17) 


14.3 = (13,4) = 2cos(87/17). (8) 
From (8), and using (7), we easily see that 
0,8 + 14,8 = No,4> 0,8 ° 4,8 = 1,4> 10,8 7 14,8- (9) 
Similarly it is straightforward to calculate that 
0,4 + 2,4 = No,2> o,4°N2,4= — 1, No.4 > 12,4 (10) 
1,4 + 13.4 = 1,25 M1,.4°3,4= —1, 11.47 13,4 (11) 
No.2 +M,2 = —1, N0,2° 1,2 = — 4; No,2 > 11,2: (12) 


The inequalities in (10) and (11) are geometrically obvious, and since a calculation 
shows that (74 — 12,4) ° (11,4 — 13.4) = 2(N9.2 — 14,2) it then follows that 
No.2 > 14,2 a8 claimed in (12). 

Equations (8)—(12) readily lead to the construction of the regular 17-gon as 
shown in FiGureE 3. It is assumed that we are given the unit circle centered at O 
and the x, y-axes. From (12) we see that No,2 and 7, » are, respectively, the larger 
and smaller roots of the quadratic equation x” + x — 4 = 0 whose corresponding 
Carlyle circle C_, _4 is centered at M,(—1/2, — 3/2). Therefore: 


(i) Draw the perpendicular bisector of QO, using a circle at Q through O and 
the existing circle. This construction has Lemoine symbol 2S, + S, + 
2C, + C, and locates Q’. 

(ii) Draw a circle at Q’ through P, (symbol 2C, + C,) to locate Mo. 

(iii) Draw the Carlyle circle at M, (symbol 2C, + C3) to locate Ho (np 2,0) 
and H, (7, 5,0). 
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Fic. 3 


From equations (10) and (11) we see that Carlyle circles centered at My 9(379 2, 0) 
and M, (37; 2,0) will locate Hy 4(1,4,0) and H, ,(7,,4,0). Therefore: 


(iv) Draw the perpendicular bisectors of OH, , and OH, 5, using the same 
circle at O (symbol 4S, + 2S, + 3C, + 3C;). This locates M, 2 and M, ». 

(v) Draw the Carlyle circles at My 2 and M, , (symbol 4C, + 2C;) to locate 
Ay, and H, 4. 


Turning next to Equation (9), we see that Hy .(79 g,0) requires a Carlyle circle at 
M, 4. Therefore: 


(vi) Set the compass to radius QH, , and draw a circle at O (symbol 3C, + C3) 
to locate Y(O, 1 + 1, 4). 
(vii) Draw the line YH, , (symbol 2S, + a 
(viii) Draw the perpendicular bisector to YH 4 symbol 25, + S, + 2C, + 2C3) 
to locate Mg 4. 
(ix) Draw the Carlyle circle at My 4 (symbol 2C, + C3) to locate Ho ¢. 


In view of equation (8), the following step locates P, and P,,, which correspond 
to «| and «°°, respectively. Except for laying off the distance P,P,, this completes 
the construction. 


(x) Set the compass to unit radius and draw a circle at Hy , (symbol 3C, + C3) 
to locate P, and P,,. 


The ten steps described above have total symbol 10S, + 58, + 23C, + 0C, + 
13C;, for a simplicity of 51. However, two simple modifications can reduce the 
simplicity of the construction to 45.* First, the costly double bisection of step (iv) 
can be avoided by using half-scaled Carlyle circles to locate M). and M, . This 
requires a perpendicular bisector of Q'O, but since the compass is already set at 


*Our thanks to the referee for suggesting these modifications. 
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unit radius the symbol is 2S, + S, + C, + C,. We also need a circle of radius OQ' 
at O (symbol 2C, + C;). Since there is no more need for step (iv) the simplicity is 
reduced by four. Step (viii) can also be replaced by a less costly procedure. 
Following step (vi) the perpendicular bisector of OY can be constructed with 
symbol 2S, + S, + C, + C,, since the compass need not be reset for the circle at 
Y. Step (vii) now locates Mj, , and so we have further reduced the simplicity by 
two. Altogether the modified Carlyle construction has symbol 8S, + 48, + 22C, 
+ 11C,, for a simplicity of 45. 

The Smith construction [12] is similar to that described in (i)—(x), but it requires 
a perpendicular bisector at H, 4, which increases the simplicity to 58. The 
construction of Richmond [11] (also shown in [13], [16], and elsewhere) seems to 
appear more often than its efficiency warrants. Even arranging for several circles 
to do double duty, as in the modified Carlyle construction, the simplicity is 53 
(which constructs P,; the simplicity increases to 60 to locate P,.) The compara- 
tively recent construction of Tietze [14], also described in Hall [7], can be done 
with simplicity 50. Neither reference provides a proof of the construction, and 
indeed Hall remarks that “the proof that it is correct requires extensive calcula- 
tion.” His assessment applies equally well to the method of Richmond. 


5. Construction of the Regular 257-gon. Since g = 3 is also a primitive root of 
257, the roots of z*°° + z*° + --- +1 =0 are ordered in the cycle 


where ¢ = e*™'/”’. The period of length / = (p — 1)/2”" which contains the term 
e®’ is denoted by 
i-1 


jtnk r 
Ne= d €8 ; k= 2", 
n=0 


In particular, 7), = —1 and we also observe that 9, , = n;,, when j =i (mod k). 
To set up the hierarchy of quadratic equations to be solved in succession, it is 

necessary to know the sum, product, and relative size of pairs of periods. This 

allows the construction of the periods by means of the appropriate Carlyle circles. 
The addition formulas 


Nj2k + Nj+k,2k = Nj,k (13) 


are obvious. Formulas for products require effort but perhaps less than expected. 
The following sample calculation of 19 64° 732 64 illustrates some useful principles 
of computation. Since 19 64 = (1, 241,256, 16) and 139 64 = (64, 4, 193, 253) are 
represented by quadruples of exponents, the product we seek corresponds to a 
modulo 257 addition table. 
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The main diagonal, it can be checked, is 133 64 = (65,245, 192,12), and the 
“diagonal” just below it is 755 54 = (48, 3, 209, 254). Altogether we find that 


No,64 ° 132,64 = 133,64 + 155,64 + 123,64 + 1,64: (14) 
Thus products can be computed by knowing which periods of the same length 
contain the terms in the first column. 
The right side of (14) can be simplified by use of the addition formulas (13), 
giving 
0,64 ° 132,64 = 11,32 + 123,32: (15) 
The derivation of (15) is actually enough from which to deduce the general formula 
obtained by increasing the first subscripts by j. That is, 


17,64 °° 174+32,64 = Nj+1,32 + 1j7+23,32° (16) 


Additional discussion of these computational details can be found in Rademacher 
[10]. 

The last algebraic difficulty is the ordering of 7, >, and 7,,, 2,. It is obvious 
that 1.64 > 32,64 but showing that 1,4 15 > 1216 is somewhat delicate. The 
methods described in Klein [9] and Rademacher [10] for the 17-gon can be 
extended to the present case however, thus allowing the identification of the larger 
and smaller root of a pair. 

The results of the calculations are shown in TABLE 1. The same information is 
contained in Bishop [3], but in a different form, since we have used formula (13) to 
simplify the expressions for the period products. Since it will be sufficient to 
construct the point P, on the unit circle whose x-coordinate is cos(27/257) = 
(1/2) No, 128, We observe that we need only construct two of the period pairs of 
length 4 and six of length 8. 


TABLE 1 


Products of Required Period Pairs Order of Period Pairs 
022 = —64 
j,4° Nj4+2,4 = —16, j= 0,1 0,4 > 2,45 11,4 > 13,4 


0,8 > N4,8> 12,8 > N6,8> 
11,8 < 15,8) 13,8 < 17,8 


17,8 ° 1)4+4,8 = —2+ 31) 41,2 + 21, 42,4 
J = 0,1,2,3 


1;,16° 1j+8,16 = 1),2 + 1,8 + Nj+2,8 + 27,45,8 
j=0,...,7 


0,16 7 18,169 14,16 > 12,16: 12,16 > 110, 16> 
6,16 < 114,16> 11,16 > 19,169 15,16 > 113, 16> 
13,16 7 111,16 7,16 > 115, 16 


0,32 7 116,32» 78,32 < N24,32> 
1,32 > 117,329 N9,32 < 25, 32> 


17,32 < 23,32> 715,32 > 131,32 
17,64 ° 1; 4+32,64 = 1,+1,32 + Nj+23,32 


j = 0,24 No, 64 > 132,64> 24,64 < 156,64 


10,128 © 64,128 ~ 156, 64 10, 128 > 64, 128 


1),32 ° )+16,32 = 1),16 + 1j7+1,16 


+1, 42,16 + 1)45,16 
j = 0,1, 7, 8, 9,15 


Translating the results of the table into an explicit construction is straight-for- 
ward, although we must be alert to carrying out the required procedure in an 
efficient manner. Beginning with the first row of information in TABLE 1 we see 
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that 9. and 7, 4 are the larger and smaller roots, respectively, of x? +x —- 64 = 
0, and so the first step is to construct the center M,(—1/2, — 63/2) of the Carlyle 
circle C_, _¢4. Successive distance doublings along the positive x-axis will locate 
7T(31, 0), and then swinging an arc from Q'(—1/2,0) of radius Q'T, will locate M, 
on the perpendicular bisector x = —1/2 of O and Q(—1,0). As shown in FicurE 
4, the Carlyle circle C_; 44 then locates Ho (m9 2,0) and A, (7, 2, 0). 


Fic. 4 


The construction of the four periods of length 64 is depicted in Figure 5. The 
point Y(O, — 15) is located by swinging an arc from O of radius OS, where S(15, 0) 
was constructed in the previous step. The midpoint M) , of YH, , 1s the center of 
the Carlyle circle which determines Hy 4(%o, 4,0) and H, Ne, 4,0). Similarly the 
points at x = 7,4, and x = 93,4 are found by the Carlyle circle centered at the 
midpoint M, , of YH, ». 


Fic. 5 


The remaining pairs of periods are constructed in an analogous way, although 
extra distance transfers are required because of the more complicated product 
expressions shown in TABLE 1. A circle of unit radius centered at Hy ,2¢ will cross 
the unit circle at P, and P,,,, which completes the construction up to laying off 
the distance P,P, to locate all of the vertices of the 257-gon. 
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To optimize the simplicity of the construction it is essential to transfer distances 
efficiently. For example some useful savings in constructing 10 16, 4, 16> 18, 16> 712, 16 
are possible when advantage is taken of the relation 1” 16 * 7g 16 + 4.16 * 12,16 = 
39,2 + 27,4. The construction of the three other quadruples of periods of length 
sixteen are aided by the corresponding algebraic relations. 

Beginning with the unit circle and the x, y-axes, the construction of 
P(cos(2 7/257), sin(2 7 /257)) has symbol 


94S, + 47S, + 275C, + OC, + 150C, 


for a Lemoine simplicity of 566. Among the 150 circles are 24 Carlyle circles. 


6. Remarks on the Construction of the Regular 65537-gon. We have already 
observed that g = 3 is a primitive root of p = 65537. The sum, product, and 
relative order of period pairs must then be computed (one feels sorry for the 
computerless Hermes!). The first line of the required table of information is 


No.2+ 1,2 = —1, No0,2° 1,2 = —2", No,2 > 1,2 (17) 


The product formula derives from the fact that the exponent sums evenly divide 
into quadratic residues and nonresidues. Drawing the Carlyle circle C_, _ 14 may 
place impossible demands on any compass we own, but at the conceptual level 
there is no problem. 

Successive levels in the hierarchy of quadratic equations require 2,4,8,... 
Carlyle circles. Working upwards however, and assuming the worst case possibility 
that the product formulas and sums involve all distinct periods, the largest possible 
number of Carlyle circles required is 1, 2, 6, 30, 270, 4590,... . Actually the 4590 is 
spurious, since there are only 2!° periods with 2° terms and so these can all be 
computed with 2? = 512 Carlyle circles. Even though we know little about the 
details of the construction, we can conclude that the construction we seek requires 
14+24+4+4+84+-:: +512 + 270+ 30+6+2+1= 1332 or fewer Carlyle 
circles. 
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1. Introduction. This paper is a discussion of three metrics which have arisen in 
different contexts over a period of many years. One of them is the hyperbolic 
metric in the unit disc of the complex plane. This metric is so named because it is 
the metric of Poincaré’s celebrated model for hyperbolic geometry. The hyperbolic 
metric is invariant under conformal maps of the disc onto itself, and quite naturally 
plays a central role in much current research in conformal and quasi-conformal 
mapping. 

The second metric, called the part metric, was introduced by the author in the 
study of complex function algebras. The underlying space for a complex function 
algebra breaks into disjoint sets, Gleason parts, on which a kind of Schwarz lemma 
holds. The part metric, was originally defined in these parts. The concepts of part 
and part metric were later extended to apply to the underlying space for a linear 
space of real continuous functions, and finally in a purely geometric setting, to 
appropriate subsets, again called parts, of a general convex set. 

The third metric we consider is the Cayley-Klein-Hilbert metric. Klein used this 
metric in his model for hyperbolic geometry, in which the points were also the 
points of the open unit disc in the complex plane. Hilbert extended the definition 
from the disc to general bounded convex sets in Euclidean space. 

These three metrics are closely related. In fact, it would be more exact to say 
they are identical, each being simply an avatar in the appropriate setting of one 
general metric. It is our purpose here to sort out the connections between the 
three versions, and to provide some new facts and insights about each. 

We start in Section 2 with an exposition of the hyperbolic metric p(z, w) in the 
unit disc. Although the hyperbolic metric is a standard tool in complex analysis, it 
is frequently overlooked in beginning analysis courses, and elementary explana- 
tions are hard to come by. It is easy to write down a formula for p(z, w), and not 
too hard to show that distances add up correctly on the circular arcs which form 
the lines in Poincaré’s model for hyperbolic geometry. However, it is not so easy to 
show that p is a metric! 

Section 3 treats the part metric d in the open unit disc. In this setting the part 
metric is most naturally defined in terms of the positive harmonic functions. The 


*This work was supported by a grant from the Institut d’Estudis Catalans, Centre de Recerca 
Matematica, Bellaterra (Barcelona). The author wishes to thank Professor Castellet of the C.R.M. and 
the many Spanish mathematicians who were so graciously hospitable. 
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Poisson kernels, which play a central role in the study of harmonic functions on the 
disc, are extremal among the positive harmonic functions, in a sense we will make 
precise later. The gradient curves of the Poisson kernels turn out to be the “lines” 
of the Poincaré model. Moreover, the values that a Poisson kernel function takes 
on each of its gradient lines constitute a coordinate system on each such hyperbolic 
line. This coordinate system gives both the part metric and the hyperbolic metric p 
in the disc in the sense that 


2p(z,w) =d(z,w) =llog P(z) — log P(w)| 


where P is the appropriate Poisson kernel. The Poisson kernels are some of the 
best known and most widely studied functions in analysis. It is therefore satisfying 
to notice that some of the results of Section 3 are both new and elementary. 

Our discussion in Section 3 is not the shortest route to the conclusion that 
d = 2p, but the route which seems to uncover the greatest number of interesting 
relationships. The part metric is, unlike the hyperbolic metric, obviously a metric, 
so the d = 2p equality obviates the need for a lot of geometric structure to show 
that p is a metric. Moreover, d is defined in very general plane domains, and is 
obviously conformally invariant. It provides, therefore, a natural generalization of 
p to these more general settings. 

In Section 5 we discuss the Klein metric in the unit disc, and Hilbert’s extension 
to general bounded open convex sets in R”. We indicate how this metric is very 
closely related to the part metric for convex sets as studied in [2], [3], [1]. 

Finally, in Section 6 we pose a few questions on possible extensions of these 
ideas. 


2. The hyperbolic metric. It is most convenient, and perhaps essential, to 
introduce the hyperbolic metric in the context of hyperbolic geometry. Our 
approach will be via the Birkhoff axioms [4] for plane geometry (Euclidean or 
hyperbolic). There is an excellent comparison of the Birkhoff and Hilbert systems 
in Moise [10, p. 113]. (See also Mac Lane [8] for a somewhat different axiom-sys- 
tem in the Birkhoff vein.) 

The Birkhoff axiomatic structure for plane geometry consists of a set of points, 
subsets of points called lines, a distance function DCP, Q) on pairs of points, and a 
measure of angles. The distance function is assumed to have all the usual 
properties of a metric (DCP, Q) > 0, DCP, Q) = DCO, P), DCP, OQ) = 0 iff P = Q), 
except that the triangle inequality is not assumed. In addition, it is assumed that 
each line is equipped with a coordinate system which gives the metric. That is, for 
each line / there is a one-to-one function f, on / onto (—~, %) such that whenever 
P,Q lie on 1, D(P,Q) = |f,(P) — f(Q)|. The only connection between distances 
on two different lines is given by the side-angle-side axiom for congruence of 
triangles. The triangle inequality, that D(P,Q) + D(Q, R) > DCP, R) unless 
P,Q,R lie on a line with Q between P and R, is a theorem. The triangle 
inequality depends essentially on the side-angle-side axiom, but is independent of 
the parallel axiom, and therefore holds equally well in Euclidean and hyperbolic 
geometry. 

In the Poincaré model for hyperbolic geometry, the points are the points of the 
open unit disc D in the complex plane. Thus the points can be identified with the 
complex numbers z with |z| < 1. The “lines” in this model are the intersections 
with D of circles or lines perpendicular to the unit circle [. A few pictures will 
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convince the reader that two points determine a unique “‘line’”’; 1.e., for each pair 
z,w there is exactly one circular arc which goes through z and w and is 
perpendicular to [. If z and w lie on a diameter of D, then the diameter is the 
“circular arc” through z and w. For each pair a, B of distinct points of T° there is 
a unique hyperbolic line (hereafter “‘h-line”) which is an open circular arc from a 
to B, perpendicular to [ at both a and B. We will use (a, B) to denote this h-line, 
and emphasize that a and # are not points of the line or points of the model. 

Let us recall some basic properties of the M6bius transformations (fractional 
linear transformations) which are functions of the form 

az +b 
Tz) cz +d’ 

The transformations (1) are one-to-one conformal maps of the complex plane plus 
the point at infinity onto itself. Mobius transformations form a group under 
composition, and each transformation takes every circle or line onto a circle or 
line. Since there are just three essential constants in (1), any MObius transforma- 
tion I’ is determined by specifying any three values: T(z;) = w,, i = 1,2, 3. 

The Mobius transformations of the form 


T(z) =p- 


ad — bc # 0. (1) 


Zo 
———, |pl=1, lz |< 1, (2) 
1 — Z9z 

map the unit circle [ onto itself, and the disc D onto itself. We will call the 
functions (2) disc-maps. The disc maps form a subgroup of the Mobius transforma- 
tions. 

Notice that any disc map (2) maps any h-line (a, 8B) onto some other h-line 
(T(a), T(B)). This is because T maps TF onto I, and the circular arc (a, B) onto 
some other circular arc in D. The image arc is also perpendicular to [, because T 
is a conformal mapping, and so the image arc is again an h-line. 

We will use the cross ratio, defined next, to put a coordinate system on each 
h-line, and these coordinate systems will give the hyperbolic metric in D. For any 
four complex numbers Zo, Z,, Z5, Z3, with z,, z,, z3 distinct, define the cross ratio 
as follows: 


(Z. — Z9)(23 — 2) 
(2; — Z)(Z3 — 22) @) 


A geometric interpretation of the cross ratio will appear in Section 5. Notice that 
interchanging z, and z,, or interchanging z, and z, gives the reciprocal cross-ratio: 


(Zo, 21> 22> z3) = 


(Zo, £9» Z1> Z3) = (25, Z19 Z45 z3), 
(4) 


(23, £15225 Zo) = (Zo, 215225 z3) 


For distinct z,, z,, z3, we define a Mobius transformation T by 


T(z) = (22452323) = : = , a. (5) 


T is the Mobius transformation which sends z,, z,, z3, respectively, to ~, 0, 1. 
The fact that the hyperbolic metric is invariant under disc maps will depend on 
the fact that cross ratios are invariant under disc maps, and indeed under all 
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Mobius transformations. To see this, let z,, z,, z, be any three distinct points and 
let T(z) = (%, 21, 22, Z3) be the Mdbius transformation which sends z,, z,, z3 to 
0,0, 1 respectively. Let R be any Mobius transformation, and let 


S(z) = (z, R(z,), R( zz), R(z3)). (6) 
Then S is of course the Mobius transformation which sends R(z,), R(z,), R(z;) to 
2,0), 1 respectively. Hence S~'°T sends z,, z3, z3 to R(z,), R(z,), R(z3), respec- 
tively. Since three values determine a Mobius transformation, S~'oT = R, or 
T =SoR. Thus 
T (29) = (Zo) 245 Za 23) = S(R(Z)) _ (R(z), R(z,), R(22), R(z3)), (7) 
and the cross ratio is invariant under any Mobius transformation R. 

The argument above also shows that any three points z,, z,, z,; can be mapped 
onto any three points w,,w,,w, by some Mobius transformation; specifically, if $ 
sends z,, Z,, Zz; to »,0,1 and T sends w,,w,,w 3 to »,0, 1, then T~'o S sends z;, to 
w,. Since all Mobius transformations map circles and lines onto circles or lines, and 
circles (lines) are determined by three points, it follows that any circle or line can 
be mapped onto any circle or line by a Mobius transformation. 

The cross ratio (Zp, Z,, Z>, Z3) is real if all z; lie on the real line. Hence 
(29, Z1, Z25 Z3) is real if and only if all z; lie on some circle or line, since any such 
circle or line could be mapped conformally onto the real line, and the cross ratio of 
the four points would not change. In particular, if z,w are points of the disc and 
lie on the h-line (a, B), then (a, z, w, B) is real. Moreover, (a, z,w, B) is positive 
since a mapping of the circle containing (a, B) onto the real line would send z and 
w to points between the images of a and B. 

Now we introduce a coordinate system on the h-line (—1, 1). Since any A-line 
can be disc-mapped onto (— 1, 1), this will provide a coordinate system for every 
h-line. For —1 <r < 1, let 
1 1l+r 
f(r) = slog . (8) 

2 1-r 


Clearly f is a one-to-one map on (—1,1) onto (—%,«). According to this 
coordinate system, the hyperbolic distance p between 0 and r is 


1 l+r 
p(0,r) =If(r) = 5 |los5—, (9) 
and the h-distance between points r, and r, is 
p(t.) =Iflr) -— fr)! 
1 1+r, 1+y7, 
_ I oe] (—1) ! =A] 
2 r,-(-1) 1-,Ffr, 
1 
= 5 llos(— 1 rir. D) (10) 


where (—1,7,, 75,1) is the cross-ratio defined in (3). The factor 3 in (8), (9), and 
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(10) is there for the following reason. The hyperbolic metric can also be character- 
ized as the minimum length of the curves y from z to w, where the length of y is 
given by the line integral 


|dz| 


length(y ) = [oa (11) 


For z= 0, w=r>0, the y of minimum length is the segment [0,r] and the 


integral (11) gives (9) with the factor } as indicated. The factor 3 also gives the 


normalizing condition lim, _,, p(0O, z)/|z| = 1. 
If (a, B) is any h-line, there are many disc maps JT which map (a, 8) onto 
(—1,1). For any such 7, the function 


flap (2) =f(T(z)) (12) 


puts a coordinate system on (a, 8). For any points z,w © (a, B) we then have 
p(z,w) =| fra, py Z) — fea, pW) 
IA(T(z)) — f(TOw)) | 


1 
5 llos(—-1, T(z), T(w), I) 


1 
5 llos(@, z,w, B)|. (13) 


The last equality holds if T(a) = —1, T(B) = 1, no matter which disc map T is 
used, because cross ratio is invariant under disc maps. From (4) we see that 


(B,z,w,a) =(a,z,w,B), (14) 


so (12) still holds if T maps a to 1 and B to —1 instead of the other way around. 
Notice that the method above of putting a coordinate system on each h-line 
automatically gives a distance function which is invariant under disc maps. Notice 
also that while different disc maps T put different coordinate systems on (a, B), 
they all yield the same distance formula. 

To summarize, we define the hyperbolic metric p on D as follows. For any 
points z,w, there is a unique h-line (a, B) determined by z and w. The cross ratio 
(a, z,w, B) is real and positive for z,w © (a, B), and 


p(z,w) = 4|log(a, z,, B)|. (15) 


Interchanging z and w, or a and 8B, gives the reciprocal of the cross ratio, so the 
absolute value of the logarithm does not change. The hyperbolic metric (15) is 
invariant under disc maps, since the cross ratio is, and the distances add up on 
h-lines (p(z,, z,) + plz, 23) = p(z,, z3) when z, is between z, and z,) because 
distances on each line are given by a coordinate system. 

As a last but very important observation we note that the triangle inequality 
holds for p; namely, p(z,, z,) + p(zz, z3) > p(z,, Z3) unless z,, Z,, z3 are h-colin- 
ear with z, between z, and z,. We have not shown this, and it is not obvious. The 
triangle inequality is a theorem of both Euclidean and hyperbolic geometry which 
depends in an essential way on the side-angle-side axiom for congruence of 
triangles. Thus the triangle inequality for the hyperbolic metric in the disc depends 
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on the non-trivial verification of the side-angle-side axiom for hyperbolic triangles 
in the Poincaré model. We will give a simpler proof in Section 4 using the equality 
between the part metric and the hyperbolic metric. 


3. The part metric in D. If z,w are any two points of D, then there exists a 
constant a > 0 depending only on z and w such that for every positive harmonic 
function u on D 


] 
7 <u(z)/u(w) <a. (1) 


Equation (1) is called Harnack’s inequality, and holds for all positive harmonic 
functions and points z,w in any plane domain, or indeed any domain in R”. 

Suppose instead of positive harmonic functions on D we consider the positive 
functions in any linear space B of (continuous) functions on any set X. The space 
B could consist of the real parts of the complex functions in some function algebra, 
for example. The existence of a constant a so that (1) holds for all positive 
functions in B defines an equivalence relation, z ~ w, on the points of XY. The 
equivalence classes of this relation are the Gleason parts of X with respect to B, 
and these parts were the original setting for the part metric [2]. 

To return to the disc D, we define the part metric d(z,w) for z,w in D as 
follows: 

d(z,w) = sup|log u(z) /u(w) | 


u>O 


sup |log u(z) — log u(w)|. (2) 


u>O0 


'The supremum is over all positive harmonic functions on D, and is finite because 
of the Harnack inequality (1). From the last equality in (2) it is clear that d is a 
metric. It is also immediate from the definition that d is invariant under disc maps 
T, for v = ucT is a positive harmonic function on D iff u is, so 

d(T(z),T(w)) = sup|log u(T(z)) — log u(T(w)) | 


u>O0 


sup |log v(z) — log v(w) | 


d(z,w). (3) 


Let us denote by H* the positive harmonic functions on D, and by H*(z,) the 
positive harmonic functions u such that u(z,) = 1. If we give the harmonic 
functions on D the topology of uniform convergence on compact subsets of D, 
then H*(z,) is a compact set for each zy. Since H*(z9) is also obviously convex, 
the Krein-Milman theorem [11, p. 207] guarantees that H*(z,) has extreme 
points, and is the closed convex hull of those extreme points. The theorem of 
Herglotz [12, Th. 11.19] states that u © H*(O) iff there is a unique positive 
probability measure yw on I such that 


u(z) = [ P.(z) du(a), (4) 
r 
where P,(z) is the Poisson kernel 


Pz) =(1-\zl’?)/Ja- zl’, lal =1. (5) 


1991] PART METRIC AND HYPERBOLIC METRIC 115 


The Herglotz theorem is an explicit form of the Krein-Milman theorem for the 
compact convex set H*(0). It puts H*(0) into a one-to-one affine correspondence 
with probability measures on I’. The extreme points of the set of measures are the 
point masses (one at each a € [) so the extreme points on H* (0) are the Poisson 
kernels P,(z), one for each a € T. 

A positive harmonic function OQ € H™* was called by R. S. Martin [9] a minimal 
function if0 <u <Q and u © H* imply that u = cQ for some positive constant 
c. The minimal functions are sometimes easier to work with than the extreme 
points of the compact sets H*(z,), and the following lemma makes clear the 
relationship between these two ideas. 


LEMMA 1. The minimal functions are exactly the positive multiples of the Poisson 
kernels. The extreme points of H*(z,) are the functions P,,/P,(Z); i.e., the Poisson 
kernels normalized to be one at z,, or equivalently the minimal functions normalized 
at Z,. 


Proof. Let Q be a minimal function and let z, be arbitrary. We show that 
Q/Q(z,) is an extreme point of H*(z,), and so in particular Q/Q(0) is a Poisson 
kernel. 

If, contrary to our assertion, Q/Q(z,) is not an extreme point of H*(z,), then 
there are u,v € H*(z,), and A € (0, 1) such that 

Q 
Q( Zo) 
Clearly Q/Q(z,) is a minimal function, since Q is. Both functions on the right side 
of (6) are positive, so both are less than Q/Q(z,). It follows that both u and vu are 
multiples of Q/Q(z,) and hence of Q. Thus u =v = Q/Q(z,) and Q/Q(z,) 
must be an extreme point of H*(z,). 

Now assume that Q/Q(z,) is an extreme point of H‘(z,), for some z,. To 

show that QO is minimal, assume that u € H* and u < QO. Then Q —~uandQ+u 


are in H*, and in H*(z,) when suitably normalized. Since Q is the average of 
O-uandQ+u, 


1 (Q(z) =u) [=u | 
Q(z) 2 O( Zo) loc, ~ u(Z9) 

_ 1 Co) | QO+u | 

2 Q( zo) Q(z) + u( zo) | 


Equation (7) displays Q/Q(z,) as a convex combination of two functions (in 
brackets) of H*(z,). Since Q/Q(z,) is extreme, both of the functions in brackets 
equal O/Q(z,), so u is a multiple of Q, which shows Q is minimal. 


=Au+(1—A)v. (6) 


(7) 


LEMMA 2. For any z,w © D there are Poisson kernels P,, Pg such that 


max u(z)/u(w) = P,(z)/P(w), (8) 
min, u(z)/u(w) = Pa(2)/Pa(). (9) 

It follows that 
d(z,w) =|log P.(z)/P,(w)| (10) 


for some Poisson kernel P,(y = a or y = 8). 


116 H. S. BEAR [February 


Notice that it is not clear at this point which is larger, the maximum value of 
u(z)/u(w), or the reciprocal of the minimum value of u(z)/u(w), or how the 
distinction depends on z and w. 


Proof. Fix z and w, and let F be the linear functional on H defined by 
F(u) = u(z). Clearly F is continuous, and so [11, p. 208] assumes its maximum on 
the compact set H*(w) at some extreme point P,/P,(w). Hence, for some Poisson 
kernel P,, 

sup{log u(z) /u(w):u € H*} 
= sup{log u(z): u € H*(w)} 
sup{log F(u): u € H*(w)} 


log P,(z)/P,(w). (11) 


Similarly, for some Ps 
inf{ —log u(z) /u(w): u € H*} 
inf{ —log u(z): u € H*(w)} 
= min{ —log F(u): u € H*(w)} 
—log P,(z)/P3(w) 
log P,(w) /P,(z). (12) 


Let us notice the connection between the Poisson kernel P,(z), |a| = 1, and the 
Mobius transformation 


at+z at+z)(a-Z 1-|z\* @z-—az 
nie) atte (ete) Atel 


a-z (a-z)\(@-Z) |ja—z\ \a-2\?’ 


Since @z — @Z iS pure imaginary, we have 


AaAatrzZ 
P(z) = Re| ). 

a-Zz 
If |zi=1, z#a, then Re7T,(z) = 0. Hence T, maps the unit circle onto the 
imaginary axis. Since 7\(0) = 1, T, maps the disc onto the right half-plane. The 
level curves of the Poisson kernel P, correspond to the lines x = constant under 
the mapping T.,. It follows that the level curves of P, are circles in D, tangent to I 
at a. 

The gradient lines for P, are of course perpendicular to the level curves. The 
gradient lines therefore correspond under 7, to the rays y = constant, x > 0. 
These gradients are therefore also circular arcs in D, from a to all other points of 
I’. In other words, the gradient lines of P, are exactly all hyperbolic lines ending at 
a, and P(z) goes from 0 to + along each such h-line. This suggests that if 
z,w € (a, B), with P,(z) > P,(w) (and P,(w) > P,(z)), then the part metric 
distance might be given by log P,(z)/P,(w) or by log P,(w)/P,(z). The next 
results show that this is indeed the case. 


Lemma 3. If T is any disc map, then Q is a minimal function iff QT is a 
minimal function. 


Proof. For all u,u € H* iff uoT © H*, andO<u<Q iff0 <ucT<QoT. 
If Q is minimal, and 0 < vu < QoT, then v = ucT for some u, with 0 <u < Q. 
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Hence u =cQ, v = cQoeT, and QT is minimal if QO is. Conversely, if O°T is 
minimal, then Q° T° T 7! = Q is minimal. 


THEOREM 4. If a, B are points of the unit circle T and P,, P, are the correspond- 
ing Poisson kernels (5), there is a constant Cyg So that for allz € (a, B) 


P,(2)P.(z) = Cag: (13) 
It follows that for all z,w € (a, B), 
P(z)/P.(w) = Pe(w) /Po(z)- (14) 


Notice that from (14) it follows that if z,w € (a, B) and if the part distance 
d(z,w) = log P,(z)/P,(w), then 


d(z,w) = log P,(z)/P,(w) = log Pg(w) /P,(z)- (15) 
It remains to be shown that d(z,w) is indeed given as above. 


Proof. Fix z) © (a, B) and let T be a disc map which sends z, to 0 and B to 1. 
(T(z) = wz — Z)/A — Z9z) sends z, to 0 and I onto IT. For the right choice of 
yt, |u| = 1, B will go to 1.) We have seen that disc maps send hyperbolic lines onto 
hyperbolic lines, so T maps (a, 8) onto the hyperbolic line through 0, ending at 1. 
Hence T maps (a, 8) onto the diameter (— 1, 1). Since P, (.e., P, with a = l)isa 
minimal function, by Lemma 3, P,;°T is a minimal function. Since P,°T has its 
pole at B, P,° T = cP, for some constant c. Moreover, P;° T(z)) = P,(0) = 1, so 

= 1/P,(2,), and 


Pz = Pa(Zo)(P,°T). (16) 
The same argument applied to —T shows that 
P, = P.(Zo)(P,°(-T)). (17) 
Let z € (a, B), and T(z) = r € (—1, 1). Then 
Po(z) = Po( 29) P(r) = Pozo) +r)/U — r)I, (18) 
and 
P.(z) = PalZo) Pir) = Palzo)[C — 1)/0 + 1)]. (19) 
Hence for any z € (a, B) 
P,( 2) Po(z) = Pal Zo) Pe Zo): (20) 


Since Z,) was arbitrary, we have the result. 


Coro.iaAryY. For any a, B & T and any z & (a, B), 


(1 -|z/’) 4 
PP) Pipa" la BP 


(21) 
Proof. As z > B on (a, B), z approaches B radially in the limit, so |B — z|/ 
(1 — |z|) > 1, and 


2 2 
1 —|2z\ 1 —|z| 4 
lim Oak) = |j Oo 12)" = ——_>5. (22) 
z>B la — z|\"|B —2z| z>B la-z| la — Bl 
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The next lemma and theorem show that if z,w € (a, B), in the order a, z, w, B, 
then d(z, w) is indeed given by log P,(z)/P,(w) = log P,(w)/P,(z). 


Lemma 5. If z,w € (a, B), and T is a disc map which sends a, z,w, B to 
1,0, r,1, with r > 0, then 


d(z,w) = log(1 +r)/(1 — r) = log P,(r) /P,(0). 
Proof. Since d is invariant under disc maps, d(z,w) = d(0, r). By definition, 
and Lemma 2, and Theorem 4, we can write 
d(0,r) = max log P,(r) /P,(0) 
yer 


= max log P 
max log P,(r) 


= max log(1 — r*) /ly - r|?. (23) 
ye 
Clearly |y — r| is minimum when y = 1, so 
d(0,r) = log(1 — r?2)/(1 — r)’ 


= log(1 + r)/(1 —r) 
= log P,(r). (24) 


THEOREM 6. If z,w € (a, B), in the order a, z,w, B, then 
d(z,w) = log P,(z)/P,(w) = log P.(w) /P,(z). (25) 
Proof. The two ratios are the same by Theorem 4. There is some disc map T 


which sends a, z,w, B respectively to —1,0,r,1 with r > 0. Recalling that P,°T 
= cP,, we have 


d(z,w) = d(0,r) 
= log P,(r) /P,(0) 
= log P,(T(w))/P,\(T(z)) 
= log Ps(w) /P,(z). (26) 


Coro.iaAry 1. For all z,w € D, 
d(z,w) = 2p(z,w). (27) 


Proof. By the definition (2-8), if r > 0, then 


p(0,r) = slog(1+r)/(1 —7r). 
By the result above (Lemma 5), d(0, r) = log(1 + r)/(. — r). For any two points z 
and w there is some disc map 7 which sends z to 0 and w to some r > 0. Since 
both d and p are invariant under disc maps, the result follows from the fact that 
d(0, r) = 2p(0, r). 


Coro.iary 2. If z,w € (a, B) in the order a, z,w, B. Then 
d(z,w) = log(a, z,w, B). (28) 
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Proof. This follows from Corollary 1 and (10) of Section 2. 
Corotiary 3. If z, Zy € (a, B) with Pg(z9) = 1, then 
P,(z) = (a, 29,2, 8). (29) 


Proof. If P,(zo) = 1, then using (28) 
d(Z,2Z) = |log P,(z)| | 
= llog(a, Zo 25 B)\. (30) 


Since P,(z) > 1 and (a, Z9, z, 8B) > 1 iff z is closer to B than Zp, (29) follows. 
Now we summarize our result. The function g(r) = (1 + r)/(1 — r) is a one-one 
map of (—1,1) onto (0,) and hence f(r) = 3 log(1 + 7r)/(1 — 1) is a one-one 
map of (—1, 1) onto (—», ©). Thus f puts a coordinate system on (— 1, 1), and the 
hyperbolic distance between points r,,r, of (—1,1) according to this coordinate 


system Is 
P(r1,72) =If(r2) -f(rpl = 5 log(—1,74,7,1). (31) 


Of course p(7,, 72) + p(r2, 73) = p(r,, 73) if -1 <r, <r,<r, <1. By mapping 
any h-line (a, B) onto (—1,1) with a disc map T, the coordinate system can be 
carried to (a, B) giving the distance 


p(z,w) = p(T(z),T(w)) = pllog(-1, T(z), T(w), 1) (32) 


for points z,w € (a, B). Distances add up correctly on each h-line (a, 8B) because 
they do on (—1, 1). Since cross ratios are invariant under disc maps, the distances 
on (a, 8) can also be given by 


p(z,w) = sllog(a, z,w, B)l. (33) 


The triangle inequality for p is not obvious, and follows as a theorem in either 
Euclidean or hyperbolic geometry once the side-angle-side axiom for triangle 
congruence Is verified. 

The part metric d 1s defined by 


d(z,w) = sup{|log u(z) /u(w)|: u € A}. (34) 


This definition makes sense on any plane domain on which there are non-constant 
positive harmonic functions. (It is enough to assume the complement of the 
domain has at least two points.) More generally, the definition (34) makes sense on 
the Gleason parts of any function space. For z,w, in the disc, the sup in (34) is 
attained for both u = P, and for u = P,, where z,w © (a, B), and 


d(z,w) =|log P,(z) — log P,(w)| =|log P,(z) — log P,(w)|. (35) 


The part metric is invariant under conformal maps for any domain, and in the disc, 
d = 2p. 


4. The Schwarz-Pick lemma and the Riemann mapping theorem. The fact that 
the part metric and the hyperbolic metric coincide in the disc makes possible a 
very simple proof of the Schwarz-Pick lemma, and provides a more intuitive 
variant of the proof of the Riemann mapping theorem. We start with a simple 
proof of the fact that equality holds in the triangle inequality only for colinear 
points. 
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Lemma 1. If z,g,w © D and d(z,£) + d(é,w) = d(z,w), then ¢ lies on the 
hyperbolic interval [ z, w]. 


Proof. It suffices to consider the case z = 0 and w = r > 0 since any z and w 
can be disc-mapped into these points with distances preserved. So assume that 


d(0,g) + d(é,r) = d(0,r) (1) 
for some r > 0 and some ¢ € D. Clearly d(0, £) = d(0, |Z}) since rotation is a disc 


map and preserves distances. It follows that 0 < |¢| <r since d(¢,r) > 0. Since 
d(0, £) = d(0, ||), it follows that 


d(g,r) =d(lél,r). (2) 
We know that 


d(\¢l,r) = log P(r) /P,(1¢)) 


1-r? (1-|g))’ 
= log ———~ ——___—_ 3 
“dary 1-1P ©) 
and that 
d(é,r) => log P,(r)/P,(Z) 
1-r? ft-¢i? 
= ———s (4) 


log ————— 
(1—r)° 11g 
The right sides of (3) and (4) differ only by the factor (1 — |£))* in (3) and the 
factor [1 — ¢|* in (4). Clearly |1 — | > 1 — [Z| unless ¢ is real, so equality in (1) 
implies that ¢ € [0, r]. 
THEOREM 2. (Schwarz-Pick lemma [6, p. 41]). Jf fis an analytic map of D into D 


and f preserves the hyperbolic distance between any two points, then f is a disc map 
and preserves all distances. 


Proof. \t is immediate from the definition that any analytic map decreases (<) 
all distances: 


d( f(z), f(w)) 


sup [log u( f(z)) — log u( f(w))| 


< sup |log v(z) — log v(w) |, (5) 


v>0 


since every uo f is a harmonic v, but not vice-versa unless f is one-to-one onto D. 
Suppose that d( f(z), f(w)) = d(z, w) for some pair z, w, and let £ be any point of 
the hyperbolic segment [z, w]. Then 


d(z,w) =d(z,f)+d(Z,w) 
= d( f(z), f(S)) + a(f(2), Fw) 
= d( f(z), f(w)) =d(z,w). (6) 
It follows that equality holds throughout in (6), and that f(¢) lies on the hyperbolic 
segment [ f(z), f(w)] and moreover that f(£) is the same distance from f(z) that ¢ 


is from z. If T is a disc map which sends z into f(z) and the ray from z toward w 
onto the ray from f(z) toward f(w), then 7 also preserves distances along the 
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line, so T(Z) = f(Z) for all ¢ © [z,w]. Consequently f = 7, and f preserves all 
distances. 

We now outline a proof of the Riemann mapping theorem which provides a 
more geometric intuition than the usual one. First one shows as usual that if G isa 
simply connected proper subdomain of C there is a one-to-one map f, of G into 
the disc D. If z, € G, then h(z) = yVZ — Zp Opens the plane up so that some disc 
D(a, r) = {z: |z — al < r} is missing from A[G]. Since |h(z) — al >r for z € G, 
fo(z) = r[h(z) — a]~' has modulus less than one on G. Now consider the set S of 
all one-to-one analytic maps f on G into D which spread every pair, z,w farther 
apart in D than f, does; i.e., all f so that 


d( f(z), f(w)) = d(folz), fo(w)) (7) 


for all z,w € G. The set S is compact in the topology of uniform convergence on 
compact subsets of G. This is because S is a normal family (a precompact set), and 
a limit function is clearly also one-to-one, since it spreads all pairs z,w at least as 
much as f, does. 

For any fixed pair z),w, © G, the function d(f(z,), f(wo)) is a continuous 
function of f, for f © S. Hence there is some g € S such that d(g(z,), g(wo)) is 
maximum. If g[G] is not all of D, let a € g[G], and let T(z) = (z — a)/(1 — az) 
map a onto 0. Then T,° g maps Zz, and w, maximally far apart, and 0 is not in the 
range of T,° g. There is a square root branch R which is one-to-one on the simply 
connected range of T, ° g. Since the squaring function on D to D is not one-to-one, 
it strictly decreases all distances. Hence R strictly increases all distances for points 
in its domain. Hence Re T° g is a one-to-one function on G to D which sends Z, 
and w, farther apart than g does. The contradiction shows that g does indeed 
map g onto D if g.maps any pair z, and wy, maximally far apart. 


5. The Klein-Hilbert part metric in a convex set. In Klein’s model for hyper- 
bolic geometry, the points are the points of the open unit disc D, and the lines are 
the chords in D. If z,w are points of D and a and B are the endpoints of the 
chord through z and w, then the Klein metric is given by 6(z, w) = llog(a, z, w, B)l, 
where (a, z,w, B) again denotes the cross-ratio. Hilbert later observed that this 
metric works for points of an arbitrary bounded open convex set in R” (see [5, p. 
105]). We show below that the part metric as extended to convex sets in [2], [3], [1] 
is essentially the same as the Klein metric. 

If K is any convex set we define an equivalence relation x ~ y for points 
z,y €K by agreeing that x ~ y iff the segment [x, y] extends beyond x and y in 
K. That is, x ~ y iff x and y are both interior points of some segment [ x’, y'] C K. 
A simple two-dimensional picture shows that if the segment [x, y] extends both 
ways to a segment [x', y'] in K, and [y, z] extends to a segment [y", z'] in K, then 
[x, z] extends to a segment [x", z”] where x" lies on [x’, y"] and z” lies on [y’, z’]. 
Hence x ~ y and y ~z imply x ~ z, and ~ is an equivalence relation. 

The equivalence classes of the above relation are called the parts of K. For 
example, if K is a closed tetrahedron, the four extreme points are singleton parts, 
the six open edges are parts, the four open faces are parts, and the interior is one 
part. If K is an open convex set in R”, the K has just one part; 1.e., x ~ y for all 
x,yEK. 

This concept of part for a convex set becomes a generalization of the original 
idea of part for a function algebra or function space in the following way. Let B be 
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a linear space of continuous real functions u on a compact space X. The functions 
u are represented as linear functionals on the unit ball of the adjoint space B”*. 
This ball is a convex set K, and compact in the appropriate topology. There is a 
constant a so that 1/a < u(x)/u(y) <a for all positive u in B, if and only if the 
segment [x, y] extends in K; i.e., the two concepts of part coincide. Here we are 
using x and y to represent both points of X and the corresponding evaluation 
functionals in B*. Thus the parts of X are just the intersections of XY with the 
parts of the unit ball K of B*. (The details are in [2].) 

There is not necessarily a distance function in a general convex set, so the 
segments in a general K do not have a length a priori. Nevertheless, the ratio of 
lengths of segments on the same line makes sense, since we can define ay / ax to 
be r if y=a+r(x — a). The ratio ay / ax can also be thought of as the maximum 
of the ratios u(y)/u(x) as u ranges over positive linear functions on the segment 
[a,b]. A picture of a positive linear function on a real interval [a,b] with 
a<x<y<b, shows that u(y)/u(x) is maximum for uw > 0 when u(a) = 0, in 
which case u(y)/u(x) is the same as ay / aX. 

The Klein metric in a convex set is 


a xb 
6(x,y) = log= + log— = log(a, x, y,b). (1) 
The part metric would be 
u(x) 
d( x,y) = sup; jlog -u > 0, u linear}, (2) 
u(y) 


which would be the larger of log(ay / ax) and log(xb/ yb). Thus the Klein metric 
looks at how close the points are to the boundary on both sides of segment [| x, y], 
whereas the part metric looks only at the closest boundary point. Because of this 
one-sided aspect of the part metric, the part metric is defined in any convex set 
which contains no whole line, whereas the Klein metric is defined only for convex 
sets which intersect every line in a bounded interval. 

If one looks at the details of the proof of the triangle inequality for the part or 
Klein metric in a convex set [3], one sees that equality can hold (d(x, y) + d(y, z) 
= d(x, z)) for points x, y, z which are not colinear. This happens, however, only 
when the convex set contains line segments in its boundary. For example, if K is 
the interior of the unit square, K = (0,1) x (0,1), then d(x, y) + dy, z) = d(x, z) 
whenever x, y, Z are points such that the line determined by x and y and the line 
determined by y and z both intersect 0K only on the vertical sides. If K is strictly 
convex, so there are no line segments on 0K, then equality holds in the triangle 
inequality only when x, y, z are colinear. In this case we would have a model for a 
geometry which satisfies all the axioms for hyperbolic geometry except the side- 
angle-side axiom. This geometry would nevertheless have a bona-fide metric 
satisfying the strict triangle inequality. 

In defense of our apparently heartless abandonment of the side-angle-side 
axiom, we offer this quote of Choquet |7, p. 79] concerning the “unwarranted 
universal use” of angles. ““There are strong reasons for doing away with this and 
developing almost the whole of our geometry without measuring a single angle. 
Although it is an essential tool of analysis and applied mathematics, angle is often 
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merely baggage, as far as geometry is concerned, and sometimes a source of 
error.” 


6. Some questions. The part metric is defined in terms of the positive harmonic 
functions in quite general domains. In particular, if G is a plane domain whose 
complement has at least two points then there are non-constant positive harmonic 
functions, and d is a conformally invariant metric. There are other conformally 
invariant metrics on general domains, such as the Bergman metric and the 
Carathéodory metric. Are they different? 

For the disc, the gradients of the minimal functions fill up the domain nicely, so 
that for any two points z and w there is exactly one gradient line (one hyperbolic 
line) through z and w. The h-line (a, B), which is the gradient for both P, and Pa, 
is furnished with a coordinate function log P,, or equivalently log P,, which gives a 
hyperbolic distance. Does this happen in more general domains? Since minimal 
functions are preserved by conformal maps of any domain on itself, any such 
domain would provide a model for hyperbolic geometry. Are there multiply 
connected models for hyperbolic geometry? 

In linear programming one maximizes a linear function on some convex set. 
This maximum will always occur at an extreme point, and the art consists of 
finding one’s way as quickly as possible to the right extreme points. The convex 
sets in question will generally have flat boundaries, so the part metric will satisfy 
d(x, y) + d(y, z) = d(x, z) not only when x, y, and z are colinear, but when the 
lines x,y and y,z intersect on the boundary on the same segment. Is this 
property, or other properties of the part metric relevant to the search for an 
appropriate extreme point? 
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Arcs with Positive Measure and a Space-Filling Curve 


TIMOTHY LANCE AND EDWARD THOMAS 
Department of Mathematics and Statistics, State University of New York, Albany, NY 12222 


In this note we construct a one-parameter family of arcs in R* with positive 
two-dimensional Lebesgue measure, which converge uniformly to a space-filling 
curve. By an arc we mean a homeomorphism of J = [0,1] into J x J. The existence 
of such maps is not news, but our procedure for constructing them is so transpar- 
ently simple that it can be presented in undergraduate or graduate courses in 
advanced calculus, real analysis, or topology. Furthermore, each of these curves is 
the uniform limit of a recursively defined sequence of very simple functions. The 
examples of space filling curves of Peano or Hilbert, which are generally cited (e.g., 
[1], [2], or (3]), are also constructed recursively, but the recursions are more 
complicated. All of them pass from one stage to the next by splicing’ together 
reduced and rotated copies of an initial figure. In our construction, no rotation is 
necessary to get them to fit, so the recursion has a much simpler description. Our 
construction is similar to one found in Section 10.10 of [1]. This work was 
motivated by problem E 2975, posed by F. B. Jones in the December, 1982, issue 
of the American Mathematical Monthly. We wish to thank the referee for several 
helpful suggestions. 

Let us first describe a typical member of our family of arcs intuitively. Imagine a 
necklace composed of four congruent squares and three connecting line segments 
arranged as in FiGureE 1. In each square we put another necklace, but one in which 
the “+” shaped region is proportionately much thinner. By leaving the original 
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Fic. 1. The recursion operator R,. 
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connecting line segments in place, we obtain a necklace with sixteen squares and 
fifteen segments. Iterating this process yields a family of necklaces whose intersec- 
tion is an arc. (To prove this, note that the intersection A of the necklaces will be 
compact, connected, locally connected and, therefore, arcwise connected [2]. Let B 
be an arc in A which joins the northwest corner to the southeast corner. Clearly B 
contains all the connecting segments in each necklace. These are dense in A, 
hence B = A.) 

The arc we have constructed contains the two-dimensional Cantor set deter- 
mined by the system of squares. By choosing the squares appropriately the Cantor 
set—and hence the arc itself—can have any preassigned two-dimensional Lebesgue 
measure. By varying the system of squares continuously, we obtain a one parame- 
ter family of arcs whose limit will be a space-filling curve. Intuitively, the isotopy of 
these arcs to a space-filling curve is obtained by squeezing all the area out of the 
“+”? shaped regions. 

We begin by defining, for each s € (0, 1], the replication operator R,. Suppose 
g: 1 > IX /isany path with g(0) = (0,1) and g(1) = (1,0). Then Rg): IT >I XI 
is the path represented schematically by the following diagram. 

Thus on each of the subintervals [0, 1/7], [2/7,3/7], [4/7,5/7], and [6/7, 1], 
Rg) traces out a reduced and translated copy of the path g, and it is the 
indicated straight line paths on the remaining three subintervals. For any s € (0, 1] 
we define maps f, ,,: J > I X I recursively by letting f, y be the path t > (t,1 — f£) 
tracing one of the diagonals of J x J and setting 


fon — Rassy2( fos ats)n—1) 


For example, we have the following graphs for s = 1/2. 


Fic. 2. The first two iterations for f; /2. 


THEOREM. For each s & (0,1) and t € [0,1] the limit 
f(t) = tim f, (0) 


exists and defines a continuous map (0,1) XI >IxI. If 0<s <1 the path 
t > f(t) is an arc with Lebesgue measure s*. The path t > f(t) is a space-filling 
curve in I X I which is the uniform limit of the arcs t > f(t). 


Proof. The maps f,,, and f, ,,, 2 <m, agree except on 4” subintervals of I 
each of length (1/7)”. On each of those subintervals their values lie in a square at 
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most (1/2)” on a side, and the center of that square clearly varies continuously in 
s. The existence and continuity of (s, t) > f,(t) follows. If s < 1 each of the maps 
t > f, ,(t) is one to one. Thus if |t; — t,| > 1/7”, then either ¢; and ¢, both lie in 
an interval where f, ,, = f, or they are sent by f, , (and hence by f,) to different 
regions of J x I. Thus t > f,(t) is an arc for 0 <s < 1. 

Each application of operator R divides a given square into four smaller squares 
and a cross-shaped region in between, and in that region the graphs of f, ,, and, 
hence, the graph of f, clearly have measure 0. Thus if A, denotes the union of the 
4” squares in [XI] used in defining f, then the Lebesgue measure of 


»n? 


fd) equals that of ()A,. But it follows inductively that A, has measure 


n=1 
(1 + (2” — 1)s)/2”)*. When s = 1 the curve t > f,(t) not only has measure 1 but 
its image contains each of the lines x = k/2”" and y =k/2” for k = 1,2,...,2” 
— land vn = 1,2,3,....Thus fd) =7 xi. O 


Although a variety of different recursions will yield arcs of positive measure, 
one must be sure that the cross-shaped regions are squeezed to zero fast enough. 
For example, even if g, is a space-filling curve, the recursion g, = R,(g,_,) 
converges to an arc of measure 0 whenever s < 1. When s = 1, however, we do get 
the simple recursion 


tion — Rift n—1)- 


The resulting sequence of functions which converges to the space filling curve f, 
begins as follows. Notice that these curves are partially collapsed versions of the 
first three iterations in FIGURE 2. 
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Fic. 3. Three iterations for the space filling curve f;. 


Finally, we note that all of the maps above are related by continuous deforma- 
tions of the entire square J X J. Fix some s € (0, 1], and consider the Cantor set 
C, obtained by removing an open interval of length (1 — s)/2 from the middle of 
I, open intervals of length (1 — s)/8 from the center of the two resulting open 
intervals, open intervals of length (1 — s)/32 from the four resulting intervals, and 
so on. The Cantor set C, has one dimensional Lebesgue measure s. There is a 
standard mapping of C, onto J, the Cantor function ®, (see [1] or [2]), which can 
be embedded as the last stage of a one parameter family of maps t > Q, ,. In this 
family, ®, is the identity, ®, , is a homeomorphism for ¢ < 1, and ®, , restricted 
to C, equals ®,. It is possible to write formulas for ®, ,, but it is simpler to draw a 
picture which illustrates the dynamics. 
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Fic. 4. A homotopy from the identity to the Cantor function. 


The triangular regions are chosen with their top vertices at rational numbers of 
the form (2k + 1)/2”. Thus if we delete the interior of all such triangles, the 
intersection of the resulting set with the line y =f is precisely the Cantor set 
C,41—1) described above. Define a two-parameter family of maps F, ,: J x I > 
IXI by F, (x, y) = (®, (x), ®, (y)) for s € 0,1] and t,x, y € [0,1]. This is 
clearly jointly continuous in all four variables. For fixed s the family F,,° f, is 
precisely the family f,,..;,. Running this homotopy backwards gives us an 


unfolding of J x J into the arc f, with Lebesgue measure 57, 
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Unions and Common Complements of Subspaces 


K. P. S. BHASKARA RAO 
Department of Mathematics, Wesleyan University, Middleton, CT 06457 


A. RAMACHANDRA RAO 
Indian Statistical Institute, Calcutta-35, India 


The following exercise can be found in most undergraduate text-books on linear 
algebra: show that if S and T are subspaces of a vector space, S U 7 is a subspace 
if and only if either §S Cc T or 7 CS. In our experience, most of the students are 
able to show this and a few get curious about what happens for three or more 
subspaces. 

The most natural conjecture for the general case would be that the union of a 
family “ of subspaces of a vector space V over a field F' is itself a subspace of V 
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if and only if some member of —“ contains all the others. But one can immediately 
see that this is false since any vector space with dimension at least two is the union 
of its subspaces with codimension 1 and none of these can contain the others. So 
we raise the question: can we impose some reasonable restrictions on the cardinal- 
ities of .“ and F and the dimensions of the subspaces in .“ to make the 
conjecture true? The answer is, indeed, yes. For example, the conjecture is known 
to be true if ” is assumed to be finite and F infinite, see Problem 21, p. 177 of [2] 
or Theorem 1 below. In this note we present other reasonable conditions under 
which the conjecture is true. We also study the existence of a common complement 
for a family of subspaces, which turns out to be related to the above problem, and 
deduce a result of Lord [4]. 

To set up the notation let V be a (not necessarily finite dimensional) vector 
Space over a (not necessarily infinite) field F. We will use |A| to denote the 
cardinality of set A. For a subspace S, dim(S) denotes the dimension of S while 
cod(S) denotes the codimension of S. It may be noted that cod($) is the dimension 
of any complement of S and equals dim(V/S). Sp(X) will denote the linear span 
of the set X of vectors. 

To start with we will show that for any vector space V over F with dimension at 
least 2, we must have |J| <|F| +1 for the conjecture to have any chance of 
holding. To see this, fix a basis {x;: i € I} of V and two distinct elements j and k 
of J. Consider the |F| + 1 subspaces W, = Sp({x; + ax,} U {x;: i #j, k}) for all 
a € F and W, = Sp({x;: i # j}). It is easy to check that these are distinct subspaces 
with codimension 1 (so none of them can contain the others) and that their union 
is V. 

However, |J| < |/| + 1 alone is not sufficient: consider the vector space V of all 
polynomials over R and the subspaces S,,5,,5,... where S, consists of all 
polynomials with degree at most i—clearly no S; contains all the others though 
their union is V. Thus, to prove the conjecture, we have to make some further 
assumption: for example, the family is finite or the subspaces are of bounded 
dimension. We now proceed to prove the conjecture under each of these assump- 
tions. 


THEOREM 1. Let S,,S,,..., 5S, be finitely many subspaces of V with k <|F\ + 1. 
Then 8, US,U ++: US, is a subspace if and only if some S, contains the others. 


Proof. The “if” part of the theorem is obvious. We prove the other part by 
induction on k. The case k = 1 is trivial. We now assume the result for k — 1 and 
prove it for k. So let S,U--: US, be a subspace and k <|F\+ 1. If 
S,2S,U-+:: US,, we are through. So let S$; BS,U-+: US,. We shall show 
that S$, CS,U--: US,. To this end, fix a y €(S,U--- US,) — S, and let 
x © S,. Then for every scalar a, ax + y € S,. Since S$; U +>: US, is a subspace, 
axt+ty€8S,U-:: US,. Since ax + y # Bx +y whenever a # B and |F| > k, it 
follows that there exist j with 2 <j <k and a #8 in F such that ax + y and 
Bx +y are in S$, Then x € S;. Thus S$, CS, U--: US,. Now the result follows 
from the induction hypothesis. @ 


We now consider the case of a finite-dimensional vector space before going on 
to subspaces with bounded dimension in an infinite-dimensional vector space. We 
state only the non-trivial part in the remaining theorems. 
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THEOREM 2. Let {S;: i € I} be a family of subspaces of a finite-dimensional 
vector space V with |I| <|F|+ 1. If S:= U,;.,S; is a subspace then S = S, for 
somet € 1, 


Proof. We prove the theorem by induction on n, the dimension of S. For n = 1 
the result is easy to prove. Next assume the result for m — 1 and let n > 2. 
Suppose that no S, equals S. If H is any subspace of S with dimension n — 1, 


H=HOS=HO(US)- LJ) (H0S;), 
ic] ic] 
so by the induction hypothesis, H = H 1 S, for some j € J. Then H CS, and S, is 
a proper subspace of S. Since dim(H) = dim(S) — 1 it follows that H = S,. Thus 
every subspace of S with dimension n — | is one of the S,’s. Since, as we saw in 
the first example before Theorem 1, there are at least |F| + 1 such subspaces of S, 
we get |J| > |F| + 1, a contradiction which proves the theorem. M& 


For a denumerable family Theorem 2 was proved earlier by Byrd [1]. 
We now deduce the result for a family of subspaces with bounded dimension. 


THEOREM 3. Let {S;: i € I} be a family of subspaces of V with |I| < |F| + 1. Also 
let m be a positive integer such that dim(S;) < m for alli € I. If S = U,.,S; is a 
subspace then S = §, for somet € I. 


Proof. We first show that dim(S) <m. Suppose not. Then there exists a 
subspace W of S with dimension m+ 1. Applying Theorem 2 to the family 
{WO S,;: i € I} of subspaces of W, we get W = WO S, for some i € J, a contra- 
diction since dim(W ) = m + 1 and dim(S,) < m. Now an application of Theorem 
2 to the family {S,: i < I} of subspaces of S yields the present theorem. @ 


We now use the above results to give conditions under which a family of 
subspaces of a vector space has a common complement. 

Observe that if the S,’s have a common complement then they must have the 
same dimension and the same codimension. We restrict our attention to families of 
subspaces with common finite dimension or common finite codimension and we 
Start with the latter case which happens to be easier. 


THEOREM 4. Let {S,: i € I} be a family of subspaces of V with |I| <|F|+ 1. If 
the S,;’s have a common finite codimension p and either I is finite or V is finite-dimen- 
sional, then the S,;'s have a common complement. 


Proof. We prove the theorem by induction on p. If p = 0, {0} is a common 
complement of the S,’s. Next assume the result for codimension p — | and let 
cod(S,) = p for all i © J, where p > 1. Then by Theorems 1 and 2, U, S; # V. Fix 
an x in V— U,S, and consider T; = Sp(S, U {x}) for all i € I. Clearly cod(7;) = 
p — 1 for all i, so by the induction hypothesis, the T,’s have a common complement 
W. Sp(W U {x}) is then a common complement of the S,’s. & 


CorROLLARY 5 (Lord [4]). Let V be an n-dimensional vector space over IR or C 
and let {S,: i € N} be a countable family of subspaces of V each having the same 
dimension. Then the S;s have a common complement. 


The proof of Corollary 5 essentially appears in Byrd [1]. 
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THEOREM 6. Let {S,: i € I} be a family of subspaces of V with |I| <|F| + 1. If 
the S,;'s have a common finite dimension, they have a common complement. 


Proof. If V is finite-dimensional, this follows from Theorem 4. Otherwise, 
applying Zorn’s lemma to the family 7 of all subspaces T of V with the property 
T 1S, = {0} for all i gives a maximal element 7, of .7. We now prove that T, is a 
common complement of the S;’s. Suppose 7) is not a complement of S;. Let @ be 
the natural homomorphism from V to V/T,. Since TM S, = {0} it follows that 
dim(¢(S;)) = dim(S;) = m (say) for all i € J. Since Ty is not a complement of S; it 
follows that g(S,;) is a proper subspace of V/T,, so dim(V/T,) > m. So by 
Theorem 3, U; ¢(S;) # V/T,. Hence, there is a 1-dimensional subspace X of 
V/T, such that X 1 o(S,) = {0} for all i. Now @~ '(X) is a member of 7 properly 
containing 7, a contradiction which proves that 7) is a common complement of 
the S,’s. B 


We now show that when dim(V) > 2, the condition |J| < |F| + 1 cannot be 
dropped in Theorems 4 and 6. Regarding Theorem 4 it is enough to consider 
(again) |F| +1 subspaces of V with codimension 1 and union V. Regarding 
Theorem 6 we first choose a subspace Y of V with dimension 2. Then we consider 
a family {S,: i © J} of |F| + 1 subspaces of Y with common dimension 1 and union 
Y. Suppose the S,’s have a common complement T in V. Then S,+ (TOY) = 
(S.+ T)A Y=Y for all i, so TM Y is a common complement of the S,’s in Y, an 
impossibility which proves that the S,’s cannot have a common complement in V. 

It is easy to see that the condition “either J is finite or V is finite dimensional” 
in Theorem 4 cannot be dropped. For this, take a basis B = {x;: j © J} of an 
infinite-dimensional vector space V and consider {Sp(B — {x,}): i € J} for any 
countably infinite:-subset J of J. Indeed, the same example also shows that 
“dimension” in Theorem 6 cannot be replaced by “‘codimension.”’ 

We end this note by answering the problems raised in [3] and [4]. In [3] the 
problem was raised as to what are the results analogous to Corollary 5 which hold 
for vector spaces over finite fields. The answer is provided by our Theorems 4 
and 6. 

The problem raised in [4] was what is the maximum number of distinct 
m-dimensional subspaces of an n-dimensional vector space V over a finite field F 
that have a common complement? The answer is the number of distinct comple- 
ments of an (n — m)-dimensional subspace JT since this number is the same for all 
T. To find this number, fix an ordered basis B of 7. Any complement of T can be 
obtained as Sp(C) where BUC is an extension of B to a basis of V. This 
extension can be done in 


(\F\" — |F\""")(FI" — (FIO **) + (FI? — |FI"™) 


ways. But different C’s can give rise to the same complement of 7. The number of 
C’s giving rise to the same complement is the same as the number of bases of an 
m-dimensional subspace and this is (/F\” — 1)\F/”" —|F)---(F|” —|F|”~'). 
Thus the required number is 


(\F |” — FI?) (F |” — FY") wae (LF|” = |F|"~") 


m m m m—1 — |r, 
(\F|" — 1)(|FI" — |Fl) +++ (FI — |FI"") 
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The Generation of All Rational Orthogonal Matrices 


Hans LIEBECK AND ANTHONY OSBORNE 
Department of Mathematics, University of Keele, Staffordshire, STS 5BG, England 


In a letter to this MonTuLy [1] John Cremona shows how to generate all 3 x 3 
orthogonal matrices with rational coefficients. His method is based on the real 
algebra of quaternions. By a further application of quaternions one can obtain 
4 x 4 rational orthogonal matrices. (See du Val [2].) 

In this note we show how to generate all n Xn rational orthogonal matrices 
and hence all orthonormal bases of the rational vector space Q”. At the same time 
we obtain all real orthogonal matrices and all complex unitary matrices. Our 
method is based on a further piece of mathematics from the last century—Cayley’s 
formula for orthogonal matrices. 


Preliminaries. Let A and B be n Xn matrices. We shall say that B is 
equivalent to A, and write B ~ A, if and only if there exists a diagonal matrix D 
with diagonal entries selected from the set {—1, 1}'such that B = DA. Clearly B is 
equivalent to A if and only if, for each i = 1,...,7, the ith row of B is +ith row 
of A. The relation ~ is an equivalence relation on the set of m X n matrices, and 
the equivalence class @(A) which contains A has at most 2” members. 


LEMMA. Let A be an n Xn matrix over a field of characteristic # 2. Then at 
least one of the matrices in @(A) does not have eigenvalue 1. 


Proof. By induction on n. For n = 1 the result is true, since the 1 X 1 matrices 
A and —A cannot both be [1] over a field of characteristic # 2. Proceeding by way 
of contradiction let 1 be the least order for which the lemma is false. Then there 
exists an nm Xn matrix A such that all matrices in @(A) have eigenvalue 1. Thus 
for any matrix DA € @(A), there exists x # 0 such that DAx = x. Hence Ax = 
Dx, and so the matrix A — D is singular. Consider the determinant function 


d(t,,...,t,) = det(A — diag(t,,...,t,)). 


We have that d(t,,...,t,) = 0 for all 2” choices of t; = +1,i=1,...,n. 
Expanding d(t,,...,t,) according to the first row, we obtain 


d(t,,...,t,) = (a,, —t,)d*(ty,...,t,) + terms independent of t, (1) 
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By the induction hypothesis the determinant d*(t,,...,¢,) of order n — 1 is 
non-zero for at least one choice of t; = +1, j = 2,...,n. Make such a choice: 
ty,...,¢, =A5,...,A,. 


We have d(1,A,,...,A,) = 0 and d(—1,A,,...,A,) = 0. From (1) 
(a,, — 1)d*(A,,...,A,) = (ay, + 1) d*(AQ,...,A,)- 


Hence a,, — 1 =a,, + 1 which is clearly false in a field of characteristic # 2. So 
the lemma is proved. 


Construction of n X n orthogonal matrices. Let F be a subfield of the field of 
real numbers. By Cayley’s formula [3 p. 289] the mapping S — (S — J)7'(S + J) 
gives a one-one correspondence between the set of n X n skew-symmetric matrices 
S over F and the set of n X n orthogonal matrices over F which do not have 1 as 
an eigenvalue. The inverse correspondence is given by A > (A —1)7'(A + J). 

Now let A be any n Xn orthogonal matrix over F. Then the equivalence class 
6(A) consists of orthogonal matrices, and by the Lemma at least one of these, B 
say, does not have eigenvalue 1. By the Cayley correspondence there exists a 
skew-symmetric matrix S over F such that B = (S — 1)~'(§ + I). Thus we have 
the following result. 


THEOREM. The set of all n X n orthogonal matrices over F is generated by the 
n Xn orthogonal matrices (S — I)~'(S + I), where S is skew-symmetric over F, and 
their equivalence classes. 


The same technique can be used to generate all unitary matrices by considering 
the complex field and replacing “orthogonal” by “unitary” and ‘‘skew-symmetric” 
by “skew-hermitian” in the above analysis. 

It is worth pointing out that the n Xn orthogonal matrices (S — J)~'!(S + J) 
referred to in the theorem all have determinant (—1)”. (Proof: det(S + J) = 
det(S + J)? = det(—S + J) = (—1)"det(S — J), from which the result follows.) 
So they represent reflections when nm is odd and rotations when n is even. 
Consequently every n Xn reflection matrix when n is even and every n Xn 
rotation matrix when v7 1s odd is equivalent but not equal to a matrix of the form 
(S — 1)7'(§ + I). This can also be seen from [4] Theorem B, which states that 
every reflection of a (Euclidean) space of even dimension and every rotation of a 
space of odd dimension has a fixed vector. The corresponding orthogonal matrices 
then have eigenvalue 1 and, therefore, do not feature in the Cayley correspon- 
dence. 

In particular, John Cremona’s 3 X 3 rotation matrix M(a, b,c, d) is given for 
the case d # 0 by 


M(a,b,c,d) =D(S —1)"'(S +1), 


where S is the 3 X 3 skew-symmetric matrix which has a/d, — b/d, — c/d above 
the diagonal, and D = diag(1, 1, — 1). 

This note provides a theoretical method of generating all orthogonal matrices 
over a real field F, but as a practical classroom aid it is of limited use. The reader 
interested in a simple technique of generating a large class of 3 X 3 rational 
orthogonal matrices is referred to our Teaching Note [5]. 


We are grateful to a referee for some of the observations following our theorem. 
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Hypocycloids, Continued Fractions, and Distribution Modulo One 


NORMAN RICHERT * 
Division of Mathematics and Systems Design, University of Houston, Clear Lake, Houston, TX 77058 


In simulation and Monte Carlo approximation schemes, a random sequence 
(pseudorandom, really) is needed. How this sequence might be generated is a long 
story. Donald Knuth [2] encapsulates some of the difficulties neatly in The Art of 
Computer Programming , Volume 2, where he says that “random numbers should not 
be generated with a methad chosen at random. Some theory should be used.” A 
sequence with some very nice properties is the so-called (n@) sequence. Let 6 be 
irrational. Let a sequence {x,} in [0,1) be defined by x, = (n@), n = 1,2,..., 
where (-) denotes the fractional part. Let us explore this sequence. 

One way to visualize what the sequence is doing is to imagine a circle of 
circumference @ rolling to the right on the real line. The circle has a distinguished 
point P, which begins at zero. Then the sequence consists of the set of positive 
points where P touches the line, as long as each interval from integer k to k + 11s 
understood as a copy of [0, 1). So the points could be viewed as the set of cusp 
points of the cycloid. Better still, roll the real line up into a circle of circumference 
1 by identifying the integer points. Now the cycloid is a hypocycloid, at least if the 
circumference of the rolling circle is less than 1. There is no loss of generality in 
supposing this, so assume in what follows that 0 < 6 < 1. 

Calculus students are frequently asked to find parametric equations for a 
hypocycloid, using the angle between the x-axis and the line of centers of the two 
circles as the parameter. Frequently, 6 = .25 is illustrated as a special case, the 
hypocycloid of four cusps, because the parametric equations have a particularly 
simple form in that case. It is easy to infer a family of hypocycloids of qg-cusps, 
where 6 = 1/q, q > 2. The associated sequences could hardly be described as 
random. From a theoretical point of view, things get more interesting if 0 is 
irrational. Then the curve is not closed and {(n@)} is not a periodic sequence. 
However, on a computer all numbers are rational, and plotters have only a finite 
resolution. So for the purposes of this discussion the rational /irrational distinction 


*Appreciation is expressed to Marquette University Computer Services for the use of the large-for- 
mat drum-pen plotter used in the figures. 
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will be blurred. We might view irrational numbers as rationals with large, unknown 
denominators. 

If 6 = p/q is rational, in reduced form, it is not hard to see that the rolling 
circle must roll completely around p times to bring P back to the 0 point, with q 
cusps. If p = 1, a simple closed curve results (except in the degenerate case of 
q = 2); otherwise the curve is closed, but not simple. So the hypocycloids of 
q-cusps form a somewhat larger family than it seems at first glance. In particular, 
there are ¢(q) such curves, where ¢ denotes the Euler ¢-function. However, if we 
consider the hypocycloid only as a point set, the trace of the hypocycloid, we have 
over counted by a factor of 2. As a comparison of the curves associated with 
§@ = .25 and 6 = .75 suggests, the traces associated with 6 and 1 — @ are the same. 

Let us introduce a bit of notation. Consider the parameterized curves as 
functions y,: (0,0) > R*. Denote the trace of y, by {y,}. The domain (0, 2 pz] is 
convenient for plotting {y,} with no overlap where 6 = p/q. Let {y,}',t € (0, q], 
denote the trace of y restricted to (0, 27rtp/q], so that {y,}’ can be thought of as a 
“partially completed” plot of y,. Let (n@)' be the finite sequence associated with 
{y,}’, that is, {(n@)}'l,, where [-] denotes the greatest integer function. For 
6 = 1/q, the various {y,}' are easily predictable, but for p > 1 the situation is less 
clear. Various patterns of cusps seem to appear as ¢ increases. Moreover, these 
patterns do not seem to be continuously related to @. Compare FiGure 1 with 
6 = 101/251 and t = 169 with Figure 2, where @ = 102/251 and t = 155. The 
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cusps in the latter case group into 32 clumps, a phenomenon absent in the first 
case (additional figures with other ¢ values in each case would be more convincing). 
“Clump” is not a technical term, but I believe the figures show what is meant. 
Without going into the details, it seems reasonable that (n(102/251))' will fare 
worse in Statistical tests that emphasize uniformity of distribution than will 
(n(101 /251))'’, worse than the differing number of points will account for. 

The clumping noted in the figures occurs when the tracing point P comes closer 
to 0 than it previously has, i.e., when min{(7@), 1 — (n@)} attains a minimum. The 
trace from that point onwards is a copy of the initial trace, with a small rotation. 
The smaller this minimum is, the more the clumping. So we need to investigate the 
minimal values of |n@ — m| for integral m,n. Continued fractions [5] are exactly 
the mechanism to investigate these minimal values. The values |q,6 — p,| corre- 
sponding to the sequence of convergents p;,/q, associated with @ are exactly these 
minimal values. 

The continued fraction expansion of @ consists of a sequence of integers a,, 
positive for k > 0, the partial quotients (finite when @ is rational) and an associ- 
ated sequence of numbers &,, the complete quotients defined by 


fo = 9, ay = [&o], 


The partial quotients yield the convergents, given by 
P2=0,  p.y=1, Dy = 4g Pe-1 + Pra: 
q_.=\1, q_, = 9, Dk = Ady —-1 + WK-2> k>0. 


One of the fundamental results in the theory of continued fractions [5] is that 
the convergents to a number @ are the best approximations to that number in the 
sense of minimizing |b@ — a| over all denominators no larger than b. The 22/7 
approximation of 7 is a convergent. FiGuRE 3 shows {y,}*” for 6 = 1/z,, illustrat- 
ing the 7/22 convergent (if @ > 1, the sequence of partial quotients for 9 — [6] > 
1/0 is simply shifted one place to the right, with ay = 0, and the convergents are 
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Fic. 4. 


the reciprocals of the convergents for 6). Figure 4 is {y,}°°°, illustrating the less 
familiar approximation, 106/333. 

It is shown in the theory of continued fractions that the sequence |q,@ — p,| is 
decreasing, converging to 0. Thus the convergents p,/q, of @ correspond to nearly 
equally spaced sets of q, cusps. When |q,6 — p,| is much smaller than the 1/q, 
spacing between cusps, so that q,|q,@ — p;,| is small, there will be clumping. It can 
be shown that 

ak 1 
< 
Qk +1 Ak+1 


4119.9 — Dl < 
which says for the hypocycloid pictures that a high level of clumping is associated 
with large partial quotients. 

The partial quotients of the @’s in Ficures 1 and 2 are 101/251 = [0; 2, 2, 16, 3] 
and 102/251 = [0;2,2,5, 1,7]. In the latter case, the relatively large a; = 7 pro- 
duces clumping around the cusps associated with p,/q, = 13/32, and thus the 32 
clumps in Figure 2. The partial quotient a, = 16 in the former case will yield 
clumping corresponding to p,/q, = 2/5. This would be seen in {y}’ plots for ¢ 
closer to 5. 

The study of uniformly distributed sequences modulo one provides useful tools 
for investigating the clumping that shows up in these hypocycloid plots. Let E bea 
subset of [0,1) and for the sequence w = {x,} let ACE; N; ow) = #{nll<n< 
N,(x,) € E}. Then @ is said to be uniformly distributed modulo 1 (u.d. mod 1) if 
for every pair of real numbers a, b with O < a < b < 1 the equation 

A([a, b); N; @) 
lim ————_———— = 
N-o 


— a 


holds. That is, each interval [a, b) contains the appropriate portion of terms in the 
limit. It can be shown [4] that for @ irrational (”@) is u.d. mod 1. The clumping in 
FiGurE 5 suggests that the sequence is not u.d. mod 1, but the limiting situation is 
all that matters in the definition. The tool used to measure finite segments of 
sequences is the discrepancy, D,,, defined by 


A([a, B); N;@) 
Py(o) = sup [Ara _(p-a)| 
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Clearly Dy < 1 and it can be shown that D, > 1/N, with D, = 1/N if {x,}*®_, = 
{(n — 1)/N}_,. Visually, it seems clear that the sequence represented in FiGuRE 
2 has a higher discrepancy than the sequence of Ficure 1. In fact, Dj y = .010 for 
6 = 101/251 and D,;, = .018 for 6 = 102/251, so the former is closer to optimal, 
1/169 = .006. 

An easier measure to calculate is the star discrepancy, D*, where the a in the 
definition of D,, is fixed at 0. It is shown in [6] that if (x,),...,(x,) are ordered by 


increasing magnitude, then D* can be calculated by 


(i— 1) 
“h cs) 
This value satisfies D¥ < Dy < 2D, so 1/QN) < D* <1, and Ds = 1/QN) 
iff x, = (Qn — 1)/2N. 

An important application of pseudorandom sequences is in Monte Carlo ap- 


proximation methods. It is a result of Koksma [3] that if f is a function of bounded 
variation V(f) on [0, 1], then 


1 N 1 
wun) — [FCO at) < VP) DS. 


n=1 


This application is an important source for interest in low discrepancy se- 
quences. But why not simply always use x, = (2n — 1)/2N? First, this gives only a 
single sequence for each JN, and a typical strategy in applying Koksma’s approxi- 
mation is to use several sequences, and perhaps average the results. Second, it may 
be desirable to add new points to an existing approximation to sharpen the results. 
But these must be added in multiples of N, since intermediate sequences would 
have relatively large discrepancies. 

What is desired is a family of sequences for which D,, is as close as possible to 
1/N. It turns out that if 6 is of a certain class of numbers, then (14) is just such a 
sequence. It is shown in [4, 6] that if @ has bounded partial quotients, then for 
every ¢ > 0, Dy(w) = O(N“ log N). In fact, if @ has partial quotients bounded 
by K, then 


ND,(w) <3 + log N, 


—_—. + ee 
logé log( K+ 1) 
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where € = (1 + 75) /2, and this is the least order of magnitude possible. So a 
particularly good sequence is provided by 9 = € — 1 = (—1+ V5) /2, for which 
the continued fraction expansion after a, = 0 consists entirely of 1’s. 

What does this mean for the hypocycloid plots? For these “good” 6, the {yp}' 
plots will never show the clumping seen in FiGuRE 2, and also seen in FIGURE 5 
with 6 = 1/m and t = 163.4. Ficures 6, 7 and 8 show {ye} for t = 14.6, 16.2, 
and 17.8. In a rough sense, the hypocycloids and their associated sequences divide 
into three categories. First there are those for which @ is rational, which are the 
farthest from being random. Then there are the @ irrational, but with excellent 
rational approximations, so that even though (v@) is u.d. mod 1, it does not have 
uniformly low discrepancy. Finally, there are the irrational @ with poor rational 
approximations, and correspondingly low discrepancy in the (”@) sequence. 


g=é-1 


Fic. 6. Fic. 7. 


As a footnote, it should be recalled that € = (1 + y5) /2, sometimes called the 
“golden ratio,” is closely connected with the Fibonacci sequence. One of the many 
appearances of the Fibonacci numbers in nature has to do with the phenomenon 
known as spiral phyllotaxis, exemplified by the helical arrangement of leaves on 
many stemmed plants [1]. It turns out that in many species the angle between 
consecutive leaf buds is relatively constant at 27(é — 1). Hence, the leaf buds, 
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viewed as points on a circle when looking down the stem, are the cusp points of the 
€ — 1 hypocycloid. The result is that the number of spirals required for leaves to 
roughly line up on the stem is a Fibonacci number, and the number of leaves 
appearing in that collection of spirals is a Fibonacci number. Although the 
mechanisms are not well understood, it could be argued that the resulting leaf 
pattern, by avoiding clumping to the optimal extent, maximizes the exposure of 
leaves to the sun. 
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The Perimeter of a Rose* 


JAMES P. BUTLER 
Department of Enviromental Science and Physiology, Harvard School of Public Health, 
Boston, MA 02115 


Introduction. Piranian [5] posed some interesting questions regarding the length 
L(t) of the lemniscate level set 


E(Q,,t) = {z:1Q,(z)I = t}, 

where Q(z) =z" — 1, n=1,2,..., and 0 <t < o, (The case n = 1 is trivial; 
L(t) = 27t. In what follows, take n > 2.) E(Q,,1) is a rosette of n leaves; 
E(Q,,t) is a disjoint or smoothed version of the rosette for t <1 and t>1 
respectively. Fic. 1 shows E for a few values of n and t. The basic questions 
concern whether (or where) L,(t) is an increasing function of t. Piranian gives 
elementary arguments to show several things. First, there are only finitely many 
indices n such that dL,(t)/dt = L'(t) > 0 on 0 < t < ©, (He shows that L,(2) < 
LQ) for n > 4.) Second, L(t) > 0 for all t¢ in the interval 0 < t < 1. He suggests 
that L(t) < L,(1) for some t > 1 (true) and that perhaps L4(t)=0 at t= 1 
(false). He poses three questions, all of which we answer in this note (including the 
case n = 2). 


(a) Do the left- and right-hand derivatives of L,(t) exist at t = 1, and are they 
equal? Answer: they are respectively +o and —o for all n > 2. 

(b) Is it true that for m = 2,3,..., the function L,(t) decreases in some 
interval 1 < t < T, and increases in T, < t? Answer: yes (note that n = 2 
is included here). 

(c) What is the asymptotic behavior of T,, as n — ©? Answer: T, ~ yn/2. 


*Supported in part by NIH grant HL33009. 
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t = 0.9, 1.0, 1.1 


Fic. 1. 


Explicit solution for L(t), t > 1. The equation |Q,(z)| =t becomes, with 

z= re'®, 
r°? — Ir" cos(né) —u = 0, 

where u = t” — 1. In the formulas which follow, the point t = 1 is understood to 
be the limit as t > 1*. The differential arc length ds = do(r* + (dr/d@)’)'/”, and 
so, solving the above equation for r and replacing 6 by € = 76 (to count the n 
leaves separately), we have 
1/2 


, 1/2] 1/" 
|cos € + (cos? + w)'”"| 


u=t*—1. 


L,(t) -2f" ae 


Now, being mindful of the sign of the cosine, write x = +cos € on (0, 7/2) and 
(1/2, 7), respectively. This yields 


cos7é + u 


; 1/2 L/n ; 2 L/n 
L,(t) rift E29 ry) ral le 7 | 


The transformation w = 2x*/u yields 
wy (Wwt2+Vw) "4+ (Wwt2—-Ww)” 
Lt) = rae | dw, 
0 yw(wWy — w)(w +t 2) 


A slightly generalized form of this integral appears in Gradshteyn [4] as formula 
3.277.7, who obtained it from Bateman [2], formula II: 13.112). Unfortunately, it 
was incorrectly copied; unfortunately, there is also a typographical error in 
Bateman’s formula which propagated as well. The correct formula can be derived 
from Bateman [3], formula I: 3.7(8), and is 


[ve ae ay (EEE + vay" + OF — Vey" 


= Vir T( p20, y(y + 2)) PA PI ey + 1), 
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where P?(z) is the associated Legendre function of the first kind, of degree a, 
order b, and argument z. Our case corresponds to 4 = 1/2, so we thus obtain the 
explicit solution for L(t), t > 1 in terms of Legendre functions: 


9) l—-—»n 
L(t) =2ru®V1+uP,{1+—-|], O<u=t?-1<%, B= 
2 
(1—n)/2n t“+ 1 
— 2 t(t2 — 1 P me 9 t>1. 
TT ( ) (l-n)/2n\ 424 


A referee pointed out that this can also be conveniently expressed in terms of the 
Gauss hypergeometric function. In particular, 
l—-—»n 
2n ° 
Explicit solution for L(t), 0 <t <1. The procedure here is identical to the 
case t > 1, except that some care must be taken in the limits of the € integral. 
(Here the point ¢ = 1 is the limit as t — 1°.) For the smoothed rosette above, é 
had limits 0 to z (which we divided into (0, 7/2) and (77/2, 7r)), and dr/dé < Oin 
(0, zr). In the disjoint case t < 1, é has limits (0, €,), where €, = cos” ‘(V1 — t*). 
For € in this range, there is an outer and an inner portion of the curve, for which 


dr /dé <0 and dr/dé > 0, respectively. Summing these yields the same types of 
formulas as those given above, with some sign changes. The result is 


L(t) = 2qt'/"F(-B,-B;1;1/t*), B= t>1. 


2 1-—n 
L(t) = aVT Fu (—u)?P,| ~1 — “|: -1<u=t’?-1<0, B= 5 
u n 
1+? 
_ - 9\U-n)/2n 
In terms of the hypergeometric function, this is equivalent to 
l-—n”n 
L,(t) = 27tF(-B, — B31;t?), B= a t<l. 
n 


We may combine the above results by writing our final answer in the form 


t* +1 


_ 2 (1-n)/2 
L(t) = 2at|t* — 1| " "Pa -ny/2n it? — 1 


, O<t<o, 


This function is shown in Fic. 2 for n = 2, 3, and 5. Note the singularity at ¢ = 1, 
and, near t = 1.3, the curious near coincidence of the curves. 


Limits. We distinguish here first the limits as t > 0,1°,1%*, ©, for n fixed, and 
second, the limit as nm — o for fixed t < 1, = 1, and > 1. For n fixed, the limits 
t — 0 and t > ~ correspond to the Legendre argument approaching 1*, for which 
P->1and L(t) ~ 27t and 27t'/", respectively. For n fixed and t > 17 or 1*, 
the argument approaches +, and we use the leading behavior of P,(z) aS Z—> @: 


pl G+ 26) , pri (>! = 28) —p-1 — 
rq +B) r(-p) ~ a 


For fixed n at t = 1, only the first term survives in the expression for L,(1), while 
the second term contributes to the approach to L,(1). 


P,(z) ~ 2 


142 JAMES P. BUTLER [February 


15 
CON 
10 
L, Sf 
5 ; 
0 
0 i 2 3 4 


Fic. 2. 


The second set of limits to consider are those for nm — © with ¢ fixed. For t = 1, 
the result above suffices, and we get the obvious limiting behavior L,(1) ~ 2n. For 
fixed ¢ #1, the limit nm — © corresponds to the Legendre degree approaching 
~1/2, for which we need the formula (1) P_, (cosh n) = (2/7)K(tanh( /2))/ 
cosh(y /2), where K is the complete elliptic integral of the first kind, with modulus 
(not parameter) given as its argument. Using these limiting forms and the duplica- 
tion formula for the gamma function in the explicit solutions given above, we may 
summarize these limits as follows (limit and asymptotic behavior are indicated with 
— and ~, respectively): 


fixed > Ot ~ Qt 
] 
ie "(-35] 
n n 
fixed > 1+ ~ vr 21/8 a lt 12" 
ri—+-] r{-—+- 
an 2 an 2 
fixed —> 0 ~ Qrt}/" 
> 0 fixed, < 1 — 4tK(t) 
—> 0 fixed, = 1 ~2n 
> © fixed, > 1 > 4K(1/t) 


The second entry answers question (a) of the introduction, and part of question 
(b). That is, the exponent of |t — 1| describing the approach to L,(1) is 1/n; 
furthermore, [(—1/2n)/T(—1/2n + 1/2) is negative for 2<n, and thus 
dL (t)/dt approaches + for t > 1~ and — for t > 1°. Except for the trivial 
case n = 1, L,(t) exhibits an infinitely sharp (symmetric) cusp at t = 1. 


Where is dL, / dt = 0? For 1 < t, we know that L,(t) decreases up to some 
point 7|. Can we find an expression for T7,? It is straightforward to differentiate 
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the expression given above for L,(t), t > 1. Setting dL,(t)/dt = 0, and using the 
relations among the contiguous Legendre functions to eliminate dP,( z)/dz, we get 
a very pretty formula satisfied by T,: 
T? +1 l-n 
(1 + B)P3(z) + BPz_,(z) = 0, oS p27? B= an 


n 


Now as 1 —> ©, T| increases, and so we may examine this expression in the vicinity 
of z = 1”. For this, we use the first two terms in the expansion of Ps: 


1 
Pilz) = 1+ 5B(B + I(z- I) +o, z> 1". 


The resulting solution is the leading term in the asymptotic behavior of 7): 


T~ yn/2. 


This is the answer to question (c). 

The second half of question (b), equivalent to T, being unique for each n, is 
answered by showing that the function A,(x)=L(t), x =t'/" is convex in 
1 <x < o, This is straightforward, though a bit tedious. We will not prove this 
here. 


Some heuristics. Many of these results are satisfying intuitively. Consider ¢ in 
the vicinity of 1, and examine the level curves of Q(z) in the complex plane. As tf 
moves from 1~ through 1 to 1*, nothing very exciting happens to the curves except 
near the origin. As ¢ passes through 1, we find curves lying in every other valley 
sharply descending to the valleys’ bottoms and “popping” through to the comple- 
mentary set of valleys. We call such a pop an nth order pop, the strength of which 
is given by the exponent of the cusp, namely 1 /n. 

We conclude by giving a recipe for the behavior of the length of the level set 
E(A,, t) corresponding to an arbitrary nth order polynomial A,(z) of the form 
IT7_,(z — z,), where for convenience we will take the roots z, distinct. Let 
d,, = |z; — z,|, and let D, = [1,,;d,;. Draw the magnitude contour map of the 
given polynomial in the complex plane. For small t, the contours will be small 
circles surrounding the zeroes of A. In this limit, it is easy to see that L ~ 
2atX"_,D, '. For large t, the contour will approach a single circle, and, trivially, 
L ~ 2art!/". For intermediate values of t, examine the contour map. As the small 
t circles increase in size, they will bump into each other with pops of varying order. 
At these values of t, L will exhibit cusps of the appropriate order. 


Acknowledgements. I thank Mile. H. Peslin for great help in performing the delicate numerical 
calculations to check this work, and in producing the graphics. 
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A Tree Among the Stepping Stones 


A. R. MEVER 
Department of Mathematics and Applied Mathematics, University of Natal, Durban 4001, South Africa 


The transportation problem is a linear programming (LP) problem of the form: 


n m 
minimize )) )\c 
j=1i=1 


ij*ij 
subject to the constraints 


n 
Lx, =4; fori=1,...,m, 


Ms 
a 
! 


I 
—_ 
= 


b; tor J 


and x;; > 0 for all i and j. 

The traditional interpretation of this problem is that of m factories with outputs 
a,,...,@,, and n warehouses with requirements 5,,...,5,, and with c;,; denoting 
the cost of moving one unit from factory i to warehouse j, the goal being to 
minimize the total cost of transporting the goods from factories to warehouses. 


Note that, since 
La; = Vix = Lb, 
i a) J 


the x,;; must satisfy m+n -—1 independent equations, so any basic feasible 
solution will contain m + n — 1 basic variables. 

Because of the special structure of this LP problem, special techniques for 
solving it have evolved, popular among them being the so-called stepping stone 
method [1]. The essence of this method lies in the following fact: If a basic feasible 
solution has been obtained, then for any nonbasic variable x,,, there exists a 
(unique) “loop” 


ij? 


Xi55X x 


ij X ijy> Siqiy> Vigig? Finigr +> Sing ving? Sing pi? Wii 

beginning and ending at x,; and alternately changing rows and columns, in which 
all the other variables are currently basic. For each nonbasic variable x,; one may 
now calculate the net cost involved in increasing x;,; by one unit, which necessi- 
tates decreasing x; jn by one unit, which necessitates increasing Xi, by one unit, 
and so on. If this net cost is negative, that is, if it is advantageous to do so, we 
incorporate x,; into the basis and give it the largest value possible without violating 
the nonnegativity constraints, which implies that (at least) one of the basic 
variables x. becomes zero and this variable (arbitrarily chosen if there is more 


‘ti+1 . 
than one) is now dropped from the basis. 
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Most of the standard treatments of the stepping stone algorithm, such as 
[4,5, 7], do not make clear why such a “loop” should necessarily exist, nor why it 
should be unique. It is interesting to note that the authors of [1] missed the 
uniqueness property, advising the reader that “when alternate stones are available 
along the route he should take the shortest path from the beginning row to the 
ending column” [1, p. 54]. Existence and uniqueness are proved in [6, p. 385] by 
means of a linear algebraic argument. The purpose of this note is to sketch a 
transparent graph-theoretical proof, which depends only on the following two 
properties of trees (see, e.g. [2, p. 69)): 


1. A connected graph on v vertices is a tree if and only if the graph has exactly 
v — | edges; 
2. In a tree, given any two vertices, there is a unique path connecting them. 


With the increasing emphasis on discrete mathematics, many, if not all, students 
are familiar with these ideas. 

If a basic solution to the transportation problem is given, we consider the graph 
with vertex set {r),%5,.--5 ins Cy» Co5--+5C,t, Where r, and c, are connected by an 
edge if and only if x,, is a basic variable. The loop above then corresponds to a 
path from r; to c;. The required results will have been proved once it has been 
shown that the graph is a tree. (The latter fact is equivalent to Dantzig’s Basis 
Triangularity theorem [3, p. 303]). Since the graph has m+n vertices and 
m+n — 1 edges, it is sufficient to prove that it is connected. 

Note, however, that any iteration of the stepping stone algorithm preserves 
connectedness, since if x,,; is taken into the basis (that is, if r; and c; are 
connected by an edge), a cycle is completed in the graph, this cycle then being 
removed by the deletion of x; Ji,, from the basis (that is, by the removal of the 
edge from r; toc, _). It therefore suffices to show that the initial basic feasible 
solution will indeed yield a tree. 

We shall discuss this only for the simplest method of finding a starting basic 
solution, namely the so-called “north-west corner” method. In this method one 
starts by choosing x,, as a basic variable and giving it the value min(a,, b,). If 


x1, = 4,, all the variables x,(j = 2,...,) are declared to be nonbasic; if x;, = b,, 
the x,,(i = 1,..., m) are designated nonbasic. If a, = b,, one, but not both, of the 
sets {X)5,..., X,,} and {x,,,..., X,,,} is designated to consist of nonbasic variables. 


The subgraph {r,, c,} is now connected, obviously. 

At any subsequent stage, choose x,;, from among the variables that have not yet 
been designated basic or nonbasic and with either 1 or j minimal among all the 
indices appearing in this set. Designate this x,, as basic, giving it the maximum 
value possible within the constraints and the values already given to the basic 
variables selected earlier. (This value may have to be zero, but this degeneracy 
does not cause problems.) As before, all the remaining variables in either the ith 
row or the jth column (but not both) are designated nonbasic. From our choice of 
X jj, X;, has already been selected as basic for some k <j, or else x,,; has, for some 
I <i. Thus we connect r; and c, where either (7;,c,,) or (7;,c;) was a previously 
existing edge and (by the induction hypothesis) the graph previously constructed 
was connected. Because, from the constraints, every row and column must contain 
a basic variable once the north-west corner procedure has been concluded (that is, 
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once m+n -— 1 variables have been designated basic), this completes the argu- 
ment. 


The author thanks the referees for their constructive suggestions. 
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An Analytic Technique to Prove Borel’s Strong Law 
of Large Numbers 


Liu WEN 
Department of Mathematics, Hebei Institute of Technology, Tianjin 300130,People’s Republic of China 


Borel’s strong law of large numbers asserts that if S,, n > 1, is the number of 
successes in the first n independent repetitions of a Bernoulli trial with success 
probability p, 0 < p < 1, then 


S 
P| lim — -») = 1. (1) 
n—-o 


This result is commonly proved in textbooks through the application of powerful 
probabilistic tools or by a combinatorial argument (see, e.g. [1, p. 42], [2, p. 97], or 
[4, p. 273]). Tomkins [6] and Taylor and Hu [5] provided additional proofs which, 
although simpler than the proofs common in most textbooks, use the same basic 
probabilistic results. In this note we present a new analytical technique to prove 
Borel’s strong law of large numbers, using tools different than is usual. The crucial 
part is the application of Lebesgue’s theorem on differentiability of monotone 
functions (see [3, p. 358]) to the study of a.e. convergence. 

We first give a representation of a set of independent and identically distributed 
Bernoulli random variables with success probability p (0 < p < 1) in the probabil- 
ity space ((0,1),4,A), where @ denotes the Borel sets and A is Lebesgue 
measure. Divide [0,1) into two right-semi-open intervals: d, = [0,1 —p), d, = 
[1 — p,1). These intervals will be called intervals of first order. Proceeding induc- 
tively, suppose the 2” nth-order intervals d, ..., (x; = 0,1; 1 = 1,2,...,) have 
been defined. Then divide each interval d, ...,. into two right-semi-open intervals 
d,...y,9 and d, ...,, according to the ratio (1 — p): p; this creates 2”*" intervals 
of order n + 1. Define, for n > 1, random variables X,: [0, 1) — {0, 1} as follows: 
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X,(x) =x, if x Ed, ...,. It is easy to see that 


x n—S,(x) 
Md y cn xice)  X,(x)) = p"(1 — p) , (2) 


where S,(x) = X,(x) + +++ +X(x). It follows from (2) that the X,, n = 1, are 
independent and identically distributed Bernoulli random variables with 
ProbCX,, = 1) = p and ProbC(x, = 0) =1-p 

In order to prove (1) we first construct an auxiliary function. Let r € (0,1) be a 
constant. Dividing the interval [0, 1) successively according to the ratio (1 — r):r 
as before, another set of nth-order intervals D, ...,, (x, = 0,1; i = 1,2,...,7) is 
created. Let d, x, and dy x, be, respectively, the left and right endpoints of 
d,....,3 define ‘D, _y, and ‘DS. _y, similarly. Let Q be the set of endpoints of all 
d-intervals. 

Now we define an increasing function f,: [0, 1) — [0, 1) as follows: 


Pde, x) = Dey x,3 
PAGE x) = De 2,3 
f(x) =sup{f(t):¢ €Qn[0,x)}, if x ] [0,1)\Q. 
Let 
Ads, x) ~fAds, +) 


t(r,x) = a FF 
. tea, 


"Xy, 


for x €d, ...,. It is easy to see that 


—$,( ) 
MDayco xy) _ PP = ry 
Mdy or X(x)) pr _ py? > 


r(1—p)\"( (=r) 
Fre renal @) 


Letting A(r) be the set of points of differentiability of f., we have ACA(r)) = 1 by 
Lebesgue’s theorem on differentiability of monotone functions. By a property of 
the derivative (see [3, p. 345]) we have lim, ,., t,(r, x) = f(x) < © for x € A(r). 
Thus 


t(r,x) = 


log t,(r, x) 
lim sup ——————- < 0, forx € A(r), (4) 
n 


niweo 


where log denotes natural logarithm. By (3) and (4), 


S( x r(1 —p 1—p 
lim sup ( dy ( < og 5 , (5) 
n— 00 nN p(1-r) r) 1-r 
If p <r <1, we have by (5) that 
log 
S (x a 
tim sup 22 < — oy (6) 


log 


p(i—r) 
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Assuming p <r, <1, r, > p (as k > ©), and letting A = N2_,A(/,), for each 
k = 1,2,..., we have, by (6), 


1 —p 
s(x) FI 
lim sup 7 r(l =p)” forx €A. (7) 
p(1 — rx) 
Since 
1 —p 
log 
; 1-r, 
poe dp) 
p(1 - r;) 
(7) implies that 
S,( x) 
lim sup , <p, forx €A. (8) 


Similarly, assuming 0 <1, <p, 1, ~ p (as k ~ &), and letting B= N7_,A(,), 
we have 


Srl X) 
lim inf >p, forx €B. (9) 
From (8) and (9) it follows that 
_ S,Cx) 
lim =p, forx EC ANB. (10) 
no nh 


Because ACA M B) = 1, (1) follows from (10). 
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A Topological Mean Value Theorem for the Plane 


IRA ROSENHOLTZ 
Department of Mathematics, Eastern Illinois University, Charleston, IL 61920 


Cauchy’s Generalized Mean Value Theorem states that if x and y are functions 
that are continuous on [a,b] and differentiable on (a,b), then there is a ft, in 
(a,b) such that x'(t,))Ly(b) — y(a)] = y(t) x(b) — x(a)]. One geometric interpre- 
tation of this is that if Q(t) = (x(2), y(t)) is a path in the plane, then, under 
certain hypotheses, there must be an instant at which the velocity vector is parallel 
to the vector joining the endpoints of the path. Can this conclusion be improved? 
In particular, can we get a velocity vector to point in the same direction as the 
vector from the first endpoint to the last and not just be parallel? This is the 
question we address in this note. Surprisingly, the answer turns out to be a 
consequence of the Jordan Curve Theorem! This result provides some additional 
insight into such old standards as the Mean Value Theorem and Cauchy’s 
Generalization, l’H6pital’s Rule, and Darboux’s Theorem, while demonstrating 
further interplay between topology and analysis. 

Consider the related question: Can one go north without ever heading north? A 
little reflection shows that the answer to this question is Yes and, in fact, this can 
happen in a variety of ways. First of all, one might travel up to the North Pole on a 
globe by “spiraling up” to it (see FiGurE 1a). (Of course, it is even easier to go east 
by heading west, but that’s another story.) Even if traveling is restricted to the 
plane, it can still be done. One way is to stop (i.e., have zero velocity vector at 
some instant). Then one’s path can have a “differentiable corner” (see FIGURE 1b). 
And another way to go north without ever heading north is to head SOUTH! (See 
FiGur_E lc.) Notice that in this example the path crosses itself. As we shall soon 
see, these are the only things that can go wrong. 


North Pole 


Fic. 1(a) Fic. 1(b) 
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x=4-f° 
y=t°-3t 
—-2<t<2 

Fic. 1(c) 


Let us agree on the following vocabulary: A path in the plane is a continuous 
function O(t) = (x(t), y(t)) defined on some closed interval [a, b] of real num- 
bers, where a < b. A path OQ. is differentiable provided ()'(t) = (x(t), y(t)) exists 
for a <t <b. (We do not assume that ( is continuously differentiable.) A 
differentiable path is nonstop if ('(t) # (0,0) for all ¢ such that a <t<b. 
Finally, a path that is one-to-one is called an arc. 

We prove the following: 


TopoLoGicAL MEAN VALUE THEOREM. If 0: [a,b] > R? is a differentiable, 
nonstop arc in the plane, then there is a positive number yw and a t, in (a, b) such 
that Oty) = plQ(b) — O(a). That is, at some instant ty, the velocity vector O'(t,) 
points in the same direction as the vector from Qa) to Q(b), or, what amounts to 


the same thing, O'(t,)/O'C oll = LAC) — OCaI JAM) — OCadIL 


(Notice that this generalizes the Mean Value Theorem: let Q(t) = (4, f(t)) on 
[a, b].) 


Proof of the Theorem. First note that it suffices to consider the special case 
where (b) is due north of (a) and both lie on the y-axis. Uf OQ(b) is not due 
north of Qa), consider the auxiliary path B(t) = (0, ILO) — Ol(a)// 
[Q(b) — O(a)], where all points in the plane are to be thought of as complex 
numbers, and all operations are as such. Then B is also a differentiable nonstop 
arc in the plane, and B(b) = (0,1) is due north of B(a) = (0,0). Furthermore, the 
w and ¢t, that work for B also work for 0.) Let t, in [a,b] be chosen so that O(t,) 
lies on the y-axis and is the most northerly such point. (It is possible that t, equals 
b—a situation we shall have to address—but certainly t,; does not equal a.) There 
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are two possibilities: Case 1: There is an increasing sequence s, < 5s, < +: such 
that lim s, = ¢, and ((s,) lies on the y-axis for all n; and Case 2: There is no such 
sequence. 


Case 1. We may assume in this case that ¢, is less than b. (If not, choose a c 
less than b so that O(c) is due north of O(a), and choose a new ¢, for the path 0 
restricted to the interval [a,c]. Note that with this new choice of t,, we might now 
be in Case 2.) So Q is differentiable at t,. Then 0'(¢,) = lim[Q(s,) — O(,)1/ 
(s, —¢,). But since ((s,) lies on the y-axis below O(¢,), and s, <t,, we may 
conclude that O'(t,) is of the form (0, y’(t,)) with y'(t,) > 0. Finally, since 
Q'(t,) # (0, 0) by hypothesis, y’(t,) must be greater than 0, and thus, at time f,, the 
velocity vector is pointing due north as claimed. 


Case 2. In this case, there is a t, < t, such that 0(¢,) lies on the y-axis (below 
O(t,), of course) and Q(t) does not lie on the y-axis for all t between ¢, and f¢,. 
Then for t, < t < t,, 0 lies entirely to the left of the y-axis or entirely to the right. 
Without loss of generality assume the former, and concentrate on the restriction of 
Q, to the interval [t,, t,]. Since © is continuous on this interval, there is a t, such 
that t, < ty <¢, and O(¢,)) has minimal x-coordinate. Then x’(t,) = 0. Therefore, 
since 0'(t,) # (0,0), O(¢,) points due north or due south. If 0'(t,) points north, 
we are done. So assume, to arrive at a contradiction, that 0'(t,) points south. 

Now Q((f,,¢,) lies in the vertical strip {(x, y)|x(t)) < x < 0}. Furthermore, 
since (t,) points due south, then for all t < t) (resp., t > t,) sufficiently close to 
ty, Q(t) lies above (resp., below) the line y = y(t,), and as close to vertical as you 
please. But O({t,, ¢,]) separates the vertical strip, and in fact separates points 
nearly due south of O(¢t,) from QO(t,). One easy way to see this is to extend 
O([t,, to]) to a simple closed curve by going due west from O(f,), then going due 
north far enough to avoid the image of 12, next going due east to get to the right of 
the y-axis, then proceeding south to the height of O(t,), and finally due west to 
O(t,) (see Figure 2). Points of Oty, t,)) almost due south of O(¢,) lie on the 
outside of this simple closed curve and Q(¢,) lies on the inside of this simple 
closed curve. So by the Jordan Curve Theorem, O((¢,, ¢,)), which lies inside the 
strip, must cross this simple closed curve, necessarily at a point of O({t,, t,)). This 
contradicts the hypothesis that Q is an arc and, therefore, 0'(t,) points due north, 
as claimed, completing the proof. O 


O(t,) 


y= V(ty) 


x = x(ty) 


Fic. 2 
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We now derive several corollaries. 

The first is about simple closed paths in the plane. (Here it may be convenient 
to think of a differentiable simple closed path in the plane as a nondegenerate 
periodic differentiable function from R to R.) It says roughly that if you traverse a 
simple closed curve in a differentiable, nonstop fashion, then you must have been 
facing in every direction at least once. An amusing interpretation of this is that if a 
differentiable, nonstop closed path is one-to-one (i.e., simple), then the set of 
tangent directions (i.e., unit tangent vectors) is onto the unit circle. The reader is 
invited to compare this situation to closed paths that are either nonsimple, not 
nonstop, or nondifferentiable at even a single point. 


Coro.iary 1. If 6 ts a differentiable, nonstop simple closed path in the plane 
and u is any unit vector in the plane, then there is a t, such that 5'(t,)/\\6'(t,)|| = u. 


The idea is to use the Jordan Curve Theorem again. Take a point in the inside 
of the simple closed curve and construct the line through this point parallel to uw. 
This line intersects the simple closed curve in (at least) two points A and B. Part 
of the time our path goes from A to B, and part of the time it goes from B to A, 
and it is now a simple matter to apply the Topological Mean Value Theorem to 
obtain the result. 

One of the pretty applications of Rolle’s Theorem is the Intermediate Value 
Theorem for Derivatives (Darboux’s Theorem): If f is differentiable on the closed 
interval [a,b], then f’ attains all values between f'(a) and f(b). We prove the 
following analogue: 


CoROLLARY 2. (Intermediate Value Theorem for Derivative Directions). Suppose 
that Q: [a,b] > R? is a differentiable, nonstop arc in the plane. Suppose further that 
Q, has a nonzero derivative at both a and b. Then O'(t)/||Q'(t)|| takes on every value 
on at least one of the two circular arcs between Q'(a)/\\Q'(a)|| and 1'(b) /|,Q'(b) I). 
And, therefore, the set of derivative directions forms a connected subset of the unit 
circle. 


(Recall that we are not assuming that 0 is continuously differentiable, or this 
wouldn’t be very surprising. Also notice that this corollary implies Darboux’s 
Theorem: let Q(t) = (t, f(t)) and use the fact that (' never points left.) 


Proof. Consider the set S = {(s,t)\a <s <t <b}. Then S is connected, and 
the functions A(s,t) = [O(@) — Os) /[t — s] and H(s,t) = h(s, t) /\|hGs, t)|| are 
continuous on S. Therefore the image of AH is a connected subset of the unit 
circle. And since it contains points arbitrarily close to 0'(a)/||OQ'(a)\| and 
0'(b) /||Q'(b)\|, it must contain one of the two arcs of the unit circle between these 
two points. Now let A denote this circular arc. If u belongs to A, there is an (s, f) 
in S so that H(s, t) = u, ie., so that [OG@) — OCs) JLOG@ — O(s)]i| = u. We now 
apply our Topological Mean Value Theorem to the differentiable, nonstop arc 2 
restricted to the interval [s, t] to obtain a f, in (s, t) such that Ot.) NOt) =U, 
completing the proof. 


Notice that we really do not require the full strength of the hypothesis that 2 
be an arc. For example, this result can easily be generalized to the cases in which 
Q) is “locally one-to-one” and ‘“‘piecewise one-to-one.”” We leave it to the inter- 
ested reader to find his or her favorite generalization. 
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The statement of Corollary 2 suggests several natural questions. Suppose 2 is a 
differentiable, nonstop path which also has nonzero derivatives at its endpoints. 
Must the set of velocity vectors be a connected subset of the plane? If not, must the 
set of speeds be a connected subset of the real line? Finally, must the set of derivative 
directions be a connected subset of the unit circle even if Q is not piecewise 
one-to-one? It turns out that the answer to all three questions is ““No.” Here is a 
monster-making “machine”*: Let B: [1/2,1] — [1/2,1] x [-—1/2,1/2] be any 
continuously differentiable path such that: 


(a) B is nonstop; 

(b) BU /2) = 1/2, 0) and BC) = C, 0); 
(c) B'1/2) = B’'C); 

(d) B’(t) # C, 0) for all ¢. 


Roughly speaking, we extend B to a path © on the interval [0, 1] which looks 
like B on [1/2,1] stuck to two copies of B of 1/4 the dimensions on the interval 
[1/4,1/2] stuck to four copies of B of 1/16 the dimensions on the interval 
[1 /8, 1/4], and so on, with (1(0) = (0, 0) (See Fic. 3). The path © has as its velocity 


1/2 


maT MH Z\ 
0 LA ALA A 


~1/2 


Fic. 3 


*For readers who may be interested only in the answer to the first two questions, simpler examples 
exist. For example, if p,q, and r are real numbers, the path given by 
O(t) = (pt + rt? cos(1/t), qt + rt? sin(1/t)) if0 <t <1 and (0) = (0,0) 


has as its set of velocity vectors a spiral about a circle of radius r centered at (p, q), together with its 
center. However, our “machine” answers all three. In addition, the “machine” may be modified to 
produce a differentiable path whose set of velocity vectors is the entire plane. 
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vectors precisely the velocity vectors of B together with '(0) = (1,0). Finally, if B 
has length L, then 0 has length 2L, and if we parametrize (1) using arc length 
then we obtain a path 0, with 0,(0) = (0,0), 0,(2L) = (1,0), and QO, has unit 
speed except at time 0 at which time its speed is 1/2 L. 

We conclude with a two-dimensional version of L’H6pital’s Rule. (Perhaps this 
is appropriate as L’Hopital’s Rule seems to be one of the more popular applica- 
tions of Cauchy’s Generalized Mean Value Theorem.) 


CoroLiary 3. Suppose that Q(t) = (x(t), y(t)) is a differentiable, nonstop path 
defined on the closed interval {a, b]. If is one-to-one on some neighborhood of a, 
and lim ((t) = (0,0) as t approaches a*, then as t approaches a‘, lim Q(t) /\|\O.(4))| 
= lim 0'(t)/||O'()||, assuming the latter limit exists. 


One interpretation of this is that if a particle starts at the origin in the plane, 
then, under reasonable conditions, its “position directions” are closely approxi- 
mated by its “velocity directions.” Another interpretation, which is perhaps even 
more intuitive, is that if a particle has constant mass and zero initial velocity, then 
(under suitable conditions) its unit velocity vectors near the start are approximately 
its unit acceleration vectors (i.e., its force direction vectors). In other words, if a 
particle is at rest, then the direction it goes is pretty much the direction you push 
it. I guess the only real surprise here is that we are not even assuming that the 
force is continuous. 

The proof of this corollary is identical to the usual proof of L’H6pital’s Rule 
given the Generalized Mean Value Theorem and is omitted. 


Disks and Shells Revisited 


WALTER CARLIP 
Department of Mathematics, Ohio University, Athens, OH 45701 


It is a common practice in calculus courses to use the definite integral to define 
the area between the graph of a function and the x-axis (see, e.g., [2, p. 252], 
[3, p. 221], and [4, p. 238]). Soon after, the student is taught two methods to 
calculate volumes of solids of revolution—the disk method and the shell 
method—usually with no mention of how volume is defined. Most calculus books 
follow the introduction of disks and shells with several examples in which it is 
shown that both methods of calculating the volume yield the same answer. The 
alert student is sure to wonder whether this is always the case. 

The equivalence of the disk and shell methods was proven in [1] using integra- 
tion by parts. We present here a different approach, one that uses only elementary 
ideas and illustrates an important proof technique. 


THEOREM. Let f(x) be a continuous, invertible function on the interval [a, bl, 
where a > 0. Suppose the region bounded by y = f(b), x = a, and the graph of f(x) 
is rotated about the y-axis. Then the values of the volume of the resulting solid 
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obtained by the disk and shell methods are equal. That is, 


(7[ f(y) ]° - 7a?) ay = [-2~x[ f(b) — f(x)] de. 


f(b) 
f(a) 


The Ingredients. Early in most calculus curricula, Rolle’s Theorem is used to 
prove the following principle, which is then applied repeatedly. 


Principle. If f(x) and g(x) are two functions that satisfy: 
(a) f(x) and g(x) are differentiable on an interval [a, b], 
(b) f(x) = g(x) for allx € [a,b], and 
(c) f(c) = g(c) for one point c € [a, b], 

then f(x) = g(x) for all x € [a, bl. 


Informally, this says that two functions are equal if they are equal at one point 
and have identical derivatives. Although this principle is surprisingly simple, and 
easily absorbed by students, it has numerous applications and reappears often. 
Emphasizing this principle helps students see the similarity between proofs that 
otherwise seem unrelated. 

The other ingredients of the proof of our theorem are the Fundamental 
Theorem of Calculus, the chain rule, and the product rule. 


Proof. Let t € [a,b]. We define two functions of t as follows: 
f@ _ 2 
V0) = f(a 02) 
and 


W(t) = f2mx[ f(t) — f(0)] ae. 


We need to prove that V(b) = W(b), as these are the two volumes in the 
theorem. It is easy to show that V(t) = W(t) for all t © [a,b] by applying the 
principle given above. 

First simplify W(t): 


W(t) = Qarf(t) [xdx — 2a f'xf(x) de. 

Now, by the Fundamental Theorem, both V(t) and W(t) are differentiable on 
the interval [a,b]. Furthermore, V(t) = (al f-'Cf(@)) -— wa’)f't) = 
m(t* — a”)f'(t), and 

W'(t) = 2arf'(t) fxdx + [2af(t)t — 2ref(t)] 


t2 — 2 


: | = (0? ~ a?) f(t). 


= 2=P(0| : 


Thus, W(t) = V(t) for all ¢ € [a, b]. It remains only to observe that V(a) = W(a) 
= 0, by a fundamental property of integrals. O 
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L’Hopital’s Rule Via Integration 


DONALD HARTIG 
Mathematics Department, California Polytechnic State University, San Luis Obispo, CA 93407 


In elementary calculus texts L’H6pital’s rule is usually proven only for the case 
0/0, x > x, (finite), by applying the Cauchy mean value theorem. Extension to 
x — © is then accomplished by replacing x with 1/x. Verification of the rule for 
the «/co indeterminate form is regarded as too difficult and may be discussed in 
an exercise, an appendix, or not at all. In this note we give a proof for the »/ 
case that does not make use of the Cauchy mean value theorem. Instead, we 
require that the functions have continuous derivatives and take advantage of the 
order properties of the definite integral. The argument adapts nicely to the case 
0/0 as well. 


L’HGPITAL’s RULE. © / ©, Let f and g have continuous derivatives with g'(x) # 0. 
If lim,_,.. f(x) = ~, lim, ,,, g(x) = ~, and lim,.,,, f'(x)/g"(x) = L, then 
lim, ,. f(x)/g(x) = L also. 


Proof. We assume that L is finite; the other case can be handled in a similar 
fashion. The limit hypothesis on g allows us to assume that it is a positive function. 
Moreover, since g’ is continuous and nonvanishing it too must be positive. 

Let « be some positive number. Choose M so that 


whenever x > M. Since g’'(x) is positive we have 


| f(x) — Lg'(x)| < eg"(x), 
so that 


fre — Lg'(x)| de 


b b 
< [ol f'(x) — Le'(x)|de < foeg'(x) dx 
whenever M <a <b. Therefore, for such a and b, 
| f(b) — f(a) — L[g(b) - g(4)]| < ele(b) — g(a)]. (*) 
Dividing through by the positive number g(b) we obtain 


fb) _ fla) -1}1- 25 <¢- S| <eé 
g(b)  g(b) g(b) g(b) | 
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It follows easily that 


fo) |e, OL, 8@) 
g(b) g(b) g(b)- 
As b increases, g(b) grows larger and larger without bound. Consequently, both 


[fCa)| /g(b) and |L|g(a)/g(b) will eventually become (and remain) smaller than e«, 
implying that 


for all b sufficiently large. This shows that lim, ,,, f(x)/g(x) =L. O 

Since our proof of this version of L’H6pital’s rule makes no use of the 
assumption lim, _,,, f(x) = ~, that condition can be dropped from the hypotheses. 
A straightforward variation of the preceding proof works when x — x,; alterna- 
tively, that case can be derived from the x — © case by considering F(x) = 
f(x, + 1/x) and G(x) = g(x, + 1/x). 

This type of proof also applies to the indeterminate form 0/0. For example, if 
we assume that L is finite and g’ is a positive function (as was the case above), 
then allowing b to increase without bound in inequality (*) will force f(b) and 
g(b) towards 0, implying that 


|-f(a) - L|-g(a)|| < e[-g(a)]. 
Dividing by the (positive) number — g(a) reveals that 
| f(a) 
—f 
g(a) 


<€ 


whenever a > M. 
As you can see, the algebra for 0/0 is a bit simpler making this proof even more 
suitable for popular consumption. 


Acknowledgment. The author wishes to thank the referee for helpful comments on the arrangement 
of the proofs. 
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polynomial we mean a line tangent to the graph of the polynomial at more than 
one point. For example, the x-axis is a multiple tangent of x* — 2x7 + 1.) 


(a) Prove that M(n) < (” 5 *), 
(b)* Prove or disprove: M(n) = (" 5 *). 
(Cf. E 2886 [1981, 349; 1984, 312]. 


E 3424. Proposed by Paul Erdos, Hungarian Academy of Science, Budapest. 


(i) Given an integer r > 1, prove that there is a positive integer n, such that for 
every n>n, at least one of the binomial coefficients (i> l<k<« 
n — 1, is divisible by the r-th power of some prime. 

(ii) Prove that n, can be taken as 23. (The binomial coefficients (1); l<k< 
n — 1, are all squarefree when n = 2,3,5,7, 11, 23.) 


E 3425. Proposed by Shmuel Rosset and Tamir Shalom, Tel Aviv University, 
Israel. 


Suppose A and B are n by n matrices over a field k. 

(i) Prove that there is an n by n matrix X over k with AX + XA = B if and 
only if trace (BC) = 0 for every n by n matrix C with AC = —CA. 

Gi) Assume char(k) = 0. If AB = —BA and if the matrix equation AX + XA 
= B has a solution over k, prove that B is nilpotent; for each positive integer n 
give an example in which B” = 0 but B"~! + 0. 


E 3426. Proposed by J. Michael Steele, Princeton University, Princeton, NJ. 


Suppose that {a,}?_, is a sequence of non-negative real numbers such that 
Laz, <o, Show that, for any positive constant c, there exists an increasing 
sequence of positive integers {n,}?_, such that n, = ck? + O(k) and such that 


fore) Hy 
> a; a; < @, 
k=1 j=l 


SOLUTIONS OF ELEMENTARY PROBLEMS 
The Thinnest Annulus Containing Four Points 


E 3296 [1988, 953]. Proposed by Robert M. Young, Oberlin College, Oberlin, OH. 


Given four arbitrary points in the plane, describe a procedure for constructing 
the thinnest annulus containing all four points. (An infinite strip is regarded as an 
extreme case of an annulus.) The thickness of an annulus is the difference between 
the two radii (or the width of the infinite strip). 


Solution by Allen J. Schwenk, Western Michigan University Kalamazoo, MI. We 
will show below that a minimum thickness annulus must have two of the four 
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points on each bounding circle (or each side of the strip). Given this, we need only 
consider the three possible pairings of two points for each boundary. For each 
pairing, we use the fact that the center of any circle through two points lies on the 
perpendicular bisector of the segment between them. Thus, we determine the 
bisectors for the two pairs in a pairing, and their intersection is the center of 
the corresponding annulus. Finally, we compare the thickness of the three result- 
ing annuli. If the bisectors are parallel, then they intersect at infinity and the 
annulus is an infinite strip. If the bisectors are identical, then the four points are 
cocircular or collinear and the minimum thickness is zero; for if the common 
bisector is taken as the y-axis in a Cartesian coordinate system and if the points 
have coordinates (+a, 0) and (+b, h), then (when h # O) the points lie on a circle 
with center at (0,c), where c = (b? + h? — a”)/(2h). 

Compactness assures that the minimum thickness is attained. To see that each 
boundary circle must contain (at least) two of the four points, note first that each 
must contain at least one of the points, or else we can expand or shrink, reducing 
the thickness, until it does. Now, suppose that only one of the four points, A, lies 
on the outer (inner) circle of a given minimizing annulus with center O. Then we 
can translate O toward (away from) A by an appropriate distance ¢, simultane- 
ously shrinking (expanding) both circle radii by « to avoid changing the thickness, 
so that at most one of the points lies on the other circle, with equality only if the 
line OA contains that point. If the inner (outer) circle now contains no points, we 
can expand (shrink) it to reduce the thickness. If it contains one of the four points, 
say B with OBA collinear, then we can rotate the center around whichever of B or 
A is between the other and O. Having done this, the outer circle now contains none 
of the points and can be shrunk. The argument is essentially the same in the case 
of an infinite strip. 


Editorial comment. All solvers noted that the possible locations of the center are 
at the intersections of two perpendicular bisectors each obtained from two of the 
four points. Schwenk, Harvey Lipman, and the Western Maryland Group reduced 
the possibilities to the three described above; the others included the four 
possibilities in which one point is left out and one of the circles contains the other 
three points. 

When there are more than four points, the argument that each circle of the 
thinnest bounding annulus must contain at least two of the points holds as before. 
This yields a quartic algorithm (as a function of the number of points) to find the 
minimum bounding annulus, but this is not best possible, and it is reasonable to 
ask for the complexity of finding the thinnest bounding annulus. See T. J. Rivlin, 
Approximation by Circles, Computing 21(1979), 93-104. 


Solved also by N. Cohen (israel), H. Lipman, O. P. Lossers (The Netherlands), T. L. McCoy, A. 
Nijenhuis, Victor Pambuccian, A. Pedersen (Denmark), D. Secrest, Western Maryland College Problem 
Group, and the proposer. 


Caught in a Pigeonhole 


E 3312 [1989, 155]. Proposed by Robert Blakley and Douglas Hensley, Texas 
A&M University College Station, TX. 


Given positive integers m,n with n/2 <m <n, suppose n tennis matches are 
to be scheduled on m successive days, with at least one match per day. Determine 
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the largest positive integer K such that, no matter how the matches are scheduled, 
for each k < K there is a consecutive sequence of days on which a total of exactly 
k matches are scheduled. 

Solution material prepared by the editors. Variants and simplifications of this 
problem appear in almost every textbook that presents the pigeonhole principle. In 
most versions, the total number of matches is at most n rather than exactly n, and 
the argument requested suffices to show that a consecutive subsequence totalling 
k is forced for any k < 2m — n — 1, although a minor refinement of this argu- 
ment also works for k = 2m —n. To wit, if a; denotes the total number of 
matches played on the first j days, then the numbers a,,...,a,, and a,+k, 

..,4,,_, + k (where a, = 0) are 2m integers in the set {1,...,n + k — 1} (since 
An—1 <n), so that k < 2m — n guarantees a duplication of the form a; + k = a,, 
O<i<j<m. 

In “Some pigeonhole principle results extended’, this MonTuty, 87 (1980) 
648-651, Joan P. Hutchinson and Paul B. Trow gave a complete characterization 
of which values of & are forced to arise as the total number of matches played in 
some block of consecutive days. They show that &k is forced if and only if 
r<2m —n,where m =r (mod k) with O < r < k. A concise presentation of their 
proof follows. See also references [1] and [2] below. 


THEOREM. For fixed m,n with n > m, let A be the set of positive integers k such 
that every sequence of m positive integers summing to at most n has a consecutive 
segment summing to k. Then k € A if and only if r < 2m —n, where m =r mod k 
with O<r<k. 


Proof. If r > 2m — n, then the sequence alternating k — 1 ones and one k + 1 
until m values are reached has sum 2m —r<n and avoids k. To prove the 
converse, it suffices to show that 2m — r is the minimum sum of all sequences of 
length m that avoid k. We prove this by induction on m, with m = 0 a trivial 
basis. If m > 0 and m is not a multiple of k, then the sum must be at least one 
more than the sum of the initial segment of length m — 1, which also avoids k. 
Now suppose m = tk. If a sequence avoids k and sums to less than 2m, then the 
partial sums a,,...,4@,, consist of m numbers from S = {1,...,2m — 1} — {k}, 
which has size 2m — 2. Partition S into m — 1 pairs by pairing x and x +k 
whenever | x /k | is even. By the Pigeonhole Principle, a uses two elements from a 
pair, which implies that the sequence cannot avoid k. O 


Since the present problem requires the sum to equal n, its feasible sequences 
are a subset of those in the Hutchinson-Trow formulation, so any value forced in 
that formulation is also forced in the current problem. Hutchinson and Trow 
observe that a number larger than m cannot be forced, as shown by a sequence of 
all 1’s. For the problem at hand, such a sequence is forbidden, since we require the 
total number of matches to equal n. Dean S. Clark and James T. Lewis consider 
this precise situation in their paper ‘“‘Avoiding-sequences with minimum sum,” 
Discrete Applied Mathematics, 22(1988/89), 103-108, independently of the earlier 
work. If n =m + 1, then every feasible sequence has a single 2 and m 1’s, and 
every value through m + 1 is forced. For any other k satisfying r > 2m — n with 
m=rmodk,0 <r <k, the Hutchinson-Trow sequence that avoids having k as a 
consecutive sum totals to at most m, but the sequence can be modified to achieve 
the total n without forcing k. The sequence consists of k — 1 ones, then one 
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k + 1, then k — 1 ones, etc. The sequence comes in groups of k items totaling 2k 
with r trailing ones. Hence the total is 20m — r) + r <n. We can augment any of 
the values equaling k + 1 to make the sequence total m without losing the 
property of avoiding k. 

Based on the Hutchinson-Trow result, the solution of the problem proposed is 


K(m,n)=m+1 ifn=m+1, 
K(m,n) = max{k:mmodj<2m-—n7 for j =1,2,...,k} otherwise. 


(Here m mod /j means the least non-negative integer congruent to m modulo /.) 
Hutchinson and Trow note numerous special cases of m,n in which a more 
“exact” or at least more satisfying solution can be given, but for general m,n no 
better answer than that above is known. The values through 2m — n are always 
forced, as are those from m — x through m if x = 2m — n — 1; this means that 
whenever n < 3m /2 the answer is m. For n > 3m/2 the answer is less than m. 


REFERENCES 


1. R.L. Hemminger and B. D. McKay, Integer sequences with proscribed differences and bounded 
growth rate, Discrete Math., 55 (1985) 255-265. 

2. W.D. Wei and C. L. Liu, Solution to a problem of J. P. Hutchinson and P. B. Trow, J. Math. Res. 
and Exposition, 3 (1986) 139-140. 


Lewis and Clark contributed the reference to their recent solution. Several readers contributed the 
usual elementary argument that begins the discussion above, and there were several incorrect solutions. 


A Greedy Change-Making Machine 


E 3323 [1989, 357]. Proposed by Thane Plambeck, Stanford University, Stanford, 
CA. 


A change-making machine gives change in chips of three integer denominations, 
1 cent, a cents, and B cents, where 1 < a < B. The machine uses the following 
greedy algorithm: In making @M cents change it dispenses chips of value B as long 
as possible, then chips of value a as long as possible, then chips of value 1 to 
complete the total of M. We say that a pair a < B 1s frugal if, for every M, the 
machine’s algorithm dispenses the minimum number of chips among all ways of 
making change for M cents with denominations 1, a, 8. (For example, (2,5) is 
frugal but (4,9) is not, because the (4,9) machine dispenses four chips in change 
for 12 cents instead of the minimum of three.) 

For fixed n > 2, suppose the values a < B are two distinct integers chosen at 
random from {2,3,...,”} with all pairs equally likely. Obtain an asymptotic 
formula (when n is large) for the probability that (a, B) is frugal. 


Solution by Richard Stong, Harvard University. Cambridge, MA. Let F(n) be the 
number of frugal pairs (a, B) with 2 < a < B <n. We will show that the desired 
probability is 


___ Fir) 8 “1/2 + Of ~1) 
Pn“ (n—1)(n—-2) 3° aa 


More specifically, F(n) = 4k°/3 — 5k*/2 + k/6 + 1 — (k* — n)\2k — 3) for 
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k(k -—1)<n<k? and F(n) = 4k7/3 — 5k?/2+k/6+1+(n —k*)Qk — 2) 
fork? <n<k(k +1). 

We begin by showing that (a, B) is non-frugal if and only if there is an integer f 
such that (¢ — l)a < B < t(a — 1). If there is such a ¢ (the inequality implies 
1 <t <q), then for ta the (a, B) machine dispenses one B and ta — B ones (by 
the first inequality), which exceeds the ¢ copies of a that are optimal (by the 
second inequality). Conversely, suppose M is a value for which the (a, B) machine 
is not optimal. It can fail only by using too many B’s, which means that the 
minimum must be achieved by changing s copies of B and i copies of 1 into r 
copies of a, where i1<a, s+i>r, and s8+i=ra. Substituting the two 
inequalities for i into the equality yields (yr — l)a < sB < r(a@ — 1) + s. Now we 
divide by s and set t=[r/s]. Because |(r — 1)/s|= |[(r —s)/s|, we have 
(t-lha<B <tla—-1)4+1. 

We next use this characterization to count, for a given value B, the number of 
values a such that (a, B) is frugal; this equals \2V8 | — 3. The characterization 
implies that a non-frugal (a, B) satisfies B/t + 1 <a < B/(t — 1) for some inte- 
ger t > 2. Since we also have t < a, we may assume that 2<t<k if B is in 
the interval (k(k — 1), k(k + 1)]. For B in this range, we have k — 2 intervals 
that a must avoid for (a, B) to be frugal. Although a cannot be higher than the 
first interval ending at B — 1, it can be lower than the last, and this yields 
\(B — 1)/k| —1 values of a with 2 <a < B/k +1 such that (a, B) is frugal. 
Between the interval for ¢ and the interval for ¢ + 1 we have B/t+1>a2>B/t, 
and there is exactly one such integer a, so we have k — 2 additional values of a 
such that (a, B) is frugal. Since |(8 — 1)/k| is k — 1 for k(k — 1) < B < k? and 
is k for k* < B < k(k + 1), we obtain a total of 2k — 3 or 2k — 2 values of a in 
the two ranges. This is summarized by the formula \2VB — 3 for the number of 
frugal pairs with fixed B, as claimed. 

This yields the formula F(n) = D%_,(|2Vvb| — 3). Thus F(n) = 4n3/2/3 + 
O(n), which yields the asymptotic formula claimed for the probability that a pair is 
frugal (since there are (n — 1)(n — 2)/2 pairs to choose from). 

If k(k —1) <n <k?’, then 

k(k—1) 


F(n) =(n—k(k — 1))(2k - 3) + p (|2vb | - 3) 


2k—-1 
= (n—k(k—1))(2k- 3) + Le Gi - 3)L7/21 
j=4 
k-1 k-1 
=(n —k(k — 1))(2k — 3) + yy (2r —3)r+ yy (2r — 2)r 


2 
= (2k — 3 8 ke ok + LL 
( n—~ 3 2 6 


A similar calculation for k* <n < k(k + 1) yields the second formula claimed for 
F(n). 


Editorial comment. Paul Zwier characterized the frugal pairs for fixed a. These 
consist of all (a, 8) with B >a’ and the pairs (a,ka—j) such that k € 
{2,3,...,a} and 7 € {0,1,...,k — 1}. This yields another way to calculate F(n) 
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and translates directly to Joel Spencer’s statement of the condition for frugality: 
With j,k being the integers defined uniquely by the conditions B = ka —j and 
0 <j <a, the pair (a, B) with a < B is frugal if and only if j < k. The proposer 
gave the reference L. Chang and James F. Korsh, “Canonical coin changing and 
greedy solutions,” J. Assoc. Comp. Mach., 23 (1976), 418-422, in which similar 
problems are discussed. 


Solved also by H. Lipman, J. Spencer, P. J. Zwier, and the proposer. Partially solved by M. Bazant 
(student). 


A Deviation From A Binomial Coefficient Identity 


E 3327 [1989, 445]. Proposed by Stanley Rabinowitz, Westford, MA. 


Suppose v7 1s a positive integer greater than 3. Put 


F(x)= ¥ 


k=0 


(i. } — x"), S,= minF,(x). 
x>O0 
(a) Prove that 
c2” <S,<F(1) =2”-n-1 
for a suitable positive c. 
(b) Is it true that lim, ,,.2~”S, = 1? 


Solution by Ilan Kozma, Tel Aviv, Israel. (a) By definition, S, < F,(1). To prove 
the strict inequality S, < F,(1) for n > 3, choose x, = n'/". For 1 <x <x, and 
1<k<n-—1, we have (") —x*¥>n—-—x">n-—x%>0; hence F(x) =x" — 
1+ DR (") —x*) for 1 <x <x. The derivative of this expression is F/(x) = 
nx”-1 — prot kx*-!) Since F/(1) =n —(%), which is negative for n > 3, we 
conclude that S, < F.(1 + «) < F,() for sufficiently small positive «. The exis- 
tence of c satisfying S$, > c2” follows from (b). 


(b) We shall prove that the answer is yes by using the following lemma. 


Lemma. If n is sufficiently large and 0 < k <m = |2n/3], then 
i) >" 


Proof of Lemma. lf n > 2and1<k < (n+ 1)/3, then 


(i (an )- > 


and so the desired inequality follows by induction for 1 < k < (n + 1)/3. On the 
other hand, if (n + 2)/3 <k <m, then (") > (”) and so it suffices to prove that 
( n) > 2”: but if n is sufficiently large, Stirling’s formula gives 


n c 3 . 2n /3 m 
(in) ° al gr > 2 2 


for a suitable positive constant c. Thus the lemma is proved. 
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We return to the proof of the assertion of (b). If x > 2, then 
bet YZ) 
k=0 


k=0 
Hence we need consider only 0 < x < 2. By the lemma we have for 0 < x < 2 and 


F(x) > 


> ¥2k-2"=2"-1>F (1). 
k=0 


n large 
2-"F(x)>27 Ye (i. — xk 
k=0 
>27" Y (i) -2” yy 2*. ( *) 
k=0 k=0 


As n goes to infinity, the second term in (*) approaches zero exponentially, while 
the Law of Large Numbers implies that the first term approaches 1. Hence 
lim, 502 "5, = lL. 


n-o 


Editorial comment. Instead of using the Law of Large Numbers in the above 
proof, we could use the explicit estimate 


rae E(E)a2" ES (e) <2’ 1) 02% (zr) | 


k=0 k=m+1 
given by Stirling’s formula. 


Solved also by H. Ki (Korea), H. Lipman, and the editors. 


Embedding a Planar Triangulation with Vertices at Specified Points 


E 3341 [1989, 642]. Proposed by P. Gritzmann, University of Siegen, West 
Germany; B. Mohar, University of Ljubljana, Yugoslavia; J. Pach, New York 
University and Mathematical Institute of the Hungarian Academy of Sciences; and R. 
Pollack, New York University. 


Let G be a planar graph obtained from a convex n-gon by triangulating its 
interior, 1.e., by subdividing the interior into triangles using non-crossing diagonals. 
Let P be an arbitrary set of m points in the plane no three of which lie on a line. 

Show that G can be straight-line embedded on P, 1.e., show that there exists a 
bijection f from the vertex set of G to P such that if (a, b) and (c, d) are distinct 
edges of G, then the open segments (f(a), f(b)) and (f(c), f(d)) are disjoint. 


Solution by the proposers. We establish a stronger statement by induction on n: 
if v,,U, are any two consecutive vertices of the convex n-gon from which G is 
obtained, and p,, p, are consecutive vertices of the convex hull of P, then G can 
be straight-line embedded on P such that f(v,) = p, and f(v,) = po. 

The statement is trivial for n = 3; assume n > 3. Let v,,vU5,...,v,, denote the 
counterclockwise ordering of the vertices of G on the original n-gon, and let v; be 
the third vertex on the triangle containing v,,v,. We claim there is a point 
p © P — {p,, p,} with the property that (a) no point of P is inside p, p,p and (b) 
there is a line / through p that separates p, from p,, meets P only at p, and has 
exactly i — 2 points of P on the side of / containing p,. To obtain p, we rotate the 
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line p,p, about p, until we reach a line /' = p,p' with p’ € P such that exactly 
i — 3 points of P are separated from p, by /'. Among the points of P — {p,, p»} in 
the closed halfplane determined by /' that contains p,, let p be the point 
minimizing the angle p,p,p. By this choice, p satisfies (a), and there are at most 
i — 2 points of P on the side of p, p containing p,. If we rotate this line about p, 
then before it becomes parallel to /' it reaches a position / satisfying (b). 

Let H, and H, denote the closed halfplanes determined by / containing p, and 
Pp», respectively. By the induction hypothesis, the subgraphs of G induced by 
{v,,U3,...,U,} and {u,,U;,1,...,U,,U,} can be straight-line embedded on H, 1 P 
and H, 1 P, respectively, so that v,,v,,v; are mapped to p,, pj, p. Combining 
these embeddings yields a straight-line embedding of G with the desired proper- 
ties. 


No other solutions were received. 


The Fixed Points of a Permutation 


E 3342 [1989, 642]. Proposed by Irving Adler, North Bennington, Vermont. 


Let S = {1,2,...,7} and define a permutation f:S — S as follows. Take n? 
cards numbered from 1 to n’ and lay them in a square array, with the i-th row 
containing cards G@ — 1)n + 1,@/ — 1)n + 2,...,in in order from left to right. Pick 
up the cards along rising diagonals, starting with the upper left-hand corner. If 
card j is the k-th card picked up, put f(j) =k. For example, if n = 3, then 
fa) =1, f4=2, f2)=3, f=4, f6)=5, fB)=6, f@)=7, fO=8, 
f(9) = 9. 

For each value of n both 1 and n? are fixed points of f. If n is odd, then 
(n? + 1)/2 is also a fixed point of f. Characterize those n for which f has a fixed 
point not in {1, 7*,(n* + 1)/2}. 


Solution by José Heber Nieto, Universidad del Zulia, Maracaibo, Venezuela. The 
permutation f has a fixed point not in the set {1,*,(n* + 1)/2} if and only if 1 
has at least two different prime factors. We may assume n > 1. 

Let the rising diagonals be numbered 1 through 2 — 1, beginning at the upper 
left-hand corner and proceeding to the lower right-hand corner. By symmetry a 
number x in diagonal k with k <n is fixed if and only if n* + 1 — x in diagonal 
2n — k is fixed. It thus suffices to consider numbers in the first n diagonals. The 
number of the card originally in diagonal k and row i, where 1 <i <k <n, is 
x=(G-1)n+k -—i +1. Further 

k-1 
f(x) = Vijrk—-iti. 


j=l 


Hence x = f(x) if and only if 
(i-1)n=k(k -1)/72 ( *) 


It follows from (*) that each diagonal contains at most one fixed point. Thus f 
has a fixed point not in the set {1,,(n? + 1)/2} if and only if (*) has a solution 
(i,k) such that 1 <k <n and1 <i <A. Since k and k — 1 are coprime, no such 
solution exists if m is a prime or prime power. 


1991] PROBLEMS AND SOLUTIONS 167 


If n is not a prime or a prime power, then m = rs, where r and s are coprime 
and both greater than one. Further, we may assume that s is odd. Since 2r and s 
are coprime, there are integers a and B such that 


2ra+sB =1 (* *) 


Now, if the pair (a, B) satisfies (* *), so does the pair (a + st, B — 2rt) for any 
integer t. Thus we may assume that —r < B <r. In view of (* *) we must have 
B # 0, r and thus there are two cases to consider. 

Case I, —r < B < 0, a > O. In this case we take k = 2ra,i = 1 — aB. 

Case II, 0 < B <r, a < 0. In this case we take k = sB, i = 1 — aB. 

In both cases k(k — 1)/2 = —aBrs = —aBn = (i — 1)n. Straightforward cal- 
culation shows that3 <k <n —2and2 <i <n/2. Thus we have a fixed point of 
f in either case. 


Editorial comment. Several other solvers also reduced the problem to finding a 
solution of («) with 1 <k <n and 1<i<n.B.M.M. de Weger showed that if 
m is the number of distinct prime divisors of n, then the number of fixed points of 
f Gncluding the trivial ones) is 2”*! — 1 if n is odd and 2” if n is even. 


Solved also by D. Callan, K. David & A. Shilepsky, J. Fukuta Japan), M. Hartman & D. S. Rubin, 
R. J. Hendei, R. H. Jeurissen (The Netherlands), H. Lipman, O. P. Lossers (The Netherlands), L. F. 
Martins (siudent), W Raffke (West Germany), J. H. Steelman, F. van Dyke, J. T. Ward, B. M. M. 
de Weger (The Netherlands), J. Zhang (China), and the proposer. 


Rationals as Products from a Sequence of Rationals 


E 3347 [1989, 735]. Proposed by Vlastimil Dlab, Carleton University, Ottawa, 
Canada. 


Given an arbitrary natural number N, show that every rational number between 
QO and 1 can be written as a finite product of numbers of the form 


n(n + 3) 


= ——__—_ ; >N. 
"(n+ Din+2)’ ” 

Solution by Kerry Behler (student), Shippensburg University, Shippensburg, PA. 
By induction on 2, the product I1"*>R,,,,; telescopes to n/(m + 1). Given a 
rational number a/b between O and 1, we can write a/b = Na/Nb = 


T14?, %*~ "(Na + i)/(Na +it+1)= T1475 Ne! Tot Rye tiay- 


Editorial comment. This solution uses many factors; Roger B. Eggleton showed 
that O( N°) factors always suffice, and that O(N”) factors may be needed. John T. 
Ward noted that the argument can be generalized to other sequences, such as 
those of the form R,(n) = n(n + 3k)/(n + k(n + 2k). 


Solved also by I. C. Bivens & L. R. King, A. Christos (student), R. B. Eggleton (Brunei), K. Ford 
(student), Z. Franco, M. Getz, J. C. Jacobson, H. Ki (South Korea), N. F. Lindquist, O. P. Lossers (The 
Netherlands), R. Martin (West Germany), L. F. Martins, T. McDonald, P. P. Ray, J. H. Steelman, R. 
Stong, J. T. Ward, K. S. Williams (Canada), Western Maryland College Problems Group, and the 
proposer. 
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Coprime Sequences Summing to Unity 


E 3359 [1989, 928]. Proposed by Daniel Ullman, George Washington University, 
Washington, DC. 


Does there exist a finite increasing sequence 1 <a, <a, < ‘+: <a, of posi- 
tive integers such that 7?_, 1/a; = 1 and such that a, and a, ,, are relatively prime 
fori = 1,2,...,n + 1? 


Composite solution by O. P. Lossers, University of Technology, Eindhoven, The 
Netherlands, Richard Stong, University of California at Los Angeles, and the 
proposer. Such sequences exist. Here are four examples: 


3, 4,5, 9, 22, 25, 99, 100 (Lossers) 
3,5, 7,9, 11, 21, 22, 45, 154 (Lossers) 
2,3, 11, 23, 43, 127, 1771, 1807, 32131, 3263442 (Stong) 
2,3, 14, 33, 35, 88, 115, 273, 296, 403, 609, , 
943, 1062, 1073, 1206, 1519, 2419, 3283 (Ullman) 


No other correct solutions were received. 


ADVANCED PROBLEMS 


6649. Proposed by D. E. Knuth, Stanford University, Stanford, CA. 


Let P be a monic polynomial of degree m with complex coefficients and let w 
be a primitive m-th root of unity. Let A be the m by m matrix in which the 
element in the j-th row and kth column is 


Aj = J (w*t)e Po) dt (0<j,k <m-—1). 
0 


Prove that A is nonsingular. 


6650. Proposed by Keith Ball, Trinity College, Cambridge, England and Herman 
J. Tiersma, University of Technology, Eindhoven, The Netherlands. 


Let S be the set of natural numbers k& such that every matrix of zeros and ones 
containing exactly 2k ones must have a submatrix containing exactly k ones. (In 
the language of graph theory S is the set of natural numbers k such that every 
bipartite graph with 2k edges has an induced subgraph with k edges.) 

(a) Show that if p is a prime congruent to 1 modulo 5 such that 2p + 1 is also 
prime, then 2p and 2p + 1 are not in S. 

(b)* Does S contain infinitely many natural numbers? Are the powers of 2 all 
in §? 
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6651. Proposed by Richard L. Bishop and Lee A. Rubel, University of Illinois at 
Urbana-Champaign. 


Prove that the differential equation 


d*W 
zi 
dz* 
has no nonconstant entire solutions, but that, for every R > 0, it does have a 
non-constant solution analytic in {z: |z| < Ry}. 


4 


SOLUTIONS OF ADVANCED PROBLEMS 
Some Elliptic Curves Modulo p 


6506 [1985, 740]. Proposed by Ronald J. Evans, University of California, San 
Diego. 


Let p > 3 be a prime and write f(n) for the Legendre symbol (n/p). Prove that 


p-1 
L flm' + 2){1 — f(4m + 5)} = 1. 


Solution by Benedict H. Gross, Harvard University, Cambridge, MA. We consider 
the elliptic curves defined by the plane equations 


Ciy*=x3 4 2, 
E:y? =(x° + 2)(4x +5), 
J:y* =x? — 120x + 506 


over the field Q of rational numbers. (A good reference for the needed material 
on elliptic curves is [2], particularly Theorem 2.3 on page 24 and Exercise 5.4 on 
page 145.) 

In the table in [1, p. 129] C is the curve denoted by 1* — 1 and J is the curve 
denoted by 77* — 1 (see [1, pp. 123-125]). In particular the curves C and J are 
3-isogenous over © and hence over Z/pZ for any prime p greater than 3. (Since 
the discriminants of C and J are —2° - 3° and —2° - 3° respectively, these are the 
primes where the two curves have good reduction.) Thus C and J have the same 
number of points in projective space over Z/pZ for any prime greater than 3. 
Now, C and J each have one point at infinity, namely (w, x, y) = (0,0, 1), and so 
they have the same number of finite points over Z/pZ. If m° + 2 is a non-zero 
square in Z/pZ, there are two points on C over Z/pZ with x-coordinate m, while 
if m? + 2 is zero in Z/pZ, there is just one point (m,0) on C over Z/pZ with 
x-coordinate m. Thus the number of finite points on C over Z/pZ 1s equal to 

p-1 
ye {1 + f(m? + 2)}, 
m=0 


and similarly for the number of finite points on J over Z/pZ. Thus for p > 3 we 
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have 
y {1 + f(m ++ 2)} = y {1 + f(m — 120m + 506) }. (1) 
m=0 m=0 


On the other hand, as m runs through (Z/pZ) \ {25/4}, it is immediate that 
n = (32 — 5m)/(4m — 25) runs through (Z/pZ) \ {—5/4}. Now if we make the 
substitution n = (32 — 5m)/(4m — 25), we have 


9 
3 _ 3 
(n°? + 2)(4n + 5) (4m — 25) (m 120m + 506). 
Thus 
p-l 
\y) f(n? + 2) f(4n + 5) = > f(n? + 2)f(4n + 5) 
n=0 nE(Z/pZ)\{-5/4} 
= » f(m? — 120m + 506) 
m€(Z/pZ)\{25/4} 
p-1 


= )° f(m? — 120m + 506) — f(9/64) 
m=0 


p-1 
= )° f(m? — 120m + 506) — 1. (2) 
m=0 
To recapitulate, the curves E and J are isomorphic over Z/pZ, and the number of 
finite points on E over Z/pZ is one less than the number of finite points on J over 
L/pt. 
Combining (1) and (2) gives the assertion of the problem. 


REFERENCES 


1. Modular Functions of One Variable IV, Proceedings 1972, edited by B. J. Birch and W. Kuyk, 
Lecture Notes in Mathematics No. 476 Springer-Verlag, 1975. 

2. Joseph H. Silverman, The Arithmetic of Elliptic Curves, Graduate Texts in Mathematics No. 106, 
Springer-Verlag. 


Editorial comment. Daniel Grayson has pointed out to us that the 3-isogenies 
between C = {(p,q):q* =p? + 2} and J = {(z,w): w? = z? — 120z + 506} can 
be given explicitly, namely 


24( p + 1) 24 pq 
Z=p t+ ——> w= q -—77 
(p + 2)° (p + 2)° 
and 
z>— 1227+ 12z + 152 w(z> — 18z? + 132z — 376) 
re ae, ie 
9(z — 6)” 271(z — 6)” 


Although these explicit maps apparently do not make it possible to give a proof of 
(1) avoiding the theory of elliptic curves, they do serve to eliminate the depen- 
dence of the above solution on the tables in [1]. 


No solutions were received before the customary deadline. The above solution was prepared by 
Professor Gross at the request of the editors. 
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Expressing a Positive Integer As the Sum of As Many Distinct Squares as Possible 


6574 [1988, 559]. Proposed by Moshe Laub, Jerusalem, Israel 


Put M, = L_,j? forr = 1,2,... .Ifn & N (the set of positive integers), define 
m(n) to be the unique positive integer r such that M. <n < M,,,. 

(a) Let S, be the set n € N such that n is expressible as a sum of m(n) distinct 
squares of positive integers. Prove that S$, has asymptotic density one but that 
N\ 8S, is infinite. 

(b) Let S, be the set of n € N such that n’ is expressible as a sum of m(n”) 
distinct squares of positive integers. Prove that S, has asymptotic density one. Is 
N\ 8S, infinite? 

Solution I by L. E. Mattics, University of South Alabama, Mobile, AL. (a) First 
of all, we note that no number n such that M, <n <M, + 2r is in S, and thus 
N\ S;, is infinite. For if n > M, and n is expressible as a sum of r distinct positive 
squares, then one of the squares must exceed r’ and so 


n>?+2?4---4+(r—-1)° 4+ (r 41) =M,4 2r 41. 
In the other direction we shall prove that if 
M, + 36r°/* <n <M.,, ( *) 


and r is sufficiently large, then n is in S,. Since 


», 36k3/? < 36r°/? = O( MP”’), 
k=1 


this would show that S, has asymptotic density one. We begin with a lemma. 


LemMA. Let s be a fixed positive integer greater than 1. Suppose ty,Uy,Uny are 
positive integers depending on the positive integer N. Then the number of solutions of 


X, +X, +++ 4+x,=N (* *) 


in distinct positive integers different from ty, Uy, Uy Is 


Gopi | +o). 


Proof of the Lemma. The number of solutions of (* *) in positive integers with 
no restrictions is 


N- 1 __ | s—l1 s—2 
(yor (s— 1! + O(N"). 

Now the number of solutions (* *) in which two x, are equal is O(N‘ 7) and the 

number of solutions in which one of the x, takes a particular value t, is also 

O(N*~*) uniformly in t,. Thus the lemma follows. 

Now we turn to the proof that if n satisfies (*), where r is sufficiently large, 
then n € S,. Putn — M, = 27p + q withO <q < 2r. Put P=p —2,Q=q+4r 
if p —qiseven and P=p—1,Q=q +4 2r if p — q is odd. Then n — M, = 2rP 
+ Q, where 2r < Q < 6r and P — Q is even. Since r* + 2r >n — M. > 36r°”, 
we have r/2 > P > 18r'/* — Q/(2r) > 18r'/* — 3. Choose a positive integer b 
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such that b? < QO < (b+ 1)’. Then 

O-(b-1)’ <(b4+1)’- (b—- 1)’ = 4b < 4(6r)'”, 

Q—(b-1)° > (2b- 1) = 2(Q'” — 1) —1 > 2(2r)'” — 3, 
and 

P—(b—1) > 18r!/”2 — 3 — (6r)'” +1 > 157!” 
for sufficiently large r. Since P — Q is even, there is an integer m < r/4 with 
2m =P —(b-1)—{Q-(b~1)'}, 
where 
2m > 15r!/? — 4(6r)'” > 5r'/2 > 2(b — 1) 


if r is sufficiently large. Now let s = 2 if Q — (b — 1)’ is even and s =3 if 
OQ —(b — 1)” is odd. Recall that Q — (b — 1)? > 2(Q2r)'/”? — 3. Thus, if r is 
sufficiently large, we may apply the lemma with 2N + s = QO — (b — 1)* to deter- 
mine positive integers j, <j, < °‘:: <j,suchthatj,#b—2,j,#m—1,j,#m 
fori = 1,2,..., 5 and 


5 


Q-(b-1)°= ¥ (2i, + 1). 


k=1 
Clearly j, < {Q — (b — 1)*}/2 < 26r)'”* <r for sufficiently large r. Thus we 
have 


n=M,.+2rP+Q 
=M,+(r+b—-1)’-r? 4+ 2r{P—(b- 1)} + {Q - (b- 1)’} 
=M,+(r+b-1)?—r?+ 2r{[P- (b-1)] -[Q- (b- 1)]} 
+(2r + 1I){Q — (b- 1)} 


=M,+(r+b—1)°—r? 4+ 4rm + (2r +1) ¥ (2), + 1) 
k=1 


=M.+(r+b—-1)?-r?4+ (rtm) —(r—-m)’ 


+ > {(r + J, + 1)° —(r ~j,)'}. 
k=1 


Since 0 <j, <j, < ++: <j, and j, #m for k =1,2,...,5, the squares r?, 
(r —m)*,(r —j,)*,(r — j,)’,...,(r —j,)* are distinct. Since j, # m — 1 and j, # 
b—2 for k =1,2,...,s, the squares (r+ b — 1)?,(r+m)’,(74+j, + 1’, 


(r+j,+1)’,...,¢7+j, + D* are all distinct. Thus n is equal to the sum ob- 
tained from 17 + 2*+---+4+-7r7 by deleting s + 2 of the squares and replacing 
them by s + 2 squares greater than r’. Hence if n satisfies (*) and r is sufficiently 
large, then n € Sj. 

(b) If r = 48a’ for some positive integer a and n = 192a° + 3a, a simple 
calculation shows that n* = M, + r/48, so that 


M, <n’? <M, 4+ 2r. 
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Thus, if r is of the above form and n is chosen as indicated, n” 


as a sum of r distinct positive squares. Hence N \ S, 1s infinite. 
In the other direction we note that if 


Mi? 4+32<n<MhZ (* * ¥) 


r 


is not expressible 


and r is sufficiently large, then n is in S,; for if (+ * *) holds, then 


M,,, >n?>M,+ 64M)? > M, + 64(r3/3)'” > M, + 3673”, 


} 


so that M,,, >n* >M,+ 36r°/*, n* € S,, and n€S,. Since 32r = 
O(M!/*)/2), this shows that S, has asymptotic density one. 

Solution II by the Editors. We prove the generalization of the assertion of the 
problem in which squares are replaced by k-th powers and M, becomes Lj_, i*. 
We require the following theorem which can be readily proved by the Hardy- 


Littlewood method as presented in [1] or [2]. 


THEOREM. Given a positive integer k > 1 and a positive number «e, every suffi- 
ciently large positive integer N can be expressed as a sum of 2* + 1 distinct positive 
k-th powers each of which is greater than N/{(2* + 1) + «)}. 


For k = 2 this theorem (and more) was proved by E. M. Wright, Quart. J. 
Math. (Oxford) 41 (1933) 37—51 and 228-232. 
To obtain the first part of (a) we prove that for large r any n satisfying 


M, + 2(2* + l)er* <n <M.,,, (*) 


can be expressed as a sum of r distinct positive integer k-th powers. For if n 
satisfies (*), then 

. ak 

n—M,_y«_,> Xi (r—j)* + 2(2* + Der* 


} fa 
> (2*+1)(1 + e)ré 


if r is large. By the above theorem n — M,_,«_, can be expressed as a sum of 
2* +1 distinct integer k-th powers each greater than r*. But then n= 
(n — M._»«_,) + M._»x_, can be expressed as a sum of (2* + 1) + (r — 2* — 1) 
=r distinct positive integer k-th powers. 

To obtain the second part of (a), note that if nm > M, and n is expressible as a 
sum of r distinct positive integral k-th powers, then one of the k-th powers must 


exceed r* and so 
n>ke+2k4+---4(r—1)8 4+ (74+ 18 =M, 4 (74 1k = rk. 


Thus, none of the integers M.+ 1,M,+2,...,M.+(r+ 1) —r* —1 is ex- 
pressible as a sum of r distinct positive integer k-th powers and so N\ 5S, is 
infinite; in fact the argument shows that the number of elements of N \ S, not 
exceeding x is greater than x'~!/““** for large x. 

To obtain the first part of (b) note that the differences between consecutive 
integer k-th powers in the interval [M,, M.,,) are less than kM/,|/*. This is small 
compared to 


1-1/(k +1) 


M,,,-M,=(r+1)*={(r4+1)*} > Mia MerD, 
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Thus “most” integer k-th powers must be in some interval of the form 
(M, + 2(2* + l)er*, M.,,) and so are expressible as a sum of r distinct positive 
integer k-th powers by our solution of (a). 

To obtain the second part of (b), we recall (thanks to Jacob Bernoulli) that 


peti r* k 
M = po pk k-2 
’"k+1° 2° 12" or) 
r k+1)\* k?-3 
= +——J| - ) 4 O(r*~*), 
k+1\" 2k 24k | ve) 
Thus if r = (k + 1)(2ka)* for some positive integer a, then 
r k+1 k k 
———|r + ——] = {(k + 1)(2ka)**" + = uk 
ha] r 5k | {( )(2ka) a(k + 1)} u 


is the k-th power of an integer u, and further for large a we have 
k k 
O<ut—M,< sors <(r4+1)* -r*. 


Thus u* is not expressible as a sum of r distinct positive integer k-th powers. Since 
a can be any sufficiently large positive integer, the set N \ S, is infinite. 


REFERENCES 


1. T. Estermann, On Waring’s theorem: A simple proof of a theorem of Hua, Sci. Rep. Nat. Tsing 
Hua Univ., 5A (1948) 226-239. 

2. R. C. Vaughan, The Hardy-Littlewood method, Cambridge Tracts in Mathematics, 80 (1981) 
specifically Chapter 2. 


No other solutions were received. 


Cyclic Permutations of the g-adic Digits 


6601 [1989, 454]. Proposed by Sajal K. Das, University of North Texas, Denton, 
TX and the late Patrick O’ Hara, University of Central Florida, Orlando. 
Given positive integers q > 2 and k > 1 let 7, be the permutation of the set 
{0,1,2,...,q* — 1} defined by 
m,(i) = qi — (q* — 1)| qi/(q* — 1)| ifi = 0,1,2,...,q* — 2, 
(1) =ijifi=q* —1. 
If 7, is factored into disjoint cycles, determine the cycle lengths which occur and 
the number of cycles of each length. 


Solution by B. M. M. de Weger, Universiteit Twente, Enschede, The Netherlands. 
Let 0 <i <q* — 2 have the q-adic expansion 


i=rot+¢nqtrng?+-:: +r,_14*, 
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with ro, 11,---5%,—1 © {0,1,...q — 1}, not all equal to q — 1. Note that 
Gi=rpytrog tra? tes: tr, 2g"! + ria" — 1), 
and by 


O<r,_,+1roq + rq? tose +r, .q*—' <(q- 1 t+qtq? tert q*—') 
= q* —] 
it follows that 


| gi/(q* _ 1)| = Feey: 
Hence 


(i) — ql _ (q* —_ 1)r,_, = Ve + rod + rq? fee 4 r, oq’. 


Thus the mapping i — 7,(i) can be described as a cyclic permutation of the q-adic 
digits (ry,7,,.-.,7,_,) of i. Note that this is also true for i = q* — 1. 

It follows at once that Th is the identity, so that cycle lengths can only be 
divisors of k. Let d|k, and let c(d) denote the number of cycles of length d. If i is 
in a cycle of length d, then its q-adic digits consist of the block (79, 7,,...,7r4_1) 
repeated k/d times, and there are just d numbers 7 in the same cycle. The 
number of blocks (79, r,,..., 7 _1) is equal to q“; they correspond to cycles whose 


length is a divisor of d. Thus we obtain the formula 


gq’ = Yi d'c(d’). 


d'\d 


The Mobius Inversion Formula followed by division by d yields for every divisor d 
of k (including 1 and k itself), the following expression for c(d): 


1 | 
c(d) = 7 du d')qe/*. 
d'|d 


Here w is the well known Mobius p-function. 


Editorial comment. In a similar proof, O. P. Lossers avoided Mobius inversion 
by a direct application of the inclusion-exclusion principle. Stephen M. Gagola, Jr. 
pointed out that c(d) is also the number of irreducible polynomials of degree d 
over GF(q) when q is a prime power. A. A. Jagers asserted that 7, is equivalent 
to a permutation 7, with 7,(s) = st for all s © S, where S is the set of all 
complex roots of the polynomial X7 — X. The expression c(d) is also given 
various interpretations in Dickson’s History of the Theory of Numbers (vol. I, pp. 
84—86). The solution printed above was chosen by an editor fond of q-adic 
expansions. Shorter solutions were provided by a number of readers. 

Solved also by R. J. Chapman (United Kingdom), Stephen M. Gagola, Jr., A. A. Jagers (The 


Netherlands), Marcin E. Kuczma (Poland), O. P. Lossers (The Netherlands), Yoshio Mimura (Japan), 
John Henry Steelman, Ondrej Such (Czechoslovakia), and the proposers. 


176 PROBLEMS AND SOLUTIONS [February 


Roots on the Unit Circle 


6602 [1989, 454]. Proposed by Allen R. Miller, Naval Research Laboratory, 
Washington, D.C. 


Suppose 0 <k, < 1,0 <k, < 1, and ¢ is real; if k, = k, = 1, suppose further 
that ¢ is not a multiple of 277. If 


a =e'?(e'* — kik), 
B=ky+k3 — 2k,k,e', 
y = (ki +k} —k,k, cos — 1), 
prove that the roots of the self-inversive quartic equation 
azi'+ Bze+yz*7+Bz+a=0 
have absolute value one. 


Solution by David Secrest, Princeton University, Princeton, NJ. Call the polyno- 
mial on the left F(z). We shall set z = e“°~-* and show that 


f(0) = F(z)/(2z*) 
is zero for four values of 6, counting multiplicities. Now 
f(@) = A(@) + A(8), 
where 
2A(0) = (1 — kyk,e'*)e* + ((k? + k3)e' — 2k, kp )e” 
+ki+k3—-1-—k,k,cos 4, 
Ne) 
f(0) = cos20 — k,k, cos(20 — p) + (k? + k5)cos(0 — ¢) 
—2k,k,cos0+k?+k3—1-—k,k,cos¢ 
cos20 — 1+ (ki? + k3)(1 + cos(@ — @)) — 2k,k, cos 0 
— 2k,k, cos(@ — ¢)cos 6 
= —2sin?6 + (k? +k} — 2k,k,cos6)(1 + cos(@ — #)). 
We claim that it suffices to establish the result for 0 < @ < 7. If 7 < ¢ < 27, 
substitute @ = 277 — ¢' and 0 = 277 — @’. This gives the same expression with ¢’ 
and 6’ in place of ¢ and @, but with 0 < @’ < 7. We take the four solutions for 6’ 
and substitute them back to get the four solutions for 6. 
Now define 6) by 0 < 6, < 7/2 and cos 0, = k,k,. We first consider the case 


in which k, #k, and ¢ is distinct from 0,77, and 0). Observe that the following 
inequalities are valid: 


f(0) =(k, —k,)°(1 + cos) > 0, (1) 
f(@)) < —2sin? 6) + 2(k? + k2 — 2k,k, cos 6,) 

= -2(1 - k7)(1— k3) <0, (2) 

f(a) = (k, + k,)°(1 — cos @) > 0, (3) 


f(7 + 6) = -2sin’*(7 +) < 0. (4) 
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Since f(@) is continuous and periodic, there must be four distinct zeros interlaced 
with 0, 0), 7, 7 + , and 277. Also, if d = A, we still have f(@,) < Oif k, # 1 and 
k, # 1: the strict and nonstrict inequalities of (2) simply interchange. 

To complete the proof, recall that the roots (and hence the absolute values of 
the roots) of a monic polynomial depend continuously on its coefficients. Thus for 
d& # 0 we easily deduce the result by continuity even if k, = 1,k,=1,k, =k, or 
b= TT. 

If @¢ = 0 then 

F(z) = (24+ 1)°((1 — kyka) 2? + (k? + k3 — 2)2 + (A — ky ky)). 


Here the roots are —1,— 1,+ 1,+ 1 if k, =k,. Otherwise it is easy to show that 
there is a pair of complex conjugate roots, each of which is the reciprocal of the 
other. Hence each has absolute value one, and the proof is completed. 


Editorial comment. No simple factorization of F(z) was found in general, but 
Secrest did observe that for k, = 1, k, = cos 6), and ¢ = 6, we have 


F(z) = i(sin 6,)(z — 1)*(eoz? + i(sin 0,)z — e-'*), 
and that fork, =k,=k we have 
F(z) = e'*(z —e*)'G(2), 
where 


G(z) = {(e'?7? +k)zt+e "7 +ky((ee”? —k)z +e — ky}. 


No other solutions were received. 


REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


The Mathematics of Projectiles in Sport. By Neville de Mestre, Australian Mathe- 
matical Society Lecture Series 6. Cambridge University Press, 1990. xi + 175 pp. 


Victor G. SZEBEHELY 
Department of Aerospace Engineering and Engineering Mechanics, University of Texas, Austin, TX 78712 


Past. I have heard many times that good ideas might emerge when the mind is 
allowed to wander. So I stopped reading this book and suddenly found myself in 
high school listening to two of my physics teachers. I was 16 when one of my 
teachers told us to concentrate on the basic principles in mechanics and dynamics 
and have fun applying these to practical problems. The following year, another 
teacher forced us to memorize equations by numbering them and the examination 
went like this. Teacher: “Write equation 16.” Student: “I am not sure if it is 
v = ¥2gh ors = (g/2)t*.” Teacher: “Your second guess is correct. You get a C.” 

Today I feel sorry for the second teacher and J am glad that he did not succeed 
in making me hate mechanics. I am grateful for having the first teacher who made 
me fall in love with dynamics and whom | relate to Professor de Mestre. 

Then my mind went to the first year’s first semester mechanics classes at the 
University where only those problems were discussed which could be solved, 
usually by some specific trick. 

If you knew the trick you got an A, if not you were in trouble. I know many 
professors who emphasize tricky problem-solving instead of the principles. 

The second semester offered dynamics and the professor told us that he would 
not concentrate on solvable problems since they are boring but would apply the 
basic methods and principles to real problems. This was an intellectual: challenge 
which I could not resist and I still jump on “unsolvable and unsolved problems.” 


Present. At this point my mind stops wandering and I continue reading 
de Mestre’s jewel. He teaches mechanics in a clear, understandable language to 
some very lucky students whose reaction is to fall in love with dynamics. “‘How can 
dynamics be that straightforward and almost too easy?” My mind wanders off 
again recalling my students whose response to most problems is to run to their 
computers. So I ask them if their answers are correct? How to verify them? Why 
not use some approximations which allows them to get a quick (no-computer) 
answer? And if x is small, why not write x/2 + 1 for yx +1? 

Well, Professor de Mestre tells about approximate solutions of exact equations. 
He explains the technique of perturbations which, in his competent hands, be- 
comes an undergraduate subject. And he shows graphical integration which helps 
the reader to understand what is really going on. 

When he goes to the concept of mathematical modelling of real problems he 
addresses engineers who make their living in this way. Which terms shall we keep 
and which neglect? How will approximate modelling influence the results? 
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Our limitations regarding analytical solutions versus numerical integration teach 
us honesty and offer realistic evaluations of our basic problems. Professor 
de Mestre calls attention to problems not yet treated in the literature, which puts 
the book on a graduate level, offering subjects for theses or dissertations. 

It is often asked, when text books are reviewed, “for whom is the book 
written”? This is probably the easiest question to answer regarding de Mestre’s 
book: for everybody. The undergraduate student will learn how to attack problems 
in mechanics, his professor will learn how to teach mechanics. The graduate 
student will understand the technique of approximations and his professor will 
learn about unsolved problems. Sports enthusiasts will not be able to resist the 
charts and tables and they might even conclude that it is often easier to compute 
the trajectory of a football than to kick it with a required speed and direction. 

The author states that “The aim of the current book is to present a unified 
collection of the many problems that can be tackled and of all the mathematical 
techniques that can be employed.” Here he shows his bravery, since “all the 
mathematical techniques” expand and change every day. 

I feel that the aims of this book are to separate the dilettanti from the 
cognoscenti, to show how to do mechanics well, to explain what mechanics and 
dynamics are about and, last but not least, how to say it in a beautifully short book 
satisfying the dictum that less is more. The profession is indebted to Professor 
de Mestre for his delightful book which treats a special set of dynamical systems of 
considerable interest. 


A Transition to Advanced Mathematics. By Douglas Smith, Maurice Eggen, and 
Richard St. Andre. Third Edition. Brooks/Cole, Pacific Grove, CA, 1989. 
xlii + 285 pp. 


Foundations for Advanced Mathematics. By Carol Avelsgaard. Scott, Foresman 
and Company, Glenview, IL, 1990. vili + 325 pp. 


Introduction to Advanced Mathematics. By William Barnier and Norman Feldman. 
Prentice Hall, Englewood Cliffs, NJ, 1990. xv + 320 pp. 


JUDITH PALAGALLO 
Department of Mathematical Sciences, University of Akron, Akron, OH 44325 


Mathematics has been described as a discipline in which early courses give little 
indication of what will be required in the future and reveal nothing of what is 
required of a practicing mathematician. During the last few years renewed interest 
has been shown in “bridging the gap” between the traditional calculus course and 
more advanced work such as abstract algebra or real analysis. One might ask why 
this gap exists. The gap has become more prevalent over the years because of the 
nature of the calculus courses. The theoretical content and rigor of the courses has 
diminished and the emphasis is placed on computational skills and techniques. 
Linear algebra has been the course in which the student learns methods of proof 
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and is introduced to another level of mathematical abstraction. Many times, 
however, the linear algebra course is also reduced to a methods and skills type of 
mathematics course. Thus students still have little exposure to the world of 
mathematics before they enter an analysis or algebra class. where they are asked to 
function like budding mathematicians. 

These textbooks make a good attempt at putting together topics needed by 
students but generally not taught in any other traditional course in an undergradu- 
ate curriculum. Each requires the students to have some calculus background. The 
similarities of the books in content and perspective perhaps indicate that we are 
precisely aware of what has been lacking in the training of our students. Topics 
covered by all three textbooks include fundamentals of logic, methods of proof, 
sets, relations, functions and some counting techniques—material which is indis- 
pensable to any mathematician. Beyond these topics the authors diverge into 
various applications of the principles presented earlier. 

The goals of the books are two-fold. First, they desire to provide a firm 
foundation in the major ideas needed for continued work in mathematics, drawing 
on the fundamental relationship between logic and mathematics. Second, they 
intend to guide the students to think and to express themselves mathematically. In 
building the foundations, the books differ. Barnier and Feldman discuss the 
propositional calculus in the most sophisticated manner, using quite formal nota- 
tion. They emphasize symbolic logic before discussing its relationship to methods 
of proof. The other two books integrate mathematical statements into their 
examples of logical statements from the beginning, and the presentations are less 
formal. After the discussions of formal logic, all the books present the standard 
proof techniques. Good examples of uncomplicated proofs concerning elementary 
number theory, geometry or beginning calculus are presented. My students prefer 
the number theory proofs and find any proof involving the concepts of limit or 
continuity much more difficult. However, it is these proofs that provide the 
“bridge” we are seeking to build in their mathematical education. 

The real test of understanding the methods of proof is whether students can 
apply the methods to other topics fundamental in their background. As Paul 
Halmos has stated, ‘‘“Every mathematician agrees that every mathematician must 
know some set theory.” Our students are no exception, so a detailed discussion of 
set theory in this course is appropriate. Students are acquainted with sets, but their 
understanding of set theory is very limited. The next step is to have students 
practice their newly acquired skills in formal statement-proving by establishing 
results about sets. Arbitrary families of sets, indexed sets and power sets are new 
topics and require a deeper level of understanding by the students. Similar 
remarks could be made about the study of relations and functions. Equivalence 
relations and inverse images of functions are new topics of some mathematical 
sophistication. They are also examples of topics which help in the transition from 
calculus to upper division courses. 

I was particularly interested in how each book handled the topic of induction. 
All students at this level have previously worked with elementary induction 
problems, but this course should provide an opportunity to relate induction to 
more appealing properties of the natural numbers. Barnier and Feldman, and 
Avelsgaard. discuss induction among other proof techniques. They use a rather 
traditional approach, requiring the students to verify that various equations are 
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true for the natural numbers. Both books do give at least one example of a physical 
situation which must be modeled by an inductive statement. While these problems 
are difficult to formulate, they are the ones which give insight into the real nature 
of the inductive process. Students need more problems presented in such a way. 
This type of problem requires the student to formulate the inductive statement and 
verify it with the principle of mathematical induction. Smith, Eggen and St. Andre 
introduce the topic in a chapter on sets. They take a somewhat different approach 
by integrating the discussion of induction with the Well Ordering Principle and 
even the Division Algorithm for the natural numbers. This approach gives the 
students a look at some real topics in mathematics at a higher level of abstraction. 
It also demonstrates how a mathematician might discover relationships between 
topics. 

All three books are well written and each contains features worth noting. The 
Avelsgaard book contains interesting bibliographic references at the end of each 
chapter, most of which are accessible to the undergraduate student. The books by 
Smith, Eggen and St. Andre, and Barnier and Feldman both have “proofs to 
grade” in the exercise sets of the chapters. These are lists of claims with alleged 
proofs which the students are to evaluate. They expose the students to different 
styles of writing proofs and develop in the students the ability to read proofs 
critically. The Barnier and Feldman book has a collection of good supplementary 
exercises which should not be overlooked. The book by Smith, Eggen and 
St. Andre ends with three chapters of more advanced mathematics including a 
discussion of the Axiom of Choice, some group theory and a few important topics 
from real analysis. Similarly, Barnier and Feldman conclude with four chapters on 
cardinality, Boolean algebra, the structure of the integers, limits and real numbers. 
These chapters supply a rich source of examples of the previous material, but they 
exceed the level of rigor of the earlier chapters. I also question whether there 
would ever be time in one term to touch on such topics. 

Few of the students who take such a course will become professional mathe- 
maticians. Nonetheless, the understanding of the logical structure of the language 
of mathematics and the ability to apply deductive reasoning will be valuable assets 
to aspiring computer and physical scientists and certainly to those who will 
eventually teach mathematics. Judging from these books, and others like them, 
some good attempts are being made to aid students in making the transition from 
calculus to more advanced mathematics courses. 
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General, C. Derive: A Mathematical Assistant Pro- 
gram, Third Edition. Albert Rich, Joan Rich, David 
Stoutemyer. Soft Warehouse, 1988, vi + 126 pp, (P). 
Derive is a computer algebra system that runs on 
IBM PC’s and compatibles. Major changes between 
Version 1.22 and Version 1.6 include greater support 
for multivariable calculus, the addition of a large col- 
lection of special functions, improvements to the an- 
tidifferentiation algorithm, functions for solving ordi- 
nary differential equations, and functions for Fourier 


series approximations and Laplace transformations. 
AO 


General, T(14-15). Foundations of Higher Mathe- 
matics: Exploration and Proof. Daniel Fendel, Diane 
Resek. Addison-Wesley, 1990, xviii + 457 pp, $32.76. 
[ISBN: 0-201-12587-0] A book whose aim is to 
bridge the gap between calculus and upper-division 
courses. ‘Topics covered include logic, set theory, 
methods of proof, basic number theory, equivalence 
relations, order, cardinality, limits and continuity, 
and elementary group theory. JPH 


General, S, L. Innumeracy: Mathematical Illiter- 
acy and Its Consequences. John Allen Paulos. Pen- 
guin Books, 1988, 135 pp, (P). [ISBN: 0-14-012255- 
9} British paperback reprint of Paulos’ 1988 sur- 
prise U.S. bestseller (TR, April 1989; Extended Re- 
view, January 1990). Innumerable anecdotes reveal 
appalling public innumeracy, often with serious con- 
sequences for personal decisions. LAS 


General, P. Chairing the Mathematical Sciences 
Department of the 1990s. National Research Coun- 
cil, 1990, ix + 97 pp, (P). Presentations (with 
audience comments and questions) from the Octo- 
ber 1989 meeting of mathematics department chairs 
sponsored by the Board on Mathematical Sciences 
of the National Research Council: dealing with ad- 
ministrations; funding opportunities and strategies; 
curricular issues; statistics programs. Speakers deal 
primarily with topics relevant to research universi- 
ties. LAS 


Elementary, T(13: 1). Fundamentals of Trigo- 
nometry, Second Edition. Charles D. Miller, Mar- 


garet L. Lial, David I. Schneider. Scott Foresman, 
1990, xiv + 418 pp, $33. [ISBN: 0-673-38961-8] Ba- 
sic trigonometry text. Key features include an early 
discussion of circle functions, emphasis on calcula- 
tor usage throughout, and chapters on complex num- 
bers, exponential functions, and analytic geometry. 
Provides a set of computer-assisted tutorials and ap- 
proximately 50 transparencies as supplements. MPR 


Mathematics Appreciation, T(13-14). A Math- 
ematics Sampler: Topics for the Liberal Arts, Second 
Edition. William P. Berlinghoff, Kerry E. Grant. 
Ardsley House, 1988, xiii + 437 pp, $33.95 (P). 
{ISBN: 0-912675-18-7] A book with genuine math- 
ematical content in each area sampled: number the- 
ory, geometry, probability and statistics, computers 
(Basic programming), infinity (countability), groups, 
the fourth-dimension. Each chapter closes with the 
subheading LINK that describes an area of applica- 
tions. Worth consideration, but a bit unwieldly with 
437 pages sized 8 1/2 x 11. AWR 


Mathematics Appreciation. Goop & Gook or 
1.f(math) = laughter. Paul Yearout. Sympun Pub- 
lishing (2174 N. 250 E., Provo, UT 84604), 1990, 64 
pp, $5 (P). A slim collection of mathematical puns, 
rebuses, and old saws like sinz/n = 6. Possibly use- 
ful as a source of humorous filler for departmental 
newsletters. BC 


Education, P*. Making Mathematics Work for 
Minorities: Framework for a National Action Plan 
1990-2000. National Research Council, 1990, 24 pp, 
(P). Highlights of issues and recommendations that 
emerged from an ril 1990 national symposium 
sponsored by the Mathematical Sciences Education 
Board. Introduces the agenda of the new Alliance to 
Involve Minorities in Mathematics (AIMM). LAS 


Education, S, P, L. Assessing Higher Order Think- 
ing in Mathematics. Ed: Gerald Kulm. AAAS, 
1990, 209 pp, $24.95 (P). [ISBN: 0-87168-356-3] 
Twelve papers analyzing new approaches to assessing 
school mathematics. Topics include problem solv- 
ing, computer-based testing, cognitive perspectives, 
the role of calculators, and quantitative reasoning. 
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A useful and timely elaboration of issues raised in 
the NCTM Standards which will become increasingly 
important as the U.S. seeks to measure progress in 
school mathematics. LAS 


Education, S, P, L*. Constructivist Views on the 
Teaching and Learning of Mathematics. Eds: Robert 
B. Davis, Carolyn A. Maher, Nel Noddings. Journal 
for Research in Math. Edu., No. 4. NCTM, 1990, ix 
+ 210 pp, $9.50 (P). [ISBN: 0-87353-300-3] Twelve 
papers delving into the evidence for constructivism— 
the notion that children construct mathematical 
knowledge for themselves—and its impact on teach- 
ing and learning. One corollary: as a “right answer” 
is no evidence of correct work, so a correctly written 
algorithm is no evidence of appropriate understand- 
ing. Much emphasis is placed on the importance of 
developing a “mathematical community” to sustain 
important emotional factors such as persistence, self- 
confidence, and trust that are necessary for students 
to succeed in mathematics. LAS 


Education, P, L*. Changing the Odds: Factors In- 
creasing Access to College. Sol H. Pelavin, Michael 
Kane. College Entrance Examination Board, 1990, 
ix + 85 pp, (P). [ISBN: 0-87447-396-9] Longitudi- 
nal analysis of data from 16,000 members of the high 
school class of 1982 to determine factors that cor- 
relate with college attendance and graduation. Key 
finding: 80% of students who take high school geom- 
etry go to college, regardless of ethnic background. 
No other variable in the study is as effective as ge- 
ometry in leveling this statistical playing field. Of 
course, only correlation is documented, not causa- 
tion. Nonetheless, this report merits close study 
since it conveys a powerful message. LAS 


Education, P*. Mathematical Challenges for the 
Middle Grades From the Arithmetic Teacher. Ed: 
William D. Jamski. NCTM, 1990, v + 48 pp, $6.25 
(P). [ISBN: 0-87353-296-1] Includes 125 problems 
that appeared either “in form or concept” in the 
Arithmetic Teacher during the last fifteen years. In- 
cludes references to the articles that contained these 
problems and to other relevant articles. JNC 


History, C**, L**. Glory and Failure. Michael 
Lindgren. Transl: Craig G. McKay. MIT Pr, 1990, 
414 pp, $45. [ISBN: 0-262-12146-8] Social and tech- 
nical analysis of the first attempts to mechanize the 
production of numerical tables. History of the differ- 
ence engine from its conception by Johann Miller in 
1784 to the efforts of Charles Babbage (1791-1871) 
in the years 1823-1833 to the eventual production in 
1843 by Georg Scheutz (1785-1873) and his son Ed- 
vard Scheutz (1821-1881). Tells why Babbage failed 
technically and why the Scheutzes failed commer- 
cially. Substantial technical descriptions. Superb 
bibliography of original source material. JDEK 


History, S, L*. The Last Problem. E.T. Bell. MAA, 
1990, 326 pp, $17.50 (P). [ISBN: 0-88385-451-1] A 
casual, selective, idiosyncratic history of the cultures 
that gave birth to Fermat’s Last Theorem, from 
Babylonia to seventeenth century France—the age 
of Fermat. Numerous digressions on remote topics 
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(e.g., Cleopatra’s dalliances; Louis XIV taxes) re- 
mind us of Bell’s reputation for historical fiction. 
Originally published in 1961, this edition is enveloped 
by a preface and contemporary notes by editor Dud- 
ley. The ratio of entertainment to enlightenment in 
this delightful volume might set a record for mathe- 
matics books. LAS 


Logic, T(17-18: 1, 2), P. Model Theory, Third 
Edition. C.C. Chang, H.J. Keisler. Stud. in Logic & 
Found. of Math., V. 73. North-Holland (US Distr: 
Elsevier Science), 1990, xvi + 650 pp, $118. [ISBN: 
0-444-88054-2] The Third Edition of this, the orig- 
inal and still standard introductory text in the field, 
retains its emphasis on classical first-order model 
theory, which “remains a prerequisite for all of the 
newer branches of model theory.” Changes include 
expanded treatment of model completeness; intro- 
duction to recursively saturated models; added exer- 
cises. (First Edition, TR, August-September 1974.) 
RB 


Foundations, P. Categories, Allegories. Peter J. 
Freyd, Andre Scedrov. Math. Lib., V. 39. North- 
Holland (US Distr: Elsevier Science), 1990, xviii + 
296 pp, $30.75 (P); $82. [ISBN: 0-444-70367-5; 0- 
444-70368-3] Category theory with emphasis on its 
geometric nature and connections to mathematical 
logic, via logoi (generalizations of the Heyting alge- 
bras which replace Boolean algebras in intuitionistic 
logic), topoi (categories modeled on the properties of 
the category of sets), and allegories (which are to the 
theory of binary relations and sets what categories 
are to functions and sets). RM 


Foundations, T(14), L. Introduction to Advanced 
Mathematics. William Barnier, Norman Feldman. 
Prentice Hall, 1990, xv + 320 pp. [ISBN: 0- 
13-477084-6] An appropriate text for “Logic and 
Proof” courses offered at the sophomore level before 
real analysis or abstract algebra. The intent is to give 
students an introduction to the structure of theorems 
and proofs in a more user-friendly environment. The 
first half of the book consists mostly of the tools of 
proving theorems; logic, methods of proof, set theory, 
etc. The second part consists of topics to which the 
student may apply these newly-developed skills; car- 
dinality, boolean algebra, the integers, and limits and 
real numbers. This appears to be a well-constructed 
and useful text. MPR 


Foundations, T(17-18: 1, 2), P. Abstract and 
Concrete Categories: The Joy of Cats. JitiAdamek, 
Horst Herrlich, George E. Strecker. Wiley, 1990, xii 
+ 482 pp, $54.95. [ISBN: 0-471-60922-6] A high- 
level compendium of definitions and theorems about 
the language called category theory intended as both 
advanced text and reference. JAS 


Foundations. Lecture Notes in Mathematics-1432: 
Recursion Theory Week. Eds: K. Ambos-Spies, G.H. 
Miller, G.E. Sacks. Springer-Verlag, 1990, vi + 393 
pp, $37 (P). [ISBN: 0-387-52772-9] Selection of pa- 
pers from the Second Oberwolfach Conference (1989) 
on recursion theory, which gives an overview of cur- 
rent work in recursion theory. RM 
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Graph Theory, S(17-18), P. Fundamentals of 
Graph Theory. Alexander A. Zykov. ‘Transl: L. 
Boron, C. Christenson, B. Smith. BCS Associates, 
1990, vi + 365 pp, $48 (P). [ISBN: 0-914351-04-4] 
Translation with corrections and supplements of a 
Russian volume which is a recast version of author’s 
Theory of Finite Graphs, the first Soviet graph the- 
ory text. Contains descriptive introductions to clas- 
sical and contemporary topics, developing many to 
research level. Interesting culturally due to differing 
language, choice of emphasis, exercises, and many 
valuable references to Soviet research articles (usu- 
ally with MR or Zbl reference). JPH 


Graph Theory, T*(14-16: 2), S, L*. Pearls in 
Graph Theory: A Comprehensive Introduction. Nora 
Hartsfield, Gerhard Ringel. Academic Pr, 1990, ix + 
246 pp, $29.95. [ISBN: 0-12-328552-6] Innovative, 
introductory text beginning with elementary con- 
cepts covers most traditional topics (e.g., colorings, 
Eulerian and Hamilton cycles, trees, algorithms), but 
also has clear exposition of unusual and more ad- 
vanced topics, like the Oberwolfach Problem, infinite 
graphs, and magic graphs. Especially good chapters 
on planar graphs and their colorings, graphs on sur- 
faces, Map Color Theorem, and Heawood’s Empire 
Problem. Develops material to substantial level, in- 
cluding open conjectures. JPH 


Graph Theory, T(17-18: 2), P. Eulerian Graphs 
and Related Topics, Part 1, Volume 1. Herbert Fleis- 
chner. Annals of Disc. Math., V. 45. North-Holland 
(US Distr: Elsevier Science), 1990, xiii + 487 pp, 
$94.75. (ISBN: 0-444-88395-9] Intended for a broad 
range of readers, including undergraduate students 
and researchers. Can be read by a mathematician 
not yet familiar with graph theory, but will also be 
of interest to researchers since it contains many re- 
cent results and partial solutions to some unsolved 
problems. LC 


Graph Theory, S(18), P. Computational Graph 
Theory. Eds: G. Tinhofer, et al. Computing Sup- 
plementum, V. 7. Springer-Verlag, 1990, 282 pp, $79 
(P). {ISBN: 0-387-82177-5] Contains papers edited 
by G. Tinhofer, E. Mayr, H. Noltemeier, and M. 
Syslo on computational techniques applied to graph 
theoretical problems. Includes planar graphs, path 
problems, coloring problems, orders and graphs, data 
structures, probabilistic behavior of algorithms, par- 
allel algorithms, VLSI-structures. AD 


Combinatorics, T(16). Introduction to Lattices 
and Order. B.A. Davey, H.A. Priestley. Cam- 
bridge University Pr, 1990, viii + 248 pp, $18.95 (P); 
$49.50. (ISBN: 0-521-36766-2; 0-521-36584-8] Cov- 
ers ordered sets, lattices, Boolean algebras, and some 
applications. Particular attention paid to distribu- 
tive lattices. Minimal prerequisites are group theory 
and discrete mathematics. Plentiful exercises and 
supplemental material suitable for beginning gradu- 
ate students. AD 


Discrete Mathematics, T(13-14: 2), L. Applied 
Discrete Structures for Computer Science, Second 
Edition. Alan Doerr, Kenneth Levasseur. Macmil- 
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lan, 1989, xxiv + 654 pp. [ISBN: 0-02-329934-7] 
This text acquaints students with the wide array 
of topics in a standard discrete structures course, 
and develops mathematical maturity to a level be- 
low that of transition to higher mathematics. “The 
main change [in this edition] is the addition of a large 
number of exercises at all levels.” (First Edition, TR, 
December 1986.) RB 


Discrete Mathematics, P. Mathematical Prelimi- 
naries for Computer Networking. David Claiborne. 
Wiley, 1990, x + 180 pp, $32.95 (P). [ISBN: 0- 
471-51062-9] A non-rigorous survey of mathemat- 
ical techniques and methods used in data communi- 
cation protocols, coding schemes and network con- 
figurations, by a consultant with electrical engineer- 
ing background, intended for programmers. Assumes 
college algebra and some calculus. Chapters on prob- 
ability, information theory, graph theory, queueing 
theory, finite state machines, and advanced coding 
techniques. RB 


Number Theory, P*. Number Theory, Volumes 
I-IT. Eds: K. Gyéry, G. Haldsz. North-Holland (US 
Distr: Elsevier Science), 1990, $179.50 set, [ISBN: 
0-444-70476-0]. Volume I: Elementary and Analytic, 
573 pp; Volume II: Diophantine and Algebraic, 495 
pp. Proceedings of the Janos Bolyai Mathematical 
Society Conference on Number Theory in Budapest 
from July 20-25, 1987. Contains most, but not all, of 
the papers presented at the conference. A compre- 
hensive picture of current research in the field. CEC 


Number Theory, S*, P, L. Number Theory. Na- 
tional Research Council, 1990, vii + 51 pp, (P). 
Three superb expositions of key ideas of contempo- 
rary number theory presented at a public symposium 
held in conjunction with National Science Week at 
the National Academy of Sciences in May 1989: H.W. 
Lenstra, Jr. on number theory applied to primality 
testing and cryptology; B. Mazur on number theory 
as a “gadfly”—a constant source of deep problems; 
and A.M. Odlyzko on relations between quantum 
chaos and the Riemann hypothesis. Being written 
for a general audience, these timely surveys can be 
tread by any mathematician. LAS 


Number Theory, P. Lecture Notes in Mathemat- 
ics-1434: Analytic Number Theory. Eds: K. Na- 
gasaka, E. Fouvry. Springer-Verlag, 1990, vi + 218 
pp, $22 (P). [ISBN: 0-387-52787-7] Seventeen pa- 
pers from a Tokyo conference in 1988. BC 


Linear Algebra. Matriz Perturbation Theory. 
G.W. Stewart, Ji-guang Sun. Comput. Sci. & Sci- 
entific Comput. Academic Pr, 1990, xv + 365 pp, 
$49.95. [ISBN: 0-12-670230-6] Begun as the trans- 
lation of a book in Chinese by Sun, this new version 
emerged. Attention ts given to the solution of lin- 
ear systems, least squares problems, and eigenvalue 
problems. A quote from the preface deserves inclu- 
sion: “We debated whether to include numerical ex- 
amples. Our decision not to was directed by the in- 
creasing availability of interactive systems that ma- 
nipulate matrices. With these systems it is a small 
matter to play with a perturbation bound, watching 
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it perform under a variety of circumstances. Besides 
such lively exercises, a printed example is a thing of 
lead and stone.” AWR 


Linear Algebra, T(14: 1). Matrices and Lin- 
ear Transformations, Second Edition. Charles G. 
Cullen. Dover, 1990, xii + 318 pp, $7.95 (P). 
[ISBN: 0-486-66328-0] A corrected republication of 
the 1972 Second Edition originally published by 
Addison-Wesley (TR, August-September 1972). An 
attractively priced alternative which covers the stan- 
dard topics. (1966 Addison-Wesley text, TR, March 
1967; Extended Review, August-Septembez 1968.) 
JNC 


Linear Algebra, T(14-15: 1). Linear Algebra 
with Applications, Third Edition. Steven J. Leon. 
Macmillan, 1990, xv + 458 pp. [ISBN: 0-02-369821- 
7] The most notable change in this edition is a 
new appendix containing computational exercises to 
be solved using MATLAB. Also includes a few new 
theorems, exercises, examples, applications, and fig- 
ures. (First Edition, TR, February 1982; Second Edi- 
tion, TR, June-July 1986.) JNC 


Group Theory, T(16-17: 1), L. Presentations of 
Groups. D.L. Johnson. London Math. Soc. Stud. 
Texts, V. 15. Cambridge University Pr, 1990, 204 
pp, $49.50; $16.95 (P). [ISBN: 0-521-37824-9; 0-521- 
37203-8] A textbook introducing selected topics in 
combinatorial group theory, representing an option 
for an advanced undergraduate algebra course. Pre- 
sumes knowledge of abstract algebra and linear al- 
gebra. Much revised from previous (1976 and 1980) 
editions (TR, May 1976). New chapters on Nielsen’s 
method, relation modules, quotient computation al- 
gorithms. Generous assortment of exercises. RB 


Algebra, T*(15: 1), S, L*. Abstract Algebra, Sec- 
ond Edition. I.N. Herstein. Macmillan, 1990, xix + 
293 pp. [ISBN: 0-02-353822-8] The choice of con- 
tent in this excellent treatment of the subject re- 
mains the same in this edition. Errors have been 
corrected and some minor rewriting has been done 
to clarify and expand some of the discussion. (First 
Edition, TR, November 1986; Extended Review, Oc- 
tober 1987.) CEC 


Algebra, P. General Algebra 1988. Ed: Rainer 
Mlitz. North-Holland (US Distr: Elsevier Science), 
1990, 265 pp, $84.50. [ISBN: 0-444-88617-6] Pro- 
ceedings of a conference in memory of Wilfried 
Nobauer in Krems, Austria in August 1988. Includes 
two survey papers, one on the work of Nobauer, 
and eighteen previously unpublished papers from the 
elghty conference lectures. JPH 


Calculus, T**(13-14: 3). Calculus Two: Linear 
and Nonlinear Functions. Francis J. Flanigan, Jerry 
L. Kazdan. Undergrad. Texts in Math. Springer- 
Verlag, 1990, xvii + 619 pp, $39.95. [ISBN: 0-387- 
97388-5] Second Edition—clarity enhanced through 
substantial revisions and major additions by users 
of First Edition, Fristedt, Gray, and Frank of the 
University of Minnesota (TR, December 1971). Re- 
tains strong geometric flavor. Linear algebra set- 
ting for a study of linear maps. Clusters of help- 
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ful remarks throughout. Delightful extras, interest- 
ing exercises, and “challenging problems.” Thought- 
encouraging phrasing of non-routine problems is a 
plus. Somewhat sophisticated material presented in 
a non-threatening manner. A book that grows on the 
reader. Prerequisites include single-variable calculus 
and a bit of mathematical maturity. JDEK 


Real Analysis, P. Hardy-type Inequalities. B. Opic, 
A. Kufner. Longman Scientific & Technical (US 
Distr: Wiley), 1990, 333 pp, $44 (P). [ISBN: 0-582- 
05198-3] Lecture notes concerning what conditions 
(on the domain, weight functions, etc.) insure that 
Hardy-type inequalities hold for certain classes of 
functions. MLR 


Complex Analysis, T(16-18: 1), S*, P, 
L*. Complez Analysis: The Geometric Viewpoint. 
Steven G. Krantz. Carus Math. Mono., No. 23. MAA, 
1990, xv + 212 pp, $23. [ISBN: 0-88385-026-5] An 
inviting sequel (eminently suitable for independent 
study) to a first complex analysis course that intro- 
duces tools from Riemannian geometry, notably the 
Poincaré metric, to shed light on the Schwarz lemma 
and Picard’s and Montel’s theorems. Using minimal 
formalism, the author builds insight based on an ef- 
fective blend of geometric and analytic methods. In- 
troduces the Carathéodory and Kobayashi metrics 
and proves Poincaré’s theorem that there is no bi- 
holomorphic map from the bidisc D? in C? onto the 
ball B. LAS 


Complex Analysis, T(18: 1, 2), S, P. Introduc- 
tion to Holomorphic Functions of Several Variables, 
Volumes I-III. Robert C. Gunning. Wadsworth, 
1990. Volume I: Function Theory, xix + 203 pp, 
$45.95, [ISBN: 0-534-13308-8]; Volume II: Local The- 
ory, xix + 218 pp, $51.95, [ISBN: 0-534-13309- 
6]; Volume III: Homological Theory, xix + 194 pp, 
$51.95. [ISBN: 0-534-13310-X] Based on, but sub- 
stantially revised and updated from the author’s 
1965 text with Hugo Rossi, this is a comprehensive, 
readable, and inviting introduction, at the gradu- 
ate level, to multivariable complex analysis. Volume 
I covers aspects of function theory including holo- 
morphic, pluriharmonic, and plurisubharmonic func- 
tions; pseudoconvexity and domains of holomorphy; 
@ problems and Dolbeault cohomology. Volume II 
covers local theory of analytic functions and analytic 
varieties. Volume JI] treats methods for semilocal 
and global analysis: sheaf cohomology, Cech coho- 
mology, Stein varieties, holomorphic mappings, etc. 
No exercises. PZ 


Complex Analysis, P. Fifteen Papers in Complez 
Analysis. L.A. Aizenberg, et al. AMS Transl., Ser. 
2., V. 146. AMS, 1990, viii + 112 pp, $65. [ISBN: 0- 
8218-3130-5] Translation of papers from the 1985 
Russian volume Multidimensional Complez Analy- 
sis. LAS 


Differential Equations, P. Qualitative Theory of 
Differential Equations. Eds: B. Sz.-Nagy, L. Hat- 
vani. North-Holland (US Distr: Elsevier Science), 
1990, 683 pp, $166.75. [ISBN: 0-444-87407-0] Pro- 
ceedings of the Third Colloquium on the title subject 
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held at the Bolyai Institute in Szeged, August 22-26, 
1988. Papers are devoted to asymptotic properties 
and stability problems in ordinary and functional dif- 
ferential equations. Includes applications to mechan- 
ics, physics, biology, and control. Except for survey 
articles, papers have been refereed and contain com- 
plete proofs. BL 


Differential Equations, T(15). Ordinary Dif- 
ferential Equations: An Introduction to Nonlinear 
Analysis. Herbert Amann. Transl: Gerhard Met- 
zen. Stud. in Math., V. 13. Walter de Gruyter, 
1990, xiii + 458 pp, DM 148. [ISBN: 3-11-011515-8] 
Translated from German and described as a book 
for second-year students, this book is a notch above 
what American students might expect. The second 
motivating example in the book begins on page 12: 
“Let M C R® be an mdimensional C?-manifold 
(e.g., the two-sphere S? in R3), and let ....” Lin- 
ear algebra is a prerequisite, the foundations of the 
calculus of variations are examined, and the author 
explains that he always has in mind in this course 
the theory of evolution equations. AWR 


Partial Differential Equations, P. Harmonic 
Analysis and Partial Differential Equations. Eds: 
Mario Milman, Tomas Schonbek. Contemp. Math., 
V. 107. AMS, 1990, xi + 129 pp, $38 (P). [ISBN: 0- 
8218-5113-6] Proceedings of a conference held April 
4-5, 1988 at Florida Atlantic University. Ten papers 
applying harmonic analysis to a wide variety of prob- 
lems in partial differential equations. BL 


Partial Differential Equations, P. Lecture Notes 
in Mathematics-1431: The Navier-Stokes Equations: 
Theory and Numerical Methods. Eds: J.G. Hey- 
wood, et al. Springer-Verlag, 1990, vii + 238 pp, $22 
(P). [ISBN: 0-387-52770-2] Proceedings of a confer- 
ence at Oberwolfach, September 18-24, 1988. Of the 
23 papers, five address numerical methods. BL 


Numerical Analysis, T(16-17: 1, 2), L. Nu- 
merical Linear Algebra and Optimization, Volume 1. 
Philip E. Gill, Walter Murray, Margaret H. Wright. 
Addison-Wesley, 1991, xvii + 426 pp, $46.25. [ISBN: 
0-201-12649-4] A unified introduction to modern 
numerical techniques for solving linear systems and 
linear least squares problems, for updating matrix 
factorizations, and for linear and nonlinear program- 
ming. AO 

Numerical Analysis, S**. Schaum’s Solved Prob- 
lem Series: 2000 Solved Problems in Numerical 
Analysis. Francis Scheid. McGraw-Hill, 1990, v 
+ 698 pp, $19.95 (P). [ISBN: 0-07-055233-9] 2000 
solved problems in numerical analysis listed under 
different topics, with a brief summary in front of 
each chapter. Introduces a large variety of numer- 
ical approximation methods, from elementary to ad- 
vanced problems, including solving differential equa- 
tions, algebraic systems, polynomials, integrals, and 
series. LS 


Numerical Analysis, S. M-matrices in Numeri- 
cal Analysis. Gunther Windisch. Texte zur Math., 
B. 115. BG Teubner Leipzig, 1989, 140 pp, 17,50 M 
(P). [ISBN: 3-322-00710-3] An easy access to M- 
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matrices theory. Explores the relation between non- 
singular M-matrices and the discretization of second 
order linear elliptic and parabolic problems by finite 
element, finite difference and the methods of lines. 
Excludes convergence analysis and nonlinear prob- 
lems. Reviewers copy was missing several pages. LS 


Numerical Analysis, P. Algorithms for Approzi- 
mation II, Eds: J.C. Mason, M.G. Cox. Chapman 
& Hall, 1990, 514 pp, $99.95. [ISBN: 0-412-34580- 
3] Proceedings of the Second International Confer- 
ence on Algorithms for Approximation at the Royal 
Military College of Science, Shrivenham, UK, July 
12-15, 1988. ‘Ten invited and 33 contributed pa- 
pers concern development of algorithms (approxima- 
tion by splines; polynomials; piecewise polynomials; 
complex methods; interpolation; smoothing and con- 
stant methods) and various applications. Includes a 
very useful, substantial catalogue of algorithms for 
approximation by Eric Grosse. DFA 


Numerical Analysis, T(17-18: 1), P, L. Numer- 
ical Continuation Methods: An Introduction. Eu- 
gene L. Allgower, Kurt Georg. Ser. in Computat. 
Math., V. 13 Springer-Verlag, 1990, xiv + 388 pp, 
$69. (ISBN: 0-387-12760-7] Compares predictor- 
corrector methods of solving nonlinear systems with 
simplicial methods. After a significant introduction 
to both methods, the author makes the argument 
that’ these methods, seemingly disparate, are actu- 
ally quite similar. SM 


Numerical Analysis, T(15-17), S, L*. Numer- 
ical Methods That Work. Forman S. Acton. MAA, 
1990, xx + 549 pp, $16.50 (P). [ISBN: 0-88385-450- 
3] Reprint of the 1970 original edition (TR, January 
1971; Extended Review, January 1974). A common- 
sense approach to fundamental methods that have 
not changed much in twenty years (despite advances 
in computation), with effective examples and sage 
advice to keep novice analysts away from numerical 
quicksand. First half surveys standard problems and 
methods (evaluation, roots, interpolation, quadra- 
ture, differential equations, eigenvalues, Fourier se- 
ries); second half revisits topics with greater subtlety 
and increased dimension. LAS 


Qperator Theory, P. Mathematical Analysis and 
Numerical Methods for Science and Technology, Vol- 
ume $3: Spectral Theory and Applications. Robert 
Dautray, Jacques-Louis Lions. Transl: John C. Am- 
son. Springer-Verlag, 1990, x + 515 pp, $98. [ISBN: 
0-387-50208-4] Topics include spectral decomposi- 
tion of self-adjoint operators and examples in elec- 
tromagnetism and quantum physics. (Volume 2: 
Functional and Variational Methods, TR, June-July 
1989.) MLR 


Functional Analysis, P. Nonstandard Methods in 
Fized Point Theory. Asuman G. Aksoy, Mohamed 
A. Khamsi. Universitext. Springer-Verlag, 1990, 
ix + 139 pp, $29 (P). [ISBN: 0-387-97364-8] Self- 
contained, unified treatment of recent work on ap- 
plications of Banach space ultraproducts to the fixed 
point theory of non-expansive mappings (natural ex- 
tensions of contraction mappings to subsets of Ba- 
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nach spaces). RM 


Functional Analysis, T(17-18: 1), P. Anal- 
yse Harmonique. M. Samuelides, L. Touzillier. 
Cepadues-Editions, 1990, 224 pp, 180 F. [ISBN: 
2-85428-240-X] A concise introduction beginning 
with classical Fourier analysis. Covers convolu- 
tions, Fourier transforms, harmonic synthesis, distri- 
butions, and Laplace transforms. Each brief chapter 
sketches both key mathematical results and common 
applications. LAS 


Functional Analysis, P. Lecture Notes in Mathe- 
matics-1433: Topics in Nevanlinna Theory. Serge 
Lang, William Cherry. Springer-Verlag, 1990, 174 
pp, $18 (P). [ISBN: 0-387-52785-0] Nevanlinna the- 
ory in the classical case of meromorphic functions 
and generalizations to equidimensional holomorphic 
maps f : C™ — X, where X is a compact manifold. 
Possible supplement for a graduate course in complex 
analysis. MLR 


Functional Analysis, T(16-17), P, L. Theory of 
Distributions: A Non-Technical Introduction. Ian 
Richards, Heekyung Youn. Cambridge University 
Pr, 1990, ix + 147 pp, $39.50. [ISBN: 0-521-37149-X] 
Intended for the “non-specialist” interested in distri- 
butions (generalized functions, e.g., the Dirac delta 
function). Good explanations, nice style. MLR 


Analysis, P, L. Scientific Computation on Math- 
ematical Problems and Conjectures. Richard S. 
Varga. CBMS-NSF Reg. Conf. Ser. in Appl. Math., 
V. 60. SIAM, 1990, vi + 122 pp, $21.50 (P). [ISBN: 0- 
89871-257-2] Looks at the interplay between anal- 
ysis and high-precision calculations in six indepen- 
dent examples: the Bernstein conjecture in approxi- 
mation theory, the “1/9” conjecture for semi-discrete 
approximations of parabolic equations, the Riemann 
Hypothesis, zeros of the partial sums of e”, real 
vs. complex rational approximations, and generaliza- 
tions of Jensen’s inequality. BC 


Analysis, P. Tables of Fourier Transforms and 
Fourier Transforms of Distributions. Fritz Oberhet- 
tinger. Springer-Verlag, 1990, viii + 259 pp, $35 
(P). [ISBN: 0-387-50630-6] “New, revised, and en- 
larged version of the previously published book by 
the author entitled Tabellen zur Fourier Transforma- 
tion (Springer-Verlag, 1957).” Impressive collection 
of functions tabulated. BH 


Differential Geometry, P. Lecture Notes in 
Mathematics-1424: Twistor Theory for Riemannian 
Symmetric Spaces. Francis E. Burstall, John H. 
Rawnsley. Springer-Verlag, 1990, 112 pp, $14.70 
(P). [ISBN: 0-387-52602-1] Studies the relationship 
between flag manifolds and Riemannian symmetric 
spaces with inner involution. Also studies the struc- 
ture of the zero set of the Nijenhuis tensor on J(N). 
Applies results of these studies to harmonic maps of 
S? into asymmetric space N. Offers uniform proof of 
results of Calabi, Eells-Wood, and Bryant concern- 
ing harmonic 2-spheres in $2", CP", and HP1. Com- 
pletely characterizes stable harmonic 2-spheres into 
an irreducible Riemannian symmetric space. Ex- 
tends results of Uhlenbeck on harmonic 2-spaces in 
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U(n) to other Lie groups. OJ 


Differential Geometry, P. Harmonic Maps and 
Minimal Immersions Through Representation The- 
ory. Gabor Toth. Perspectives in Math., V. 12. 
Academic Pr, 1990, xii + 154 pp, $24.95. [ISBN: 0- 
12-696075-5] An advanced monograph on the gen- 
eral problem of embedding minimal surfaces in Eu- 
clidean space. The present setting concerns “har- 
monic” embeddings, i.e, embeddings via maps whose 
components are harmonic functions. The author 
studies properties, especially boundary properties, of 
the parameter spaces of such maps, known as moduli 
spaces. PZ 


Geometry, P. Lecture Notes in Mathematics-1436: 
Enumerative Geometry. Ed: S. Xambo-Descamps. 
Springer-Verlag, 1990, v + 303 pp, $32 (P). [ISBN: 
0-387-52811-3] Ten papers (plus one by J.M. Miret 
and S. Xambé Descamps) from twenty-nine lectures 
presented at a conference in Sitges, Spain, June 
1987. JPH 


Geometry, T*(15-16: 1), P, L. Basic Concepts of 
Geometry. Walter Prenowitz, Meyer Jordan. Ard- 
sley House (320 Central Park West, New York, NY 
10025), 1989, xix + 350 pp, $34.95 (P). [ISBN: 0- 
912675-48-9] A paperback republication of a classi- 
cal introduction to modern geometry; begins with in- 
cidence geometry and adds postulates to study other 
geometries including projective, affine, Euclidean, 
and non-Euclidean. JNC 


Geometry, T(14-16: 1), S, P. Measure, Topology, 
and Fractal Geometry. Gerald A. Edgar. Undergrad. 
Texts in Math. Springer-Verlag, 1990, xiii + 230 pp, 
$29.95. [ISBN: 0-387-97272-2] Text for a rigorous 
pure mathematics course on fractal geometry. Frac- 
tal examples, metric topology, topological dimension, 
self-similarity, measure theory, Hausdorff dimension. 
Prerequisites: experience with mathematical proofs; 
basic abstract set theory; solid background in calcu- 
lus. Exercises integrated throughout the chapters. 
Dozens of illustrations and color plates. RB 


Algebraic Topology, P. Advances in Homotopy 
Theory. Eds: S.M. Salamon, B. Steer, W.A. Suther- 
land. London Math. Soc. Lect. Note Ser., V. 139. 
Cambridge University Pr, 1989, xiii + 177 pp, $27.95 
(P). (ISBN: 0-521-37907-5] Fifteen papers from the 
Fifth Oxford Topology Symposium held in 1988 at 
Cortona, Italy to celebrate the sixtieth birthday of 
Joan James. JAS 


Algebraic Topology, P. Lecture Notes in Mathe- 
matics-1423: Stable Homotopy Groups of Spheres: A 
Computer-Assisted Approach. Stanley O. Kochman. 
Springer-Verlag, 1990, viii + 330 pp, $31.20 (P). 
(ISBN: 0-387-52468-1] A development of a theoret- 
ical basis for efficient computation of the stable ho- 
motopy groups of spheres (herein carried out for the 
first 64 stable stems). Includes a discussion of al- 
gorithms used in the Fortran programs. Program 
listings are available from the author. The computa- 
tional techniques using spectral sequences may be of 
more general interest. JAS 


Differential Topology, T(18), S, P. Variational 
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Principles of Topology: Multidimensional Minimal 
Surface Theory. A.T. Fomenko. MIA, V. 42. Kluwer 
Academic, 1990, xviii + 374 pp, $133. [ISBN: 0- 
7923-0230-3] Offers new approach to the multidi- 
mensional Plateau problem. Uses the concepts of 
stratified surfaces and their stratified volumes to re- 
formulate and solve it in the context of spectral Bor- 
dism theory. Requires a strong foundation in alge- 
braic topology, differential geometry, and the calcu- 
lus of variations. OJ 


Topology, T*(17-18: 1, 2), P. General Topology, 
Revised and Completed Edition. Ryszard Engelk- 
ing. Heldermann Verlag, 1989, viii + 529 pp, DM 
148. [ISBN: 3-88538-006-4] Some updating reflect- 
ing current research and additions to the bibliogra- 
phy as well as a few corrections characterize this new 
version of the 1977 classic text and reference. JAS 


Topology, T(15-16), L. Introduction to Topology, 
Third Edition. Bert Mendelson. Dover, 1990, ix 
+ 206 pp, $5.95 (P). [ISBN: 0-486-66352-3] Paper- 
back reprint of the late author’s elementary, but thor- 
ough, introduction to axiomatic point set topology 
(essentially the same as the Allyn and Bacon Second 
Edition). Includes also an introduction to homotopy 
theory, the fundamental group, and surface topology. 
(1968 Allyn and Bacon Second Edition, TR, January 
1969; 1975 Third Edition, TR, June-July 1975.) JPH 


Topology, T(18: 2). Topics in General Topol- 
ogy. Kiiti Morita, Jin-iti Nagata. Math. Lib., V. 41. 
North-Holland (US Distr: Elsevier Science), 1989, xi 
+ 747 pp, $197.25. [ISBN: 0-444-70455-8] A collec- 
tion of fifteen monographs by various Japanese topol- 
ogists on recent developments in general topology de- 
signed as a text or reference work for advanced stu- 
dents seeking to get to the research frontier. A good 
common index adds to its value as a reference. JAS 
Optimization, T(17-18: 2), P, L. Optimization 
and Nonsmooth Analysis. F.H. Clarke. Classics in 
Appl. Math., V. 5. SIAM, 1990, xii + 308 pp, $28.50 
(P). {ISBN: 0-89871-256-4] A reprint of the 1983 
Wiley text (TR, March 1984). Part of a series of 
“out-of-print” books published by SIAM. Corections 
have been incorporated into this edition. No exer- 
cises. SM 


Dynamical Systems, P. Chaos in dynamischen 
Systemen. Willi-Hans Steeb, Albrecht Kunick. Bib- 
liographisches Institut, 1989, 239 pp, (P). [ISBN: 3- 
411-14152-2] A general introduction, in German, 
to theory and methods of chaotic dynamical sys- 
tems. The dynamical systems considered arise in 
various contexts: difference equations, differential 
equations, Hamiltonian systems, cellular automata, 
etc. A number of applications are also considered, 
mainly to physics, mechanics, and electronics. Con- 
tains many diagrams and computer programs. PZ 


Control Theory, P. Lecture Notes in Control 
and Information Sciences-142: Algebraic Theory of 
Linear Feedback Systems with Full and Decentral- 
ized Compensators. A.N. Gindes, C.A. Desoer. 
Springer-Verlag, 1990, v + 176 pp, $27.10 (P). 
[ISBN: 0-387-52476-2] Algebraic approach (ring 
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theoretic factorizations) to the study of linear, time- 
invariant, multi-input/output feedback control sys- 
tems (full and decentralized, discrete and continu- 
ous time systems). Focus on closed loop stability, 
parametrization of stabilizing compensators. RM 


Control Theory, P. Control of Quantum-Mechan- 
ical Processes and Systems. A.G. Butkovskiy, Yu. I. 
Samoilenko. Math. & Its Applic., V. 56. Kluwer 
Academic, 1990, xiii + 232 pp, $99. [ISBN: 0-7923- 
0689-9] From the Preface: “Purposeful control of 
the quantum state of matter has long been the focus 
of investigation for many researchers working in vari- 
ous branches of physics {such as laser technology] .... 
This book sums up the results of modern research in 
the theory of automatic control, which can be used 
to solve many theoretical problems of control systems 
defined by the equations of quantum physics.” BC 


Control Theory, T(16-17: 1, 2). Mathematical 
Control Theory: Deterministic Finite Dimensional 
Systems. Eduardo D. Sontag. Textsin Appl. Math., 
V. 6. Springer-Verlag, 1990, xiii + 396 pp, $39. 
(ISBN: 0-387-97366-4] Accessible treatment of ba- 
sics of control and systems theory: linear systems, 
optimal control, dynamic feedback, observers, reach- 
ability, controlability, parametrization and stability, 
deterministic Kalman filtering. Minimal prerequi- 
sites, with appendices which cover needed material 
from linear algebra, normed spaces, ordinary differ- 
ential equations. RM 


Computer Literacy, S, P, L? Computer Viruses 
and Anti-Virus Warfare. Jan Hruska. Ellis Hor- 
wood (US Distr: Wiley), 1990, 128 pp, (P). [ISBN: 
0-13-171067-2] A handbook which presents opera- 
tional methods and internal structures of viruses on 
MS-DOS personal computers, and discusses counter- 
measures, anti-virus software, etc. Does not treat 
non-DOS machines or threats other than viruses 
(e.g., worms, Trojan horses). C language source for 
two anti-virus programs included; bibliography, anti- 
virus software vendors. RB 


Elementary Computer Science, T(13-14: 1), 
L. Great Ideas in Computer Science: A Gentle In- 
troduction. Alan W. Biermann. MIT Pr, 1990, xx + 
446 pp, $27.95 (P). (ISBN: 0-262-52148-2] An in- 
troduction to the discipline of computer science that 
emphasizes ideas and concepts. Focuses on the dis- 
coveries and technological breakthroughs that have 
made computers an important part of our daily life. 
Covers programming, hardware, and software. AO 


Elementary Computer Science, T(13-14), L. 
A First Course in Modula-2. Lowell A. Carmony, 
Robert L. Holliday. Computer Science Pr, 1990, xii 
+ 419 pp, $29.95 (P). (ISBN: 0-7176-8229-4] A suit- 
able text for a first computer science course. The 
emphasis is on problem-solving and top-down, struc- 
tured design of algorithms using the Modula-2 pro- 
gramming language. The tone of the text is informal 
at first, allowing students to achieve intuition about 
formal programming procedures. Chapter 8 is dedi- 
cated entirely to a discussion of disciplined program- 
ming techniques, e.g., top-down designs, pseudo- 


1991] 


codes, commenting, and debugging. Many exercises 
and examples. MPR 

Programming, T(15-17: 1), C, P. Object- 
Oriented Programming with Simula. Bjgrn Kirke- 
rud. Intern. Comput. Sci. Ser. Addison-Wesley, 
1989, xiii + 515 pp, (P). [ISBN: 0-201-17574-6] The 
aim is object-oriented programming. So why Sim- 
ula? Because the author likes it and makes a case 
that Simula is an especially good vehicle for learn- 
ing object-oriented programming. Sample programs 
are available by “anonymous ftp” from the author’s 
internet host ifi.uio.no. JAS 


Programming, S, P, L**. The Annotated C++ 
Reference Manual. Margaret A. Ellis, Bjarne Strous- 
trup. Addison-Wesley, 1990, x + 447 pp, $37.75. 
[ISBN: 0-201-51459-1] A complete reference for the 
C++ programming language. It contains the refer- 
ence manual recently adopted as the basis for ANSI 
standardization of the language together with anno- 
tations and commentary about the design and imple- 
mentation of the language. Examples are given that 
explore the differences between C and C++ and that 
explain language subtleties. AO 


Programming, S(15-16), P. Partial Computa- 
tion and the Construction of Language Processors. 
Frank G. Pagan. Prentice-Hall, 1991, x + 166 
pp. [ISBN: 0-13-651415-4] Partial computation is a 
(hitherto theoretical) programming strategy involv- 
ing programs writing programs: to solve a problem, 
Program A uses some of the data to write Program 
B, which can solve the problem given the remain- 
ing data. This monograph introduces a methodology 
based on this idea, and demonstrates it through the 
case study of compiler/interpreter construction. RB 


Programming, T(13-14: 1), L. Interactive FOR- 
TRAN 77: A Hands-On Approach, Second Edition. 
I.D. Chivers, Jane Sleighthome. Ser. in Comput. & 
Applic. Ellis Horwood (US Distr: Simon & Schus- 
ter), 1990, 224 pp. [ISBN: 0-13-466764-6] A new 
edition of the 1984 text. A new feature is the ad- 
dition of an appendix listing all the intrinsic func- 
tions of Fortran 77. Covers standard material. Many 
provocative exercises. SM 


Programming, S(14-17), P, L. C++: A Guide 
for C Programmers. Sharam Hekmatpour. Prentice- 
Hall, 1990, x + 264 pp, (P). [ISBN: 0-13-109471-8] 
An introduction to those features of the C++ lan- 
guage which extend C language, for undergraduates, 
graduates, and professionals. The added facilities of 
C++ are described in the first half of the book; the 
second half presents four case studies as examples 
of application. Several programming exercises with 
answers. RB 


Programming, S, P. UNIX System V, Release 
4: ANSI C Transition Guide. AT&T. Prentice-Hall, 
1990, iii + 49 pp, (P). [ISBN: 0-13-933698-2] Offi- 
cial System V documentation: eleven technical arti- 
cles on how to write new and upgrade existing C code 
to comply with the ANSI C language specification. 
Originally written by David Prosser of Bell Labs for 
internal use. RB 
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Programming, T(15-17: 1), P. The C++ Work- 
book. Richard S. Wiener, Lewis J. Pinson. Addison- 
Wesley, 1990, xviii + 349 pp, (P). [ISBN: 0-201- 
50930-X] Intended as a sequel to any standard pro- 
gramming text for C or C++, this book aims to de- 
velop both facility and style for the serious user of 
C++. JAS 


Programming, 8(13-16). The C++ Answer 
Book. Tony L. Hansen. Addison-Wesley, 1990, ix + 
578 pp, (P). [ISBN: 0-201-11497-6] A heavily anno- 
tated set of solutions to the exercises in Stroustrup’s 
basic language manual The C++ Programming Lan- 
guage. This volume provides a significant body of 
example code which has the blessings of AT&T peo- 
ple. JAS 


Programming, 8(14), L. LISP Style & Design. 
Molly M. Miller, Eric Benson. Digital Pr, 1990, x + 
214 pp, $26.95 (P). [ISBN: 1-55558-044-0] A primer 
on design and style of Common Lisp programs, not 
Lisp itself; assumes reader knows Lisp. For the au- 
thors, style comprises efficient algorithms, good orga- 
nization, appropriate abstractions, well-constructed 
function definitions, useful commentary, effective de- 
bugging. Uses a single large programming example 
throughout to illustrate the design process. Very 
readable. DFA 

Programming, T(15-17). Programming in C++. 
Stephen C. Dewhurst, Kathy T. Stark. Prentice- 
Hall, 1989, vi + 233 pp, (P). [ISBN: 0-13-723156-3] 
More and less than a programming manual. This 
book explains the ideas behind C++ and provides 
examples of programming techniques using the lan- 
guage. Not a language manual, although it discusses 
the features of the language in detail. Aimed at ex- 
perimental programmers. JAS 


Programming, S, L. Object-Oriented Software. 
Ann L. Winblad, Samuel D. Edwards, David R. 
King. Addison-Wesley, 1990, xviii + 291 pp, $29.25 
(P). [ISBN: 0-201-50736-6] An overview for in- 
terested persons, no matter what their program- 
ming experience—none, some, much. Object ori- 
entation and its benefits; object-oriented languages, 
databases, user-interfaces; topics in object-oriented 
applications—analysis, design, programming, main- 
tenance, project management, development tools, 
and environments; examples of applications. Non- 
technical. Useful list of references and glossary. DFA 


Languages, T(16-17: 1), P, L. Functional Pro- 
gramming: Practice and Theory. Bruce J. MacLen- 
nan. Addison-Wesley, 1990, xii + 596 pp, $35.50. 
[ISBN: 0-201-13744-5] An introduction to func- 
tional programming methodology covering both 
practical and theoretical issues. No specific language 
is used in the book; examples are presented using a 
pseudocode based on standard mathematical nota- 
tion. AO 

Languages, P, L*. Applications of Object-Oriented 
Programming. Eds: Lewis J. Pinson, Richard S. 
Wiener. Addison-Wesley, 1990, x + 212 pp, $37.75. 
[ISBN: 0-201-50369-7] Six case studies that de- 
scribe the use of object-oriented programming lan- 
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guages in projects that have been designed and im- 
plemented. The focus of each case study is on the 
object-oriented design, its implementation, and the 
lessons used. The languages used in the projects in- 
clude C++, Smalltalk, Objective-C, Actor, and Ob- 
ject Pascal. AO 


Languages, T(14-15: 1), P. Functional Program- 
ming with Hope. Roger Bailey. Ser. in Computers 
& Their Applic. Ellis Horwood (US Distr: Wiley), 
1990, xii + 301 pp. [ISBN: 0-13-338237-0] An in- 
troductory text on functional programming using the 
Hope programming language. Hope combines the se- 
mantics of a functional programming language with 
strong typing, user-definable data structures, and 
features that support modular program design. AO 
Languages, T*(15-17: 1), L. Programming Lan- 
guages: An Interpreter-Based Approach. Samuel N. 
Kamin. Addison-Wesley, 1990, xi + 640 pp. [ISBN: 
0-201-06824-9] A textbook for a programming lan- 
guage course that focuses on non-imperative lan- 
guages; specifically, Lisp, APL, Scheme, Sasl, Clu, 
Smalltalk, and Prolog. An interpreter for a simpli- 
fied subset of each language is included. AO 
Languages, S(15-17), L. Programming with Gen- 
erators, An Introduction. Alfs Berztiss. Ser. in Com- 
put. & Their Applic. Ellis Horwood (US Distr: Wi- 
ley), 1990, 194 pp. [ISBN: 0-13-739087-4] Genera- 
tors are similar to procedures, but some local state is 
preserved between calls. Programming with genera- 
tors allows one to focus on streams of values rather 
than on individual values. This book provides exam- 
ples of the use of generators and develops program- 
ming validation and verification techniques appropri- 
ate for use with generators. AO 


Languages, P, L. The REXX Language: A Prac- 
tical Approach to Programming, Second Edition. 
Michael Cowlishaw. Prentice-Hall, 1990, xii + 203 
pp, (P). [ISBN: 0-13-780651-5] The author devel- 
oped REXX beginning in 1979 to be a general 
purpose procedural language that is elegant and 
easy to deal with. REXX has an interpretive style; 
it is useful for general purpose computing and as 
a cross-operating system command language inter- 
preter (most implementations are on IBM systems). 
This manual officially defines REXX 4.0, summarizes 
philosophy behind the language. RB 

Algorithms, T*(14-15: 1), L. Algorithms in C. 
Robert Sedgewick. Addison-Wesley, 1990, xii + 657 
pp. [ISBN: 0-201-51425-7] This is a C language 
version of the author’s popular book Algorithms. 
Presents algorithms for sorting, searching, string- 
processing as well as for problems in graph theory, 
geometry, and numerical mathematics. AO 
Algorithms, T**(15-17: 1, 2), L*. Introduction 
to Algorithms. Thomas H. Cormen, Charles E. Leis- 
erson, Ronald L. Rivest. Electrical Eng. & Com- 
put. Sci. Ser. MIT Pr, 1990, xvii + 1028 pp, $49.95. 
[ISBN: 0-262-03141-8] A comprehensive introduc- 
tion to the study of computer algorithms. Algo- 
rithms are described using a pseudocode and ana- 
lyzed in terms of run-time efficiency. Each chapter 
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presents an algorithm, a design technique, an appli- 
cation area, or a related topic. Useful as a reference 
as well as a textbook. AO 


Algorithms, S(16-18), P. Solving Problems On 
Concurrent Processors, Volume II: Software for Con- 
current Processors. lan G. Angus, et al. Prentice- 
Hall, 1990, x + 805 pp. [ISBN: 0-13-829714-2] 
Software codes for scientific algorithms and appli- 
cations described in Volume I. Includes discussion 
of concurrent systems currently available, bench- 
mark codes, and tutorial introduction to CrO$S III 
(crystalline—i.e., synchronous—communication sys- 
tem for concurrent computers), currently available 
for Intel, NCUBE hypercubes, and the BBN Butter- 
fly. RM 


Computer Systems, T(15-17), S, P, L**. In- 
side Smalltalk, Volume I. Wilf R. LaLonde, John R. 
Pugh. Prentice-Hall, 1990, xvi + 512 pp. [ISBN: 0- 
13-468414-1] The first volume of a two-volume in- 
depth introduction to using Smalltalk-80, an object- 
oriented programming environment. This volume 
covers fundamental concepts, the Smalltalk environ- 
ment, the basic classes, and graphics classes. As- 
sumes the reader has experience programming in a 
procedural language and has access to a Smalltalk 
system. AO 


Computer Systems, T(14-16: 1), P, L. UNIX 
Network Programming. W. Richard Stevens. Pren- 
tice-Hall, 1990, xi + 772 pp. [ISBN: 0-13-949876-1] 
A hands-on text about Unix network software devel- 
opment, illustrated with 15,000 lines of documented 
source code. Presumes C reading knowledge, com- 
puting experience. Discussions of: Unix processes; 
interprocess communication mechanisms; overviews 
of networking and popular protocols; BSD sockets 
and System V Transport Layer Interface; about a 
dozen complete examples amplified by discussions of 
issues. Exercises after most chapters. RB 


Computer Systems, P. OSF/Motif, Revision 1.0. 
Open Software Foundation. Prentice-Hall, 1990. 
OSF/Motif User’s Guide, Revision 1.0, xii + 66 
pp, (P), [ISBN: 0-13-640509-6]; OSF/Motif Style 
Guide, Revision 1.0, xi + 156 pp, (P), [ISBN: 0- 
13-640491-X]; OSF/Motif Programmer’s Guide, Re- 
vision 1.0, xxiv + 726 pp, (P), [ISBN:0-13-640525-8]; 
OSF/Motif Programmer’s Reference, Revision 1.0, 
ix + 941 pp, (P). [ISBN: 0-13-640517-7] Motif is a 
set of tools for window management and applications 
program development in a windowing environment 
built on top of the X-Window system. Motif is an 
attempt by the Open Software Foundation to pro- 
vide a high-level standard for applications running 
in a windowing environment. The User’s Guide is a 
quick tutorial to using the Motif user environment. 
The Style Manual is a specification manual for stan- 
dardizing the user interface. The other two volumes 
provided guidance, examples, and function references 
for the developer of programs intended to run in this 
environment. JAS 


Computer Systems, P. METAFONTware. Don- 
ald E. Knuth, Tomas G. Rokicki, Arthur L. Samuel. 
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TeXUsers Group (PO Box 9506, Providence, RI 
02940), 1990, $20 (P). A collection of four techni- 
cal reports describing the most recent versions of the 
programs GFType, GFtoPK, GFtoDVI, and MFT. 
The versions described here are those that corre- 
spond to Version 1.8 of METAFONT which provided 
support for 8-bit input and extended ligatures. AO 


Computer Systems, P. X Window System: The 
Complete Reference to Xlib, X Protocol, ICCCM, 
XIFD, Second Edition, X Version 11, Release 4. 
Robert W. Scheifler, James Gettys. Digital Pr, 1990, 
xxiv + 851 pp, $49.95 (P). [ISBN: 1-55558-050-5] 
Complete reference for low levels of the X11 win- 
dow system Release 4 (“X”), written by participants 
in the development of X. Contents: Xlib C language 
library interface (bulk of the text); the X commu- 
nication protocol; communication conventions be- 
tween applications (clients), font description speci- 
fications. RB 


Computer Systems, P, L. The X Window System 
in a Nutshell for Version 11 Release 4 of the X Win- 
dow System: Desktop Quick Reference. Eds: Daniel 
Gilly, Tim O’Reilly. O’Reilly & Associates, 1990, 
xi + 367 pp, $24.95 (P). [ISBN: 0-937175-24-2] A 
quick reference handbook for users of the X Window 
System (Version 11, Releases 3 and 4). Primarily 
of interest to experienced users who are writing pro- 
grams using Xlib and Xt. AO 


Computer Systems, P, L. XView Programming 
Manual for Version 11 of the X Window System, 
Volume Seven. Dan Heller. O’Reilly & Associates, 
1990, xxviii + 557 pp, $30 (P). [ISBN: 0-937175-38-2] 
XView (X Window-System Visual/Integrated Envi- 
ronment for Workstations) is a user-interface toolkit 
that supports interactive, graphics-based applica- 
tions running under the X Window System. This 
book provides a basic introduction to developing ap- 
plications using this toolkit and also serves as a ref- 
erence manual. AO 


Computer Systems, S(14-15), C, P, L. Hyper- 
tezt and Hypermedia. Jakob Nielsen. Academic Pr, 
1990, xii + 263 pp, $29.95. [ISBN: 0-12-518410-7] 
Hypertext is software which allows one to access in- 
formation in a nonsequential manner. At certain 
points the user has several options as to what to 
read (access) next. Hypertext is beginning to make 
an impact in areas such as on-line documentation, 
computer-aided learning, and other areas with large 
amounts of interconnected information. This book 
provides an overview of its uses, implementation, and 
history. MPR 


Computer Systems, P. X and Motif Quick Ref- 
erence Guide. Randi J. Rost. Digital Pr, 1990, vii 
+ 369 pp, $24.95 (P). [ISBN: 1-55558-052-1] Sum- 
mary reference of critical information in three pri- 
mary references on the upper and lower layers of the 
X window system (Release 4) and OSF/Motif (Ver- 
sion 1.1) programming. Not a replacement for an 
introductory guide or the comprehensive manuals. 
Function headers; data structure definitions; various 
tables. RB 
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Computer Systems, P. OSF/Motif Reference 
Guide. Douglas A. Young. Prentice-Hall, 1990, 155 
pp, (P). [ISBN: 0-13-642786-3]| Summary reference 
manual for the Intrinsics layer and the OSF/Motif 
widget collection for the X windows system, by the 
author of widely-used introductions to X windows. 
Not a replacement for introductory manuals or com- 
prehensive references. RB 


Computer Systems, P. X Window System Toolkit: 
The Complete Programmer’s Guide and Specifica- 
tion, X Version 11, Release 4. Paul J. Asente, 
Ralph R. Swick. Digital Pr, 1990, xxxv + 967 pp, 
$44.95 (P). [ISBN: 1-55558-051-3] Comprehensive 
guide for upper layers (Intrinsics, widgets) of the X11 
window system Release 4 (“X”), by the lead design- 
ers of those layers; companion volume to X Window 
System: The Complete Reference to Xlib, X Protocol, 
ICCCM, XLFD. Assumes C language and overview 
of X architecture. Application programming with 
the toolkit; writing widgets (bulk of the text); spec- 
ifications for functions, data structures. RB 


Computer Systems, P. The X Window System: 
Programming and Applications with Xi, OSF/Motif 
Edition. Douglas A. Young. Prentice-Hall, 1990, x + 
533 pp, (P). [ISBN: 0-13-497074-8] A tutorial intro- 
duction to writing programs in an X Window System 
environment using Xlib and the Xt Intrinsics library. 
An updated version of the author’s 1989 book that 
uses the Motif release 1.0 widget set rather than the 
Xw widget set. AO 


Computer Systems, P, L. UNIX for FORTRAN 
Programmers. Mike Loukides. O’Reilly & Asso- 
ciates, 1990, xviii + 244 pp, $24.95 (P). [ISBN: 0- 
937175-51-X] An excellent book describing the fea- 
tures of the Unix Fortran compiler f77 and related 
software. The reader is assumed to know the For- 
tran language. Discussed here are the Unix editor 
vi, the debugging programs dbx and adb, the com- 
pilation tool make, and the large scale source code 
librarian res. Some minimal shell programming for 
the Bourne Shell. Environmental variables for the 
C-shell are also briefly mentioned. This book is ex- 
tremely well written. SM 


Computer Systems, P, L. X Window Systems: 
Programming and Applications with Xt. Douglas A. 
Young. Prentice-Hall, 1989, x + 468 pp, (P). [ISBN: 
0-13-972167-3] A tutorial introduction to writing 
programs in an X Window System environment us- 
ing Xlib and the Xt Intrinsics library. Based on the 
Xt Intrinsics and the Xw widget set from Releases 2 
and 3 of Version 11 of the X Window System. AO 


Computer Systems, P, L. X Toolkit Intrinsics 
Reference Manual for Version 11 of the X Window 
System, Second Edition, Volume Five. Ed: Tim 
O’Reilly. O’Reilly & Associates, 1990, xii + 764 pp, 
(P). [ISBN: 0-937175-57-9] A reference manual for 
the Xt Intrinsics library consisting of the reference 
pages for the library together with a permuted in- 
dex, a quick reference guide, and other useful infor- 
mation. AO 


Computer Systems, P, L. X Toolkit Intrinsics 
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Programming Manual for Version 11 of the X Win- 
dow System, Second Edition, Volume Four. Adrian 
Nye, Tim O’Reilly. O’Reilly & Associates, 1990, 
xxxiii + 543 pp, (P). [ISBN: 0-937175-56-0] The Xt 
Intrinsics library consists of routines for building and 
using widgets (pre-built user-interface components) 
for the X Window System. This book is a tutorial 
introduction to building applications using existing 
widgets and writing new widgets. It is based on the 
Xt Intrinsics and the Xw widget set from Release 4 
of Version 11 of the X Window System. AO 


Computer Graphics, C. PC Graphics with GKS. 
Peter R. Bono, e¢ al. Transl: Cynthia Skripak. 
Prentice-Hall, 1990, xi + 321 pp, (P). [ISBN: 0- 
13-654435-5] Introduces programmers to computer 
graphics using GKS. Provides basic concepts of the 
graphics system along with examples of GKS func- 
tions. Illustrates function definitions in two varia- 
tions: GKS standard and Fortran 77. Classifies all 
GKS functions at the end. LS 


Computer Graphics, L**. Graphics Gems. Ed: 
Andrew S. Glassner. Academic Pr, 1990, xxix + 
833 pp, $49.95. [ISBN: 0-12-286165-5] A collection 
of algorithms, programs, and mathematical tech- 
niques for the computer graphics programmer. It 
contains brief tutorials and clever ideas from the folk- 
lore of computer graphics that deal with nuts-and- 
bolts issues of programming and implementation. An 
appendix contains C language implementations of 
many of the algorithms described. AO 


Computer Graphics, P. X Window System User’s 
Guide for X11 R3 and R4 of the X Window System. 
Valerie Quercia, Tim O’Reilly. X Window System 
Ser., V. 3. O’Reilly & Associates, 1990, xxvi + 723 
pp, $30 (P). [ISBN: 0-937175-14-5] An introduction 
and reference for first-time and experienced users of 
the X11 windows system (“X”), releases 3 and 4; part 
of a popular series of X guides. Contents: using X, 
including the “twm” window manager; customizing 
X; Unix-style reference pages; appendices on system 
management, fonts, etc. RB 


Theory of Computation, P. Lecture Notes in 
Computer Science-425: Algebraic Logic and Uni- 
versal Algebra in Computer Science. Eds: C.H. 
Bergman, R.D. Maddux, D.L. Pigozzi. Springer- 
Verlag, 1990, xi + 292 pp, $27.60 (P). [ISBN: 0- 
387-97288-9] Proceedings of a conference in Ames, 
Iowa (June 1988) in which mathematicians and com- 
puter scientists, who had been independently apply- 
ing similar algebraic methods to their different prob- 
lem types, sought to identify common ground. Six- 
teen papers representing a wide range of topics. RB 


Theory of Computation, P. Lecture Notes in 
Computer Science-440: CSL ’89. Eds: E. Borger, 
H. Kleine Buining, M.M. Richter. Springer-Verlag, 
1990, vi + 437 pp, $35 (P). [ISBN: 0-387-52753-2] 
Proceedings of the Third Annual Workshop on Com- 
puter Science Logic held at the University of Kaiser- 
slautern, October 1989. Twenty-eight papers on a 
mixture of topics in the theory of logic in computer 
science. RB 
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Artificial Intelligence, §(16-18), P. Minimalist 
Mobile Robotics: A Colony-style Architecture for an 
Artificial Creature. Jonathan H. Connell. Perspect. 
in Artif. Intell., V. 5. Academic Pr, 1990, xvii + 
175 pp, $29.95. [ISBN: 0-12-185230-X] Describes 
“Herbert,” a robot with grasping arm that wanders 
around an unstructured office environment collect- 
ing empty soda cans. Design philosophy uses the 
world as its own best model, with distributed, inde- 
pendent multi-agent “behaviors,” coordinated by a 
priority-based arbitration scheme, with no central- 
ized control, no complex internal representations, no 
internal communciation between agents, yet yielding 
seemingly purposeful behavior. RM 


Computer Science, P. Annual Review of Com- 
puter Science, Volume 4, 1989-1990. Ed: Joseph 
F, Traub, e¢ al, Annual Reviews, 1990, viii + 540 
pp, $45. [ISBN: 0-8243-3204-0] This series presents 
high-quality critical review articles, intending to help 
define and set standards in the discipline; publication 
is being suspended for lack of willing authors. Arti- 
cles on topics including supercomputers, distributed 
file systems, complexity classes, superscalar archi- 
tectures, several artificial intelligence fields, architec- 
ture, software prototyping. RB 


Computer Science, $(15-16), P. Software Ezcel- 
lence. Larry Shiller. Prentice-Hall, 1990, xv + 239 
pp. [ISBN: 0-13-829748-7] This book defines excel- 
lent software as software that meets its design objec- 
tives, is manageable, is changeable and long-living, 
and is learnable by someone not part of the original 
design team. Develops the idea of excellent software 
and describes how it can be achieved. AO 


Computer Science, P. Advances in Computers, 
Volume 30. Ed: Marshall C. Yovits. Academic Pr, 
1990, x + 331 pp, $65. [ISBN: 0-12-012130-1] Sur- 
vey articles on specialized parallel architectures for 
textual databases, database design, software reliabil- 
ity, cryptography, Soviet computing. RB 
Applications, P. Conference Proceedings. TpX 
Users Group (PO Box 9506, Providence, RI 02940), 
1988, vi + 262 pp, (P). Proceedings of the annual 
conference held at Montreal, August 22-24, 1988. 
The articles emphasize commercial and production 
environments as well as solutions for problems for 
dealing with non-English languages including He- 
brew. JAS 


Applications (Economics), P, L. The Collected 
Economics Articles of Harold Hotelling. Ed: Adrian 
C. Darnell. Springer-Verlag, 1990, ix + 178 pp, 
$49.80. [ISBN: 0-387-97011-8] Harold Hotelling, a 
statistician with very little formal training in eco- 
nomics, made contributions to economics which, 
though not voluminous, were pioneering in math- 
ematical economics. Here they all are, together 
with a 28-page introduction to the life and economic 
thought of Hotelling. WE 

Applications (Engineering), T(14: 2), P. Ad- 
vanced Engineering Mathematics, Second Edition. 
A.C. Bajpai, L.R. Mustoe, D. Walker. Wiley, 1990, 
ix + 502 pp, $34.95 (P). [ISBN: 0-471-92595-0] 


1991] 


Aimed at engineering undergraduates. Good review 
of materials for those seeking professional degrees. 
Mix of analytical, numerical, and computer-based 
techniques. Light on “proofs.” Numerical techniques 
approached informally. Plentiful supply of worked 
examples and problems for solution, with answers to 
most. Changes from First Edition (TR, February 
1978) include more on use of calculator and micro- 
computer. Chapter on integral transforms extended. 
Discrete Fourier Transform introduced. JDEK 


Applications (Fluid Dynamics), P. Topological 
Fluid Mechanics. Eds: H.K. Moffatt, A. Tsinober. 
Cambridge University Pr, 1990, xviii + 805 pp, $115. 
[ISBN: 0-521-38145-2] Seventy-six papers from a 
1989 conference on topological aspects of fluid me- 
chanics and magnetohydrodynamics. BC 
Applications (Fluid Dynamics), T(17-18), P. 
Nonlinear Waves, Solitons and Chaos. E. Infeld, 
G. Rowlands. Cambridge University Pr, 1990, xi 
+ 423 pp, $85; $29.95 (P). [ISBN: 0-521-32111-5; 
0-521-37937-7] Self-contained, graduate-level text 
concentrating on nonlinear wave phenomena in fluids 
and plasmas. Starts with linear waves and instabili- 
ties. Treats surface waves, weakly and fully nonlinear 
waves, coherent behavior (like solitons) and nonco- 
herent phenomena such as chaos and turbulence. Ex- 
ercises following each chapter. Good for researchers 
entering the field. BL 

Applications (Fluid Dynamics), T(15-16: 1), 
L. A Mathematical Introduction to Fluid Mechanics, 
Second Edition. A.J. Chorin, J.E. Marsden. Texts 
in Appl. Math., V. 4. Springer-Verlag, 1990, 168 pp, 
$29. [ISBN: 0-387-97300-1] Section headings say it 
best. In order, they are Euler’s Equations, Rota- 
tion and Vorticity, Navier-Stokes Equations, Poten- 
tial Flow, Boundary Layers, Vortex Sheets, Charac- 
teristics, Shocks, The Riemann Problem, Combus- 
tion Waves. Based on a course in fluid mechanics 
at Berkeley. This edition incorporates some updates 
and revisions. Some exercises added; informal; in- 
tentionally references are limited and not exhaus- 
tive. JDEK 


Applications (Management), $(14-16), P. Soft- 
ware Project Management For Small to Medium 
Sized Projects. John J. Rakos. Prentice-Hall, 1990, 
xvill + 302 pp. [ISBN: 0-13-826173-3] A text in- 
tended primarily for project managers, usable at 
technical or business schools as well as within compa- 
nies, generally non-technical in nature, describing a 
standard approach in which phasing and documen- 
tation are simplified to fit smaller projects. Three 
parts: phases for building a project; practical man- 
agement methods; managing people. RB 

Applications (Physics), T(16-17), S, P, L. In- 
flation and Quantum Cosmology. A.D. Linde. Ed: 
Robert Brandenberger. Academic Pr, 1990, xii + 199 
pp, $29.95. [ISBN: 0-12-450145-1] An authoritative 
discursive survey organized into three chapters: par- 
ticle physics and inflationary cosmology provides a 
brief overview of the inflationary universe scenario 
and a discussion of the problems with the big bang 
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model; generation of density perturbations discusses 
in detail the mechanism by which energy density fluc- 
tuations can lead to galaxy formation; and quantum 
cosmology and the stochastic approach to inflation 
which covers recent developments. It ends with a se- 
ries of amateurish cartoons, the humor of which is 
lost on the reviewer. MU 


Applications (Physics), §(18), P. Renormalized 
Quantum Field Theory. O.I. Zavialov. Math. & 
Its Applic., V. 21. Kluwer Academic, 1990, xii + 
524 pp, $149. [ISBN: 90-277-2758-9] Extensively 
revised and updated from the 1979 original Renor- 
malized Feynman Diagrams, this new epic contains 
previously unpublished results. MU 

Applications (Physics), $(16-18), P, L. Spinors 
and Calibrations. F. Reese Harvey. Perspectives in 
Math., V. 9. Academic Pr, 1990, xiil + 323 pp, 
$39.95. [ISBN: 0-12-329650-1] Primarily a collec- 
tion of examples: the (simple) Lie groups, the spin 
groups for general signature, normed algebras for 
general signature, the exceptional groups G2 and F4, 
the orbit structure of the simpler representations of 
these groups, and the special Lagrangian and asso- 
ciative calibrations. MU 


Applications (Physics), P. Integrable Systems in 
Quantum Field Theory and Statistical Mechanics. 
Eds: M. Jimbo, T. Miwa, A. Tsuchiya. Adv. Stud. 
in Pure Math., V. 19. Academic Pr, 1989, 682 pp, 
$89.50. [ISBN: 0-12-385342-7] 


Applications (Physics), P. Mathematical Analy- 
sis and Numerical Methods for Science and Technol- 
ogy, V. 1: Physical Origins and Classical Methods. 
Robert Dautray, Jacques-Louis Lions. Transl: Ian 
N. Sneddon. Springer-Verlag, 1990, xvii + 695 pp, 
$89.90. [ISBN: 0-387-50207-6] 


Applications (Social Science), P. Chaos and 
Socio-Spatial Dynamics. Dimitrios S. Dendrinos, 
Michael Sonis. Appl. Math. Sci., V. 86. Springer- 
Verlag, 1990, xvii + 184 pp, $39.80. [ISBN: 0-387- 
97283-8| Introduces an iterative process which the 
authors claim has wide applicability to the social 
and biological sciences, particularly to population 
dynamics. The discussion is intended to be quali- 
tative, and true to their word, the authors provide 
few proofs. Readers should prepare for an exposi- 
tional style more typical of social scientists than of 
mathematicians. JO 


Reviewers 


DFA: David F. Appleyard, Carleton; RB: Richard Brown, 
St. Olaf; JNC: Judith N. Cederberg, St. Olaf; LC: Laura 
Chihara, St. Olaf; BC: Barry Cipra, St. Olaf; CEC: Clifton 
E. Corzatt, St. Olaf; AD: Amy Davidow, Macalester; BH: 
Bruce Hanson, St. Olaf; JPH: Joan P. Hutchinson, Macalester; 
OJ: Ockle Johnson, St. Olaf; JDEK: Joseph D.E. Konhauser, 
Macalester; BL: Brian Loe, Carleton; SM: Steve McKelvey, 
St. Olaf; RM: Richard Molnar, Macalester; JO: Jeff Ondich, 
St. Olaf; AO: Arnold Ostebee, St. Olaf; MLR: Margaret L. 
Reese, St. Olaf; MPR: Matthew P. Richey, St. Olaf; AWR: 
A. Wayne Roberts, Macalester; JAS: J. Arthur Seebach, 
Jr., St. Olaf; LS: Li Sheng, Macalester; LAS: Lynn Arthur 
Steen, St. Olaf; MU: Milton Ulmer, Carleton; PZ: Paul Zorn, 
St. Olaf. 
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Ethnomathematics: A Multicultural View of 
Mathematical Ideas 
Marcia Ascher 
In this fascinating and ground-breaking new book, 
Marcia Ascher introduces the mathematical ideas of 
people in traditional, or “small scale,” cultures, such 
as American Indian tribes, Incas, and various cultures 
in the South Pacific and Africa. As Ascher explores 
mathematical ideas involving numbers, logic, spatial 
configuration, and the organization of these into sys- 
tems and structures, readers will gain both a broader 
understanding of mathematics and an appreciation 
for the ideas of other peoples. 1991. Approximately 
$98 pages. Casebound. ISBN: 0-534-14880-8. 

95. 


Complex Variables, 
Second Edition 
Stephen D. Fisher 


This undergraduate text for mathematics or engineer- 
ing majors offers a direct route to the most important 
topics in the theory and applications of complex vari- 
ables. Hundreds of solved examples, exercises, and 
applications help readers appreciate complex vari- 
ables as both a practical tool and a mathematical 
structure of enormous beauty and elegance. 1990. 
426 pages. Casebound. ISBN: 0-534-13260-X. 


APL with a Mathematical Accent 
Clifford A. Reiter and William R. Jones 


Providing a concise introduction to APL’s powerful 
mathematical, statistical, and graphical capabilities, 
this book helps readers master computational skills 
applicable to linear algebra, statistics, probability, 
numerical analysis, operations research, abstract al- 
gebra, number theory, and mathematical modeling. 
1990. 225 pages. Paperbound. ISBN: 0-534-12864-5. 


Harmonic Analysis 

Henry Helson 

This reprint of the 1983 edition is appropriate for 
graduate students who are familiar with the Lebes- 
gue integral, complex function theory, and Banach 
Spaces. 1991. 192 pages. Paperbound. 

ISBN: 0-534-15570-7. 


A Course in Ring Theory 
Donald S. Passman 


This textbook for a graduate course in ring theory of- 
fers a module theoretic stroll through a mixture of 
commutative and noncommutative ring theory, with 
emphasis on the latter. Projective and injective 
modules supply the framework for a coherent choice 
of material. 1991. 320 pages. Casebound. 

ISBN: 0-534-13776-8. 


Wadsworth & Brooks/Cole 
Advanced Books & Software 


Statistics for Business and Economics 
Debra Olson Oltman and James R. Lackritz 


Offering a classic, non-calculus-based treatment of 
descriptive and inferential statistical methods and 
their application to business problems, this thorough 
new text helps students develop competence in 
selecting and applying decision-making procedures 
they'll need as managers and executives. To em- 
phasize not just problem solving but what needs to 
be done and said after the answer is obtained, the 
authors provide more detailed explanations and more 
interpretation of results than is offered in most 
textbooks. The book includes many real examples, 
problems, and case studies, demonstrating how 
Statistics is used in a wide variety of business-related 
fields. 1991. Approximately 1000 pages. Casebound. 
ISBN: 0-534-14430-6. Accompanied by. Instructor's 
Solutions Manual. Student Solutions Manual. Data 
Disk. EXP-TEST. 


Probability and Statistics for 
Engineering and the Sciences, Third Edition 
Jay L. Devore 


This new edition of Devore’s best-selling text con- 
tinues to use real data in both exercise sets and 
examples and adds a new chapter on Quality Con- 
trol. Devore focuses on the probability models and 
Statistical methods students are most likely to en- 
counter and use in their careers in engineering and 
the natural sciences. 1991. Approximately 704 
pages. Casebound. ISBN: 0-534-14352-0. 


Theory of Point Estimation 

E.L. Lehmann 

A reprint of the 1983 edition. 1991. 506 pages. 
Casebound. ISBN: 0-534-15978-8. 


Testing Statistical Hypotheses, 

Second Edition 

E. L. Lehmann 

A reprint of the 1986 edition. 1991. 603 pages. 
Casebound. ISBN: 0-534-15984-2. 


Statistics with Stata 

Lawrence C. Hamilton 

This book/diskette package guides users through Stu- 
dent Stata, a full-featured graphics-oriented statistical 
package for IBM PCs and compatibles. Statistics 
with Stata can be used at a variety of levels, from in- 
troductory to advanced. The book's focus is on how 
to do the work of data analysis, using Student Stata 
as a power tool. Hamilton illustrates data analysis 
with examples and includes interpretive discussion 
as needed. 1990. 176-page paperbound book plus 
two 5 1/4" disks. ISBN: 0-534-12850-5. 


State-of-the-art software 


New for 1991 
MathWriter™ 2.0: The Scientific Word 


Processor for the Macintosh° 


J.Robert Cooke and E. Ted Sobel 


MathWriter is a complete “what you see is what 
you get” scientific word processing program 
that allows users to enter mathematical expres- 
sions as text, not graphics, so both text and 
mathematics can be edited in the same docu- 
ment. In addition to a spelling checker, 
thesaurus, and automatic hyphenation, the pre- 
gram also automatically sizes and centers 
mathematical symbols, italicizes variables, and 
formats tables and matrices. MathWriter shows 
on-screen renumbering of both equations and 
references to equations and can show both old 
and revised versions of the same document. 
1991. Professional version: $395. 


Reduced-feature Educational Version: $99.95. 


New for 1991 


EXP2TeX: The EXP to TEX Conversion 
Utility, Version 2.0 
Tom Morris 


This exciting new program allows you to com- 
bine the speed and versatility of EXP with the 
typesetting capabilities of TeX! Write with the 
truest “what you see is what you get” scientific 
word processor for the IBM PC, then use 
EXP2TeX to convert your file to a Plain TeX 
source file. If you use the EXP macros included 
with this program, most files will not require any 
editing in TeX. 

1991. $195. 


Wadsworth & Brooks/Cole 
Advanced Books & Software 


New for 1991 


The EXP Font Development Kit, 
Version 2.0 
Donald C. Mullen, Jr. and Simon L. Smith 


This Microsoft® Windows™ 3.0-based program 
allows you to create new symbols and import 
soft fonts into EXP, Version 2.0. The package 
contains the source files for all standard EXP 
fonts in 8-, 10-, and 12-point sizes. 

1991. $150. 


EXP: The Scientific Word Processor, 
Version 2.0 (for IBM PCs and com- 
patibles) 

Simon L. Smith 


Speed, ease of use, and an exceptional ability 
to handle mathematical fonts and symbols have 
made EXP one of today’s most popular “what 
you see is what you get” scientific word proces- 
sors. 


1990. $295. 
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To order a personal copy of software or 
books, use our toll-free number (800) 354- 
9706, or write to us at the address below. 
To request complimentary copies of 
books for review, or to receive a copy of 
our 1991 Advanced Books & Software 
Catalog, please write: 


Wadsworth & Brooks/Cole 


Advanced Books & Software 


MMO001 

511 Forest Lodge Road 
Pacific Grove, CA 93950 
(408) 373-0728 


Pweeeeeeeweeeteeeeeeeeseseeeeeeeeeeeeeeeeeeaeaeeee ee £4 


re | 


When was the last 
time a computer 


program helped 
you think about 
mathematics? 


Call or write for our latest software catalog. 
For PC or Macintosh. 


Lascaux Graphics 3220 Steuben Ave. Bronx, NY 10467 (212) 654-7429 


im FAST Protected Mode Fractal Graphics 

D : ~ Vi A N Plus...Orbit Diagrams & Frame Animation! 
For IBM 386w.387 coprocessor4 MB OF486 

10" Magnification * 175, 000 iterations/second « Frames generated average « < 2m min. + TIFF, Laserdett, etc. 


Software Only with Source Code DI-MAT gA4ge DI- GRAPH. $139.25 
' U1 $ 49.95 5 4 ooee FAST 32-bit Protected Mode Affordable Hi-res Scientific Plotting 
DI-GRAPH Q $1 29. 95 a $4 89. 95 | | Assembly Matrix Routines | [Presentation quallty - IBM compatible 
Shipping/Handling: $5.00 Each Item 
== DAMASKOS, Inc. Tel. 215-358-0200 
PA residents ad? 6% sales tax Make check payalde to: P.O. Box 469 » Concordville, PA 19331 * FAX 215-558-1019 


Experience 
Counts! 


New for 1991 
Practical Business Math 


Procedures, 3/e 
Slater 


Practical Business Math 


ee Procedures, Brief Edition 
Slater 


Practical Business Math 
Procedures, Brief Edition, 


Electronic Calculator Version 
Slater 


Jeff Slater’s latest texts tailored to meet 


your needs. 
Also Available in ’91 New in '90 
Collegiate Business Arithmetic Review for 
Mathematics, 5/e Business 
Berston & Fisher Berston & Fisher 


The Stancl & Stancil Floor Plan 


Mathematics for Brief Calculus for 
Management and the Life Management and the Life 
and Social Sciences and Social Sciences, 2/e 
Stancl & Stancil Stancl & Stancil 


Calculus for Management 
and the Life and Social 
Sciences, 2/e 

Stancl & Stancl 


For more information or an examination copy, write or call your IRWIN 
representative or Faculty Service at 1-800-323-4560. In Canada, 
call 1-800-268-4178 or 1-416-298-9532. 


IR WIN 1818 Ridge Road, Homewood, IL 60430 


DEVELOPMENTAL 
MATHEMATICS 


Cass and O’Connor’s 


Fundamentals with Elements of 
Algebra: A Bridge to College 
Mathematics 


Proga’s 
Basic Mathematics, Third Edition 
ALGEBRA 


Boye, Kavanaugh, and Williams’ 
Elementary Algebra 
Intermediate Algebra 


Gobran's 
Beginning Algebra, Fifth Edition 
GEOMETRY 


Geltner and Peterson's 


Geometry for College Students, 
Second Edition 


PRECALCULUS 
Kaufmann’s 

Precalculus, Second Edition 
CALCULUS 


Dick and Patton's 
Calculus, Volume 1 


_—_— 


KEEPING PACE WITH YOUR CHANGING 
CURRICULUM NEEDS 


New and revised titles from PWS-KENT Publishing Company 


Pence’s 
Calculus Activities for the TI-81 


Swokowski's 
Calculus, Fifth Edition (formerly the 
Alternate Edition) 

Calculus of a Single Variable 
Wiggins’ 

Problem Solver for Finite 
Mathematics and Calculus 
DISCRETE MATHEMATICS 


Fletcher, Hoyle, and Patty's 


Foundations of Discrete 
Mathematics 


POST-CALCULUS 


Cullen's 

Linear Algebra and Differential 
Equations, Second Edition 
ADVANCED MATHEMATICS 


Ehrlich’s 
Fundamental Concepts of Abstract 
Algebra 
Ruckle’s 


Modern Analysis: Measure Theory 
and Functional Analysis with 
Applications 


CALL TOLL-FREE 1-800-343-2204 
(in MA 617-542-3377) 
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PWS-KENT Publishing Company 
20 Park Plaza 


Boston, MA 02116 
és 


Partners in 


Wadsworth, Inc. 
ducation 


Just Published from HBJ! 


INTRODUCTORY ALGEBRA 
and 

INTERMEDIATE ALGEBRA 

RONALD HATTON and 

GENE R. SELLERS 

both of Sacramento City College 

Each Paperbound, 1991 


ELEMENTARY ALGEBRA 


Second Edition 

and 
INTERMEDIATE ALGEBRA 
Second Edition 
YOSHIKO YAMATO and 
MARY JANE CORDON 
both of Pasadena City College 
Each Hardcover, 1991 


ALGEBRA FOR 
COLLEGE STUDENTS 


Third Edition 
BERNARD KOLMAN, 
Drexel University, and 
ARNOLD SHAPIRO 
Hardcover, 1991 


FINITE MATHEMATICS 
WILLIAM OWEN and 
FRED CUTLIP 

both of Central Washington 
University 

Hardcover, 1991 


When you asked for a 
sensible, no-frills, main- 
stream calculus textbook at 
a reduced cost for your 
students, we heard you... 


CALCULUS 

One and Several Variables 
ROBERT ELLIS and 
DENNY GULICK 

both of University of Maryland 
Hardcover, 1991 $29.95 (Net)! 


SINGLE-VARIABLE CALCULUS 
WITH DISCRETE MATHEMATICS 
JOHN A. FEROE and 
CHARLES STEINHORN 

both of Vassar College 

Hardcover, 1991 


ELEMENTARY LINEAR ALGEBRA 
WITH APPLICATIONS 


Second Edition 

RICHARD O. HILL, JR.., 
Michigan State University 
Hardcover, 1991 


ABSTRACT ALGEBRA 
DENNIS KLETZING, 
Stetson University 
Hardcover, 1991 


HARCOURT BRACE JOVANOVICH, INC. 
III College Sales Office 
7555 Caldwell Avenue, Chicago, IL 60648 


(708) 647-8822 
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User D: EXAMPLE .ATH Free :97~% Derive Algebra 


2000 years of mathematical knowledge ona disk 


DERIVE, the successor to muMATH,, is a powerful computer algebra system 
for your PC compatible computer that provides the following capabilities: 

@ Exact and approximate arithmetic to thousands of digits 

&@ Equations, complex numbers, trigonometry, calculus, vectors, and matrices 
@ 2D and 3D function plotting with zooming capability 

@ MDA, CGA, EGA, VGA, and Hercules graphics and text support 

@ Attractive 2D mathematical display of formulas 

@ Easy to use menu-driven interface with on-line help 

@ Ideal for engineers, scientists, students and teachers 

@ $200 plus shipping: Call or write for information. 


System requirements: IBM PC or compatible computer, MS-DOS version 2.1 or later, 512K 
memory, and a 54 inch (360K) or a 3! inch (760K) diskette drive. Or NEC PC-9801 or 
compatible computer, MS-DOS version 2.1 or later, 512K memory, and a 514 inch (640K) 
diskette drive. 


DERIVE and muMATH are trademarks of Soft Warehouse, Inc. Hercules is a ttademark of Hercules Computer Technology, Inc. 
IBM is a registered trademark of International Business Machines Corp. MS-DOS is a registered trademark of Microsoft Corp. 
NEC is a registered trademark of Nippon Electric Company. 


Soft Warchousc: 


3615 Harding Avenue, Suite 505 « Honolulu, Hi 96816 
eS (808) 734-5801 after noon PST 
©1988 Soft Warehouse, Inc. Handcrafted software for the mind. 


Revised 
and 
Updated 


THE LAST PROBLEM 


E. T. Bell 
Revised and updated by Underwood Dudley 


What Eric Temple Bell calls the last prob- 
lem is the problem of showing that Pierre 
Fermat was not mistaken when he wrote 
in the margin of a book, almost 350 years 
ago, that 2” + y” = z” has no solution in 
positive integers when n > 3. The orig- 
inal text of THE LAST PROBLEM traced 
the problem from Babylonia in 2000 B.C. 
to seventeenth-century France. Along the 
way we learn quite a bit about history, and 
just as much about mathematics. Under- 
wood Dudley's notes bring us up-to-date on 
recent attempts to solve the problem. 


The book is unique in that it is a biogra- 
phy of a famous problem. The book fits 
no categories. It is not a book of mathe- 
matics. Pages go by without an equation 
appearing. It is not a history of number the- 
ory because it includes too much about the 
history of the western world, and it is not 
a history of western civilization because its 
focus is on mathematics. It is too entertain- 
ing to be scholarly and contains too much 
mathematics to be widely popular. It is an 
unusual book. . 


What T.A.A. Broadbent said about Bell’s 
work applies to THE LAST PROBLEM. 
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His style is clear and exuberant, his 
opinions, whether we agree with them 
or not, are expressed forcefully, often 
with humor and a little gentle malice. 
He was no uncritical hero-worshipper, 
being as quick to mark the opportunity 
lost as the ground gained, so that from 
his books we get a vision of mathemat- 
ics as a high activity of the questing 
human mind, often fallible, but always 
pressing on the neverending search for 
mathematical truth. 


This is a rich and varied, wide-ranging book, 
written with force and vigor by someone with 
a distinctive style and point of view. It will 
provide hours of enjoyable reading for any- 
one interested in mathematics. 


328 pp., Paperbound, 1990 
ISBN-0-88385-451 -1 


List: $17.50 MAA Member: $13.50 
Catalog Number TLP 


ORDER FROM 


@ Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D. C. 20036 


Two Books On 
Mathematical 
Ingenuity By 
Ross Honsberger | 


Ross Honsberger is the author of seven 
books in the Dolciani Mathematical Expo- 
sition series, each of which presents prob- 
lems from algebra, arithmetic, number the- 
ory, probability, and geometry, and provides 
ingenious solutions and/or intriguing results. 
His most recent addition to the series is 
MORE MATHEMATICAL MORSELS which 
is a continuation of the earlier Mathematical 
Morsels volume published in 1979. Now is 
your opportunity to own both of these ex- 
cellent books. The problems presented are 
meant to be enjoyed, rather than instruct, al- 
though instruction is almost always the au- 
tomatic by-product. 


Honsberger says in the Preface of his first 
volume of morsels, “Mathematics abounds 
in bright ideas. No matter how long and 
hard one pursues her, mathematics never 
seems to run out of exciting surprises. And 
by no means are these gems to be found 
only in difficult work at an advanced level. 
All kinds of simple notions are full of ingenu- 
ity. The present volume discusses scores of 
elementary problems and contains dozens 
of marvellous ideas.” 


All of the problems are accessible to any- 
one with a knowledge of freshman mathe- 
matics. Buy both of these excellent books 
today, and share them with your students. 


This is an excellent text..lt exposes 
gifted undergraduate students to a set 
of elementary problems...whose proofs 
are ingenious, usually containing beau- 
tiful ideas...Its lucid expository style 
and stimulating mathematical content 
make its reading a rewarding experi- 
ence for mathematicians at all levels. 
(about Mathematical Morsels) 


M.S. Cheema in Mathematical Reviews 


MATHEMATICAL ASSOCIATION OF AMERICA 
Doleiani Machemaucal Expasiions No.10 


...even the more advanced theoretician 
will find many of the problems fascinat- 


ing... 
D.T. Swift-Hook in Physics Education 
(about Mathematical Morsels) 


NEW NEW NEW NEW NEW NEW 


MORE MATHEMATICAL MORSELS 
Ross Honsberger 


315 pp., Paperbound, 1990, 
ISBN 0-88385-313-2 


List: $16.00 MAA Member: $13.50 
Catalog Number DOL-10 


AN MAA BESTSELLER 


MATHEMATICAL MORSELS 
Ross Honsberger 


249 pp., Hardbound, 1979 
ISBN 0-88385-303-5 


List: $ 28.00 MAA Member: $ 21.00 
Catalog Number DOL-03 


Both volumes purchased now 
List: $40.00 MAA Member: $29.00 
Catalog Number DOL-310 


ORDER FROM 


Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D. C. 20036 


THE AMERICAN 
MATHEMATICAL MONTHLY 
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TELEGRAPHIC REVIEWS 


NOTICE TO AUTHORS 


See statement of editorial policy (volume 89, p. 3). Follow the format in current issues. Put your full mailing 
address in a line at the head of the paper, following the title and the author’s name. Send three copies of the 
manuscript, legibly typewritten on only one side of the paper, double-spaced with wide margins, and keep one as 
protection against loss. Prepare illustrations carefully, on separate sheets of paper in black ink, the original 
without lettering and two copies with lettering added. Rough copies of the illustrations should be included in the 
manuscript at the appropriate places. Supply the full five-symbol Mathematics Subject Classification number, as 
described in Mathematical Reviews, 1985 and later. (This is necessary for indexing purposes.) 

All manuscripts whose length is more than 7 manuscript pages should be sent to HERBERT S. WILF, 
Department of Mathematics, University of Pennsylvania, Philadelphia, PA 19104. Shorter articles about the 
teaching of mathematics should be sent to either MELVIN HENRIKSEN (linear algebra, algebra, differential 
equations), Department of Mathematics, Harvey Mudd College, Claremont, CA 91711, or STAN WAGON 
(calculus, analysis, number theory, discrete mathematics, statistics), Department of Mathematics, Macalester 
College, St. Paul, MN 55105. Other shorter articles should be submitted as follows: in algebra, discrete 
mathematics, or probability, to RODICA SIMION, Department of Mathematics, George Washington University, 
Washington, DC 20052; in geometry or topology to DENNIS DETURCK, Department of Mathematics, Univer- 
sity of Pennsylvania, Philadelphia, PA 19104; in analysis to DAVID J. HALLENBECK, Department of Mathe- 
matical Sciences, University of Delaware, Newark, DE 19716. Papers in other fields may be sent to any of the 
editors; however, do not send a paper to more than one editor. Proposed problems or solutions should be sent to 
PAUL T. BATEMAN, Montu_y Problems, Department of Mathematics, University of Illinois, 1409 West Green 
Street, Urbana, IL 61801; unsolved problems to RICHARD GUY, Department of Mathematics and Statistics, 
University of Calgary, Alberta, Canada T2N 1N4. 


EDITOR 
HERBERT S. WILF 


ASSOCIATE EDITORS 


PAUL T. BATEMAN DAVID J. HALLENBECK RODICA SIMION 
DONNA BEERS PAUL R. HALMOS ANITA EB. SOLOW 
DENNIS DETURCK MELVIN HENRIKSEN JOEL SPENCER 
HAROLD G. DIAMOND JOAN P. HUTCHINSON LYNN A. STEEN 
JOHN DIXON WILLIAM M. KANTOR KENNETH B. STOLARSKY 
J.H. EWING JOSEPH KONHAUSER STAN WAGON 
RICHARD GUY RICHARD LIBERA DOUGLAS B. WEST 
LEE A. RUBEL 


Reprint Permission: Marcia P. Swarpb, Executive Director, Mathematical Association of America, 1529 
Eighteenth Street, N.W., Washington, DC 20036. 

Advertising Correspondence: Ms. ELAINE PEpREIRA, Advertising Manager, Mathematical Association of 
America, 1529 Eighteenth Street, N.W., Washington, DC 20036. 

Subscription correspondence, changes of address, and inquiries about nondelivered or defective issues: 
Membership/Subscriptions Department, Mathematical Association of America, 1529 Eighteenth Street, N.W., 
Washington, DC 20036. 

Microfilm Editions: University Microfilms International, Serial Bid Coordinator, 300 North Zeeb Road, Ann 
Arbor, MI 48106. 


The AMERICAN MATHEMATICAL MONTHLY (ISSN 0002-9890) is published monthly except bimonthly June-July and 
August-September by the Mathematical Association of America at 1529 Eighteenth Street, N.W., Washington, DC 20036 
and Montpelier, VT. 

The annual subscription price of $32 for the AMERICAN MATHEMATICAL MON'IHLY to an individual member of the 
Association is included as part of the annual dues. (Annual dues for regular members, exclusive of annual subscription 
prices for MAA journals, are $64. Student and unemployed members receive a 66% dues discount; emeritus members 
receive a 50% dues discount; and new members receive a 34% dues discount for the first two years of membership.) The 
nonmember /library subscription price is $128 per year. 

Copyright © by the Mathematical Association of America (Incorporated), 1991, including rights to this journal issue 
as a whole and, except where otherwise noted, rights to each individual contribution. General permission is granted to 
Institutional Members of the MAA for noncommercial reproduction in limited quantities of individual articles (in whole 
or in part) provided a complete reference is made to the source. 

Second class postage paid at Washington, DC, and additional mailing offices. 

Postmaster: Send address changes to the AMERICAN MATHEMATICAL MONTHLY, Membership/Subscriptions Depart- 
ment, Mathematical Association of America, 1529 Eighteenth Street, N.W., Washington, DC 20036-1385. 

PRINTED IN THE UNITED STATES O1 AMERICA 


How Things Float 


E. N. GILBERT, AT & T Bell Laboratories, Murray Hill, New Jersey 07974 


EpDGAR N. GILBERT received the Ph.D. in mathematics from M.I.T. in 1948. 
Since then he has been a member of the Mathematics and Statistics 
Research Center at AT & T Bell Laboratories. His main interests have been 
communication theory, queueing and other applications of probability, 
routing problems, and acoustics. 


ABSTRACT S. M. Ulam once asked if spheres are the only homogeneous bodies that can float in 
every orientation. Here an affirmative answer is given for a special class of bodies of revolution that 
seemed likely to contain counterexamples. Traditionally, the floating bodies of most interest have been 
ships, which are not homogeneous. However, homogeneous bodies are found to float in surprising ways. 
Bodies with symmetry often float asymmetrically. Thus, circular cylinders can float with the axis of 
symmetry tilted away from vertical at strange angles. A cube or regular tetrahedron can float with 
neither a vertex, an edge center, nor a face center down. Indeed, like the circular cylinder, a cube may 
have infinitely many indifferently stable floating orientations that are not isolated, but form a 
one-parameter family. 


1. Introduction. The stable orientations of a floating body present a classic 
problem of hydrostatics. Naval architects use a well-known stability condition 
(Theorem 1 below) to design ship hulls that float stably under a variety of loading 
conditions. With the exception of icebergs (Section 4), floating homogeneous 
bodies have little practical importance. However they have interesting mathemati- 
cal properties. A simple law connects the stable floating positions of congruent 
homogeneous bodies of densities p and 1 — p (Theorem 2). Spheres may be the 
only homogeneous bodies that can float in all orientations (Ulam [8]), but that has 
never been proved or disproved (Section 3). Theorem 4 proves it for a special class 
of bodies. A homogeneous body with an axis of rotational symmetry always has an 
equilibrium orientation with the axis vertical, but the equilibrium may be unstable. 
For example, only the shortest semiaxes of a homogeneous ellipsoid provide stable 
equilibria (Section 6). Depending on its dimensions and density, a circular cylinder 
may float with its axis vertical, horizontal, or tilted at some special angle away from 
vertical (Section 8). Regular polyhedra have many symmetry axes (through vertices, 
face centers, and edge centers) but, again, these orientations need not be stable. 
All of them are unstable for a tetrahedron of density 1/2 and so this tetrahedron 
floats in a way that is not easily guessed (Section 5). Likewise, for some values of p, 
the cube floats in an asymmetric way. In fact, like many circular cylinders, some 
cubes do not float in isolated stable orientations; instead they have infinitely many 
indifferently stable orientations, forming a one-parameter family (Section 10). 


2. Potential and metacenter. It is convenient to use coordinates fixed in the 
body. The problem then is to predict where the waterline will be on the body when 
it is floating stably. Let m be a unit normal to the water surface, directed up from 
the water. It suffices to find n since then Archimedes’ principle determines the 
waterline. 

Stable positions for a floating body occur at minima of a potential energy 
function U(n). U(n) is the work done to bring the body into a position with n 
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upward and with the waterline at the level where the body displaces its own weight 
in water. The work done against gravity is mgS, where m is the mass of the body, 
g the acceleration of gravity, and S, the height of the center of gravity G of the 
body. The work done against buoyancy forces is just the work required to raise the 
particles of the displaced water up to the water level. This work is mgS, where S, 
is the depth of the centroid B of the submerged part of the body. B is called the 
center of buoyancy. Thus 


U(n) = mg(S, + S,) = mgn:-(G—B). (1) 


Since the gravity force acts downward from the center of gravity and the 
buoyancy force acts upward from the center of buoyancy, the line joining B and G 
must have the vertical direction n in equilibrium. Some equilibrium directions are 
maxima or saddle points of the potential function U(n) and represent unstable 
equilibrium orientations of the body. 

If n) is an equilibrium direction, the behavior of S(n) =n -(G — B) near ny 
determines the type of equilibrium. To perturb n away from np, rotate the body 
through a small angle a about an axis with unit direction vector ¢ normal to no. 
Then n =n, cosa + (ny X t)sin a. If S(m) has continuous partial derivatives of 
order 3 at No, 


S(n) = S(ny) + const a? + O(a’). 


Buoyancy forces will tend to move the body back to orientation n, if the constant 
in the quadratic term is positive, or further away from n, if the constant is 
negative. The main theorem about floating bodies expresses this constant in terms 
of a point M, called the metacenter, which is defined as follows. 

For a given equilibrium direction n,, let V denote the volume of the submerged 
part of the body and let A denote the two-dimensional figure in which the plane of 
the waterline intersects the body. Let J denote the moment of inertia of A about 
an axis with direction ¢ (t - nm) = 0) and passing through the centroid of A. Then 
the metacenter for ny and ¢ is the point 


M=B+n)I/V (2) 
lying on the (vertical) line through G and B at height J/V above B. 


THEOREM 1. When n is rotated away from equilibrium direction no, through a 
small angle a about axis t, 
2 


Buoyancy forces act to move the body back to orientation ny if M lies above G and 
away from ng if M lies below G. 


n-(G—B) =S(n) = S(n,) + + O(a3). (3) 


The idea of metacenter and its connection with stability date back to Huyghens. 
Theorem 1 appears in many books on mechanics (({1], [9]) and naval architecture 
([6]). 

Since the metacenter depends on ¢, an equilibrium orientation 1, may be stable 
with respect to some perturbation axes ¢ and unstable for others (saddle-point 
equilibrium). If the principal moments of inertia of A about its centroid are J, and 
I,, with J, < Ll, let the metacenters for rotations about the corresponding princi- 
pal axes be M, and M,. Then n, is a minimum (stable point), saddle point, or 
maximum of S(7) if G lies below M,, between M, and M,, or above M,. In naval 
architecture M, and M, are the metacenters for rolling and pitching. 
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B, M,, and M, are determined entirely by the shape of the submerged part of 
the body but G may be moved about by redistributing cargo. To make a ship float 
stably in a particular orientation n,, its mass must be distributed to put G below 
M,. For instance a small boat, originally stable, can become unstable if its 
passenger stands up. A vessel partly filled w.h liquid (a tanker, leaky ship, or 
half-empty bottle) requires a more complicated treatment; its cargo includes liquid 
that moves as the body is perturbed. 

In what follows the body is a solid made from homogeneous material of density 
p <1. This is not an important case for naval architecture but it contains some 
mathematical curiosities. Symmetry axes of bodies like regular polyhedra, cylin- 
ders, or cones are obvious equilibrium directions. The kinds of stability that these 
bodies exhibit are often surprising. 


3. Indifferent stability. Because of its symmetry, a homogeneous sphere has 
the same potential U(n) = mgn -(G — B) for all orientations n. Every n is then 
an equilibrium direction. But these equilibria are indifferently stable. That is, the 
perturbed body tends neither to return to its original orientation nor to move away 
from it. Likewise a long circular cylinder has the same potential U(n) for all 
directions n normal to the cylinder’s axis. These directions are equilibria, indiffer- 
ently stable with respect to rolling although stable with respect to pitching. 

Ulam [8] asked, “A solid S of uniform density p has the property that it will 
float in equilibrium (without turning) in water in every orientation. Must S be a 
sphere?”” No nonspherical body with this property is known. H. Auerbach [2] 
settled a similar problem for cylindrical bodies by constructing noncircular cylin- 
ders that float indifferently stably with respect to rolling, in the manner of a 
circular cylinder. FiGurE 1 is the cross section of one of Auerbach’s indifferently 
stable cylinders. Chords drawn in FiGure 1 represent waterlines for different 
floating positions. These chords cut the figure in two halves of equal area 
(p = 1/2). Different heart-shaped cross sections work for other densities and 
there are other solutions that are not heart-shaped. 

For a body to have constant U(“), as Ulam required, (1) shows B = G — hn 
with # a constant. As n varies, the center of buoyancy remains on a sphere of 


Fic. 1. Cross section of indifferently stable floating cylinder of density 1/2. Chords are possible 
waterlines. Centers of buoyancy B lie on a circle about G. 
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radius / and center G. Moreover, (3) requires M = G and then (2) shows I = AV, 
a constant for all m and ¢. 

In Auerbach’s cylinder problem, the same reasoning requires that centers of 
buoyancy lie on a circle, as FiGuURE 1 shows. To achieve constant moment of inertia 
I for rolling, the waterline chords in FiGureE 1 all have the same length. These 
constraints supply the differential equation that determined the shape of FiGuRE 


1. 
When op is small, the body floats with small submerged volume and, in the limit 


as p — 0, as though resting on a flat table. Ulam also asked, “If a body rests in 
equilibrium in every position on a flat horizontal surface is it a sphere?” The 
answer to that question must be “yes,” at least for bodies that touch the flat 
surface at only one point P(n). For, as p > 0, B must approach P(n); then 
boundary points P(7) all lie on a sphere. For a purely geometric argument, not 
using the potential or metacenter, see Montejano [4]. 


4. Capsized bodies. The stable equilibrium positions of a floating body made 
from homogeneous material of density p will ordinarily depend on p. If equilibria 
are known for p < 1/2, the following theorem will immediately supply equilibria 
for p > 1/2. 

THEOREM 2. Consider a homogeneous floating body that may have either density 


p or p' =1-— p. Let the potential functions of the body with densities p and p’ be 
U(n) and U"(n). Then U'"(~n) = U(n). 


Proof. Let the body have volume K. The body with density p or p' displaces 
water volumes V = pK or V’' = p'K. For a given direction vector n let B denote 
the center of buoyancy of the body with density p. Write B’' for the centroid of the 
part of the body above the water level. This part has volume K — V = V’ and so B' 
is the center of buoyancy when the body with density p’ is oriented with direction 


vector —N. 
From KG = VB + V’'B' one finds V(G — B) + V'(G — B’) = 0 and then 


—-n:(G-—B')V'’=n:-(G—B)V. (4) 
Now (1) and (4) show that U’(—n) = U(n). @ 


An immediate consequence of Theorem 2 is 

CorROLLARY. If no is an equilibrium direction for a body of density p then —Ng is 
one for the same body with density 1 — p. The two equilibria are alike (both minima, 
both maxima, or both saddle points). Bodies with p = 1/2 float as stably capsized as 
right-side-up. 


FiGureE 2 illustrates the corollary by showing stable positions for a body with 
densities p and 1 — p. 


Fic. 2. Stable floating positions for congruent homogeneous bodies of density p and p’ = 1 — p. 
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(a) (b) 


Fic. 3. Two stable orientations of truncated spheres. 


For another example, consider a sphere truncated by a plane cut. In one 
equilibrium position the plane cut is horizontal and above the water level (Figure 
3a). The submerged part of the body is spherical and so, as Section 2 remarked, 
the metacenter M lies at the center of the sphere. Truncating the sphere has 
moved G from the center M of the sphere to a lower point. Then this equilibrium 
is stable for any p. But now the corollary shows that truncated spheres of all 
densities also float stably when capsized, with the cut submerged (Figure 3b). 

The corollary can help show how icebergs float. Ice has density about p = .9 
relative to sea water. Consider, instead, a fictitious iceberg with density p’ = 1 — p 
= .1. This light body may be expected to float in nearly the orientation it would 
assume resting on a flat plane. Then the real iceberg floats in the capsized 
orientation. For example, an iceberg with a large flat face is likely to float face-up. 
Of course, as Figure 3b illustrates, it may also float in other orientations, although 
not as stably. 


5. Counting equilibria. In looking for equilibrium points a useful identity 
relating the numbers of maxima, minima, and saddle points is 


N(max) + N(min) = M(saddle) + 2, (5) 


where N(--: ) is read “number of...”. Equation (5) is a topological identity that 
applies to functions U(n) with continuous second derivatives, defined on the 
sphere (see [5] and [7]). It assumes equilibrium points are isolated and only of the 
three kinds indicated. Thus, near an equilibrium point, U(m) must agree to terms 
of second order with a non-degenerate quadratic form in local coordinates. 

The identity provides a check that all equilibrium points have been found and 
properly classified. For example, the directions of the six semiaxes of a homoge- 
neous ellipsoid are obvious equilibrium points. In Section 6, these are shown to be 
the only equilibria; there are two maxima, two minima, and two saddle points, in 
agreement with (5). 

The identity (5) has an interesting application to the equilibria of a floating 
regular polyhedron. The directions from the polyhedron’s center to the vertices, 
face centers, and edge centers are obvious equilibrium directions because of 
symmetry. Indeed, in the limit as p — O, the body can touch the water surface only 
on a face, edge, or vertex, and these equilibria are the only possibilities. Moreover 
their numbers satisfy Euler’s identity 


N(faces) + N(vertices) = N(edges) + 2. (6) 
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With the light regular polyhedron resting on the water surface as though on a 
table, each face, vertex, or edge provides a minimum, maximum, or saddle point of 
U(n) (this need not be true of irregular polyhedra) and (5) is an instance of (6). 

One might expect directions along axes of symmetry to be the only equilibria 
even when p > 0. Axes through faces and vertices are axes of 3-fold symmetry or 
more; then M, = M, for these axes and their equilibria can be only maxima or 
minima. Axes through edge centers have only 2-fold symmetry and so might give 
the saddle-point equilibria that (5) demands. However, the situation can be more 
complicated. A regular tetrahedron with p = 1/2 provides a quick example. The 
corollary to Theorem 2 shows that it floats as stably face-down as vertex-down; 
indeed the metacenters, obtained from (2), show both orientations to be maxima of 
U(n). Moreover the orientations with edge down also turn out to be maxima. Then 
other orientations, not along symmetry axes, will minimize U(n) and even more 
will provide saddle points. 

Figure 4a will help find the missing equilibria. It is convenient to represent 
each n by the point where the ray from G, with direction n, crosses a face of the 
tetrahedron (gnomonic projection). The corollary to Theorem 2 states that equilib- 
rium directions come in pairs, represented by diametrically opposite points, such 
as vertex P, and face center F’,, or the edge centers E and E’. The plane of P,, E’, 
P;, F, E, and G is a plane of mirror symmetry of the tetrahedron. Then maxima 
and minima of U(n), with n restricted to vectors lying in the plane, occur only at 
equilibrium points. Since P,, F, and EF are maxima there must also be two minima, 
Q between P, and F, and R between F and E. To locate Q exactly, note that the 
plane of P,, E', P,, E is another plane of mirror symmetry, and hence a waterline 


(b) 


Fic. 4. Equilibrium points and equipotential lines on a regular tetrahedron of density 1 /2. 
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of an equilibrium orientation of the tetrahedron. The normal to this plane is 
parallel to edge P,P, and represented by Q, the midpoint of the dotted altitude 
PE of face P,P, P,. A metacenter calculation shows Q to be a stable point. The 
point R is more complicated but, lying between F and E, R is diametrically 
Opposite a point R’ on edge P,P,. Thus the missing equilibria include 12 minima 
like Q, on midpoints of face-altitudes, 12 points like R, and 12 like R’. Equation 
(5) will be satisfied if R and R’' are saddle points and there are no further 
equilibria. The laborious calculations, required to locate R and R’, find their type 
of equilibrium, and verify that all equilibria have been found, have not been made. 
Figure 4b is only an artist’s conception of the lines of constant U(n) on a face. 

The stable orientations of a tetrahedron of density 1/2 are related to symmetry 
because they hold one edge vertical. However, with p slightly different from 1/2, 
these orientations are no longer equilibria and the body must float asymmetrically. 


6. Ellipsoid. The stability of a floating ellipsoid will be determined by first 
showing how compressing a body affects its stability. Consider any floating body F 
made of homogeneous material of density p. If n, is an equilibrium position for F, 
introduce cartesian body-coordinates x, y, z with the z-axis in the direction n,. A 
transformation 


x'=ax, y=by, z'=cz (7) 
compresses F into a body F’. Volumes V and V’ of corresponding regions in F 
and F’ are related by 


V' = abel. 


If F' is also made of homogeneous material of density p, the compression (7) maps 
the waterline of F to the waterline of F’ for the same direction n,. Centers of 
gravity G and buoyancy B of F also map to the corresponding centers G’ and B’ 
of F’. Then v1, is also an equilibrium direction of F’. However F and F’ may have 
different types of equilibrium at n, because (7) does not map metacenters into 
metacenters. 

Let M and M’ be the metacenters of F and F’ for small rotations of n about 
the x-axis. These metacenters are related because the waterline figures A, A’ of 
F, F' have moments of inertia J, I' satisfying I' = ab°I. Then (2) shows 


b2 
ny’ (M' — B') =I'/V' = ab?I/(abcV) = ho’ (M-—B). 


The stability condition 0 <n, -(M’ — G’) for F' is equivalent to n, -(G' — B’) < 
n° (M' — B'), or 
2 . _ 
pinto CM ~ 8) > 1. (8) 
cn): (G—B) 

Now, in particular, let / be a unit sphere so that the compression (7) produces 
an ellipsoid F’, with semiaxes a, b,c, floating with c-semiaxis vertical. Because the 
sphere is indifferently stable, (3) shows that all its metacenters M coincide with G. 
The condition (8) for stability with respect to small rotations about the a-semiaxis, 
starting from a configuration with n, along a c-semiaxis, reduces to c <b. 
Similarly F’ will be stable with respect to small rotations about a b-semiaxis if 
c <a. It follows that the shortest semiaxes of F’ are stable directions n, the 
longest semiaxes maximize the potential U(n), and the middle-sized semiaxes 
provide saddle points. 
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If the semiaxes a,b,c of an ellipsoid F’ are all different, they provide six 
equilibrium directions, two each of minima, maxima, and saddle points. Equation 
(5) suggests that these may be the only equilibrium directions. A proof that there 
are no other equilibrium directions follows. 

Consider the ellipsoid (x'/a)* + (y'/b)? + (z'/c)? = 1 obtained from a unit 
sphere x7 + y* + z* = 1 by the compression (7). Suppose a < b < c. Imagine the 
ellipsoid floating with upward direction n’ = Q)’, wp’, v’); let A'x' + py’ + v'z' = d' 
denote the water surface. On the unit sphere the corresponding water surface 1s 
Ax + py +vz=d where n=(A,y,v) is a unit vector with the direction of 
(\'a, w'b, v'c). The center of buoyancy B of the sphere lies below the origin G, in 
the direction of —n, say B = —(BA’a, Bu'b, wv'c). Since affine transformations 
carry centroids into centroids, the center of buoyancy of the ellipsoid is at 


B’ = —(Ba*d', Bb*u', Bc?v'). 
In order for n' to be an equilibrium direction of the ellipsoid, mn’ and B’ must have 


the same direction. Since a’, b”, c” are all different, the only possibilities for n' are 
(+1,0,0), (0, + 1,0), (0,0, + 1), the semiaxis directions. 


7. Body of revolution. For bodies of revolution, U(n) simplifies to a function 
u(O) of just the angle 6 between n and the axis a of revolution (0 < 0 < 7). The 
axial directions nm = +a are obvious equilibria. Other equilibrium directions n are 
not isolated, but lie on circles of constant 6. Then (5) does not apply. 

For equilibria other than n = +a it is convenient to use the term pitch for a 
rotation about a horizontal axis p that is normal to a. Then a roll is a rotation 
about a horizontal axis normal to p, i.e., an axis in the plane of n and a. Because 
the plane of m and a is a plane of mirror symmetry of the waterline figure A, the 
pitch and roll axes are principal directions for A. Then M, and M, are the 
metacenters for pitching and rolling (although perhaps not in that order). 


THEOREM 3. Consider a homogeneous floating body of revolution with axial 
direction vector a. At an equilibrium direction n, # +a, the roll metacenter M, is at 
the center of gravity G. 


Proof. Introduce body coordinates x, y, z with z axis along the axis of rotation 
and x-axis in the plane of a and ny. Write ny: a=cosé,). Then ny = 
(sin 6), 0, cos @,). A roll is a rotation about the x-axis. A roll of size a moves the 
vertical direction from n, to n = (sin 6), cos 0, sin a, cos 6, cos @). Then n+ a = 
cos 0 = cos 6, cos a and, for small a, 6 = 0, + (a7/2)cot 6) + O(a‘). The poten- 
tial at n is 


du 
U(n) = u(8) = uO) + (8 — A) + O(|@ — |”). 


At ny, du/dé = 0 because n, is an equilibrium direction. Then U(n) = U(n,) + 
O89 — 6,|7) = U(n,) + O(a*) and (3) shows M, = G. & 


Theorem 3 shows that a body of revolution is indifferently stable with respect to 
small roll perturbations. This result can also be deduced from the observation that 
the body is indifferently stable to rotations about its axis a of symmetry. For the 
sake of simplicity, an equilibrium of a body of revolution will be called stable if it is 
pitch-stable. 


CoROLLARY. An equilibrium ny # +a of a body of revolution is stable iff I, and 
I, the moments of inertia of A for pitching and rolling, satisfy I, < I,. 


The corollary follows immediately from (2) and Theorems 1 and 3. 
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An ellipsoid with two semiaxes equal is an ellipsoid of revolution, a spheroid. 
The results of Section 6 show that the rotation axis a must be either vertical (6 = 0 
or 7) or horizontal (6 = 7/2) in any equilibrium position of a spheroid. Oblate 
spheroids are stable with a vertical and prolate spheroids are stable with a 
horizontal. Other bodies of revolution can have more complicated equilibria. The 
circular cylinder of the next section is a typical example. 


8. Cylinder. In this section the body is a circular cylinder of radius R and 
height H. 

In one equilibrium position the cylinder floats on end, with 0 = 0 or 6 =7 in 
the notation of Section 7. A simple calculation of the pitch metacenter shows that 
this orientation is stable if 

1 Hy" o} 9 
— — < —, 
oi) (=) <5 (9) 


In another equilibrium the cylinder floats on its side, with 6 = 7/2. The 
waterline figure A is a rectangle, with sides H and w, where Archimedes’ 
principle determines w as a root of 


WwW Ww ' w \? 
aresIn TR OR Gr, = pT. 


The corollary to Theorem 3 shows that this equilibrium is stable if 
w <dH. (10) 


Both (9) and (10) may hold if p is small, only (9) if H/R is small, only (10) if 
H/R is large, and for some parameter values (e.g., p = 1/2, H/R = 1.7) neither 
(9) nor (10) hold. Then some cylinders must float with axis tilted at other angles @, 
not 0, 7/2, or 7. There are several cases to consider because the waterline may 
intersect either end of the cylinder, neither, or both. In what follows x, y, z are 
cartesian coordinates, fixed in the body, with the z-axis placed along the axis of 
revolution. Then the cylinder is x* + y? < R*,0<z<H. 

In the simplest configuration, the end z = 0 is completely submerged and the 
end z = H is completely out of the water. The waterline plane passes through the 
points (— R, 0, pH — R tan @), (0,0, pH), and (R,0, pH + R tan @). This configura- 
tion is possible only if 


R 
7 tan 8 < Min {p,1 — p}. (11) 


If this orientation is an equilibrium, it is stable. For, the waterline figure A is an 
ellipse with semiaxes R and Rsecaé@ > R; the corollary to Theorem 3 ensures 
pitch-stability. The equilibrium condition is that G be directly above B, i.e., the 
line GB must make angle @ with the axis of the cylinder. Now G = (0,0, H/2) and 
B = (B,,0, B,) where 


R? tan 6 oH R tan 6 \’ 
B == B = . 


*" 4pH ”? “7 2 2pH 


The condition for stability is B, = (H/2 — B,)tan @, or 


1 _ 9 12 
~p){—| == + -tan’9. 
p(1~p)() = 5 + ztan (12) 
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If p(1 — p)(.H/R)* < 1/2, there is no equilibrium of this kind, but then (9) 
shows that the cylinder floats stably on its end. For larger values, (12) determines 
an angle @. If 6 satisfies (11), then the cylinder floats stably tilted at angle 6. If (11) 
is violated, (12) no longer applies and the equilibrium is one with waterline passing 
through one or both ends. 

To simplify the problem, only the case p = 1/2 will be considered. Formula 
(12) with p = 1/2 determines an angle @ satisfying (11) if 2 < (H/R)* < 8/3. 
When H/R = y8/3 = 1.63299, the tilt angle is 6 = 39.23° and the two ends of 
the cylinder are just tangent to the water surface. For larger values of H/R, 
consider a configuration with waterline plane passing through (—(H/2)cot 6, 0, 0), 
(0,0, H/2), and ((H/2)cot 6,0, H). The formulas that follow are simplified by the 
notation 


H 
t= = cot, Jy = fx'V1 — x? de (13) 
2R 0 
for k = 0,2. The integrals J, are elementary, 
1 
Jy = mau 1 — ¢? + arcsin ¢} 


Jy = o (is ~11-2?)"}. 


The waterline in this configuration passes through both faces. That requires 
(H/2)cot 0 < R,ie., t <1. 
Now V = 7R’*H/2 and B = (B,,0, B,) with 


4 
VB, = ZR — J3) 


x 


7 1 


6 4 2 


The equilibrium condition 


2J,-J 
tan? 9 = ——~—. (14) 


The stability of this equjlibrium must be determined by finding the principle 
moments of inertia of the waterline figure A. A is no longer the entire ellipse, 
with semiaxes R and Rsec 6, because that ellipse protrudes beyond the ends of 
the cylinder. To get A, pieces at the ends of the ellipse must be cut off to leave a 
figure of length H csc @ (see Ficure 5). The moments for pitching and rolling are 


I, = 4R* sec? 0J, 
I. = =R‘ sec 0( J, — Ja). 


r 


The corollary to Theorem 3 now gives the stability condition J, > I,, or 


(15) 
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Fic. 5. Waterline figure A of floating circular cylinder. 


90° 


13. y2 15 y8/3 17 18 19 20 21 
H/R 


~ Fic. 6. Stable orientation of cylinder of density 1/2 as function of H/R. 


Each value of t in 0 < t < 1 determines an angle @ in (14) and then (13) gives 
H/R = 2t tan 6. A check on the condition (15) shows that the equilibrium is stable 
throughout 0 < +t < 1. As ¢ varies, H/R and @ range over 2 > H/R > y8/3 and 
90° > 6 > 39.23°. 

The cylinder with density p = 1/2 has, for each H/R, a unique angle @ at 
which it floats stably. FiGure 6 shows how the stable orientation varies with H/R. 
Formulas @ = 0°, (12), (14), and @ = 90° hold in the ranges 0 < H/R < v2, 
V2 <H/R < 8/3, 8/3 < H/R <2and2<H/R <~«. 


9. Inversion symmetry. A body has inversion symmetry if, when the origin O 
of coordinates is placed at the centroid G, the inversion (x, y, z) > (—x, — y, — Z) 
leaves the body unchanged. If a body has inversion symmetry, then any plane 
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through G cuts the body into two congruent halves. If a body has p = 1/2, as well 
as inversion symmetry, then its waterlines are easy to find. The waterline, for any 
given n, lies in the plane normal to n that passes through G. 

Bodies of revolution, being already indifferently stable with respect to rolling, 
seem likely candidates to be indifferently stable to all perturbations, as requested 
by Ulam. 


THEOREM 4. Except for spheres, there are no indifferently stable bodies of 
revolution with p = 1/2 that also have inversion symmetry. 


Proof. Introduce spherical polar coordinates (r, y, 6) with origin at G and with 
polar angle y (the axis of revolution is y = 0 and y = 77). An equation of form 
r = r(y) will describe the surface of the body. 

Consider the body floating with upward direction vector n and let n have angle 
coordinates y = 0, 6 = 0. Now introduce a second system of spherical polar 
coordinates (r, a, B) with pole a = 0 at n. The original polar angle y is related to 
the new coordinates by 


cos y = cos 8 cos a + sin @ sin a cos B. (16) 
The potential U(m) is (1) with 


S(n) = —[ff(r -n)r? sin a drda dB, 


wherein the integration extends over points P of the body below the waterline 
plane. The range of integration is 


IT 
O<r<r(y), 0<B <27, y Se<7, 


where y depends on a and 8B, as in (16). Because P -n = r cos a, the r-integration 
is immediate; 


1 
S(n) = ~Z J [r*(y)oos asin a da dp. (17) 
Expand r‘*(y) in a Neumann series 


r“(y) = dA, P,( cos Y) (18) 


with P,(-) the kth Legendre polynomial. Because the body has inversion symme- 
try, r“(7 — y) = r*(y), and coefficients A, with odd k vanish. With cos y given 
by (16) the addition theorem for Legendre polynomials ((3]) expresses P,(cos y) in 
terms of Legendre polynomials P,(-) and associated Legendre polynomials P;"(-) 
of cos a and cos 0 
P P )P 56 yy Ea)! 
C = COS cos @) + —_—_—_— 
(COS ¥ ) 1 ( a) P,( ) = (k+m)! 
Then when (18) is substituted into (17), the integration on B removes the terms 
containing associated Legendre polynomials, and leaves 


S(n) = - 5 {Ac + )° A,P,(cos 6) ['Pax)xae}, (19) 


P;"(cos 6) P;"(cos a)cos mB. 


Now suppose the body is indifferently stable, so that S(m) is constant. In (19), 
the coefficient of each term P,(cos 6) must vanish. With k even, say k = 2r > 2, 


(-1)'"7! (2 


a(2r—-1(2r +1) \r *0 


[Pd x)xds = 
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and so A, = 0. Also, as was mentioned above, A, = 0 for odd k because of the 
inversion symmetry. Then (18) becomes r*(y) = A, and the body is a sphere. & 


Bodies with inversion symmetry are so special that the theorem does not 
strongly suggest that the sphere may be the only indifferently stable body. For 
example, in the problem of indifferently roll-stable cylinders, a very similar 
argument would show there are no non-circular examples with inversion symmetry. 
Yet, Auerbach produced examples without inversion symmetry. 


10. Cube. This section will show that a homogeneous cube can float in 
surprising ways. Again, x, y, z will be body coordinates, now in the interval [0, 1]. 
Then G = (1/2,1/2,1/2). Three planes through G, each normal to a coordinate 
axis, are planes of mirror symmetry of the cube. It follows, with n = (A, py, v), that 
U(n) depends only on the magnitudes |A|,|wl, |v|. In what follows A, u,v will be 
assumed non-negative. It will also suffice to restrict p to 0 < p < 1/2 because 
Theorem 2 will then show how denser cubes behave. Equilibria with n along axes 
of symmetry will be considered first; the metacenters for these equilibria are easily 
calculated. 

When the cube floats face-down, n -(G — B) = (1 — p)/2. Also n- (M — B) 
= 1/(12p) for rotations about any axis t. Then the cube floats stably on its face if 
pl — p) < 1/6, i.e., if p < .211325. 

When the cube floats edge-down, 


l 2 
n-(G-B)= =~ aye. 


The metacenter now depends on the axis ¢. It will be convenient to use the terms 
roll and pitch as though the cube were a ship with the submerged edge as its keel. 
The metacenters M,, M, for rolling and pitching satisfy 


2 I 
n-(M,—B) 3 VP » n-(M, — B) éJp 
For 0 <p < 1/8 the equilibrium is a saddle point of U(n), roll-unstable and 
pitch-stable. For 1/8 < p < 9/32 it is a maximum. For 9/32 < p < 1/2 it is again 
a saddle point, but roll-stable and pitch-unstable. 

When the cube floats vertex-down the submerged part of the body is a 
triangular pyramid for 0<p< 1/6. But, when 1/6 <p < 1/2 the waterline 
plane cuts all 6 faces of the cube and the figure A is an irregular hexagon. The 
calculations of B and M are longer, although still elementary. Floating vertex-down 
turns out to be unstable (a maximum) for 0 < p < 1/6 and stable for 1/6 < p < 
1/2. 

Equation (5) shows that axes of symmetry need not be the only equilibrium 
directions. When 1/8 < p < 9/32 these axes provide no saddle points. A curious 
phenomenon occurs when .211325 < p < .224602. In that range the potential U() 
of the unstable face-down orientation is actually lower than the potential of the 
stable vertex-down orientation; then there must be other stable orientations with 
even lower potential. 

To find the missing equilibria numerically one can just plot U(n) as a function 
of n on the unit sphere. The function U(n), which will be derived next, can only be 
piecewise analytic; its form depends on which vertices are submerged. 

For fixed n = (A, pw, v), the derivation introduces an integral 


F(s) = [fferorter tees dx dy dz 
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taken over the entire body. For the unit cube, 

l1-e *1-—e-4# 1-e7" 

——— (20) 
AS LS VS 


but the same argument will apply to a general body as long as the origin is located 
so that the body lies entirely in the positive octant. 


F(s) = 


THEOREM 5. For k = 1,2, write f,(h) for the inverse Laplace transform of 
F(s)/s*. When the water surface is the plane 4x + wy + vz =h, fh) is the 
displaced volume V and f,(h)/V is the depth of the center of buoyancy B below the 
water level. 


Proof. Write A(h) for the area of intersection of the plane Ax + wy + vz =h 
with the body. Then 
F(s) = e~"S dy dy dz = “eT hsA h) dh, 
(s)= fff fe ™Ah) 


the Laplace transform of A(h). F(s)/s is the inverse Laplace transform of 


f(h) = far) dh’, 


but that integral is just the displaced volume V below the plane Ax + wy + vz =Ah. 
Likewise, F(s)/s? is the Laplace transform of 


fan) = f° f° ACH") dh an 


_ ["cr — h") A(h") dh", 
(0) 


the second line following by interchanging the order of integration. But this 
integral is just V times the depth of the center of buoyancy. @ 


In general, Archimedes’ principle specifies V. Then the equation 
f(A) — V, (21) 
from Theorem 5, determines A and the depth f,(h)/V of B. G is at height n- G 


above the origin, or n - G — h above the water level. Then the potential U(n) in 
(1) is proportional to 


f,(h) 
+ — 
V 
For the unit cube, V = p in (21) and n:G=(A+y+v)/2 in (22). To find 


f,(h), multiply out (20) into a sum of 8 terms, each an exponential divided by 
Apvs**>. Transforming this expression term-by-term produces 


n:'(G-—B)=n:-G h. (22) 


fh) — Tank pat tall _ I|h — I< *? — Il _ ull<*? _ \|h _ p||**? 


+||h — A ~ pllS** + lh — pw — vile + Wh — » — Alle? 
lh -A ~ pw ~ vil), (23) 


where ||z|| means z if z >0 and 0 if z <0. Each term in (23) represents a 
different vertex of the cube. Thus A is the depth of (0, 0,0), A — A is the depth of 
(1,0,0), A —A —yp is the depth of (1,1,0), etc. Only the submerged vertices 
contribute non-zero terms to (23). 
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Formulas (21), (22), (23) were used in a numerical study of a cube of density 
p = .217. This density lies in the interesting range, discussed earlier, in which the 
global minimum of U(n) occurs away from the cube’s symmetry axes and so 
remains to be found. Figure 7 shows level contours of n-(G — B), drawn in 
gnomonic projection from G onto a typical face of the cube as in FiGuRE 4b. 
Beside the equilibria at the face center, edge centers, and vertices, there are four 
more saddle points, one near each corner of the face. Note that equipotentials in 
Figure 7 do not meet the face edges at right angles because the gnomonic 
projection is not conformal. 


0.39317 


oe) 
a Zane 


Fic. 7. Equipotential lines on a face of a cube of density p = .217. 


The missing global minima turns out not to be isolated points. They form a 
circle, with n-(G — B) = .39143, surrounding the face center. To show this 
analytically, consider directions n near (0,0, 1), a face-center axis. Suppose A and 
w small enough (and v = y1 — \* — pw” large enough) so that the submerged 
vertices are (0,0,0), (1,0, 0), (0, 1,0), and (1, 1,0). Then f,(A) in (23) is a sum of 
four terms. When the terms of (23) are expanded in powers of h, considerable 
cancellation occurs, leaving 

/ 


1 ’ 
f,(h) = 75, ton — 6A(A + w) + 2A° + 3Am + 27}. (24) 
V 


A+ pm 
2 


p(n) = [a 


and ‘ 


The condition f,(h) = V = p of Theorem 5 provides 
A + 
h= 5 r + pv. (25) 


Now substitute (24) into (22) and use (25) to eliminate h. The result, with the help 
of A? + p’* + v? = 1, simplifies to 


1 il 
n-(G~B) = 5-4 + v[120(1 ~ p) ~ 1] (26) 
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Only the » component of n enters into (26) and so the level contours of 
n-(G — B) near the face center are circles. If p(1 — p) > 1/6 (recall that this was 
also the condition that the face center be an unstable equilibrium point), then 
n-(G — B) in (26) is a minimum at 


1 


v12p(1-p)-1- 


With p = .217 the minimum occurs at v = .981085 = cos 11.16°. For all such n, 
only the vertices of one face are submerged and (24) does apply. 

A cube of density p = .217 will float with a face-center tilted 11.16° away from 
vertical. However, the coordinates A and p of n are restricted only by A* + p* = 1 
— p* = 037473 and the slightest disturbance will move the cube to a new equally 
stable orientation with the same v. This cube resembles the cylinder in having a 
one-parameter family of indifferently stable orientations. In fact, as n ranges over 
all directions, n -(G — B) remains within 0.65% of .3940 and so the cube comes 
close to satisfying the condition n -(G — B) = const of an indifferently stable 
body. 
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Introduction. The iteration of complex analytic functions is one of the most 
alluring subfields of non-linear dynamical systems. One reason for this is the 
striking beauty of the computer graphics images of Mandelbrot [Mal], 
Peitgen/Richter [PR], Douady and Hubbard [DH, DH1], and others. But, for 
mathematicians at least, the attractiveness goes much deeper: the combination of 
powerful techniques from complex analysis and dynamical systems yields a field 
which has an intrinsic mathematical beauty of its own. 

Basically, the problem in complex analytic dynamics, as in many dynamical 
systems, is to understand the fate of all points under iteration of a map F. That is, 
given z © C, what can we say about the sequence of points z, F(z), F7(z) = 
F(F(z)),..., F"(z),... called the orbit of z. Here we emphasize that F” means 
the n-fold composition of F with itself, not the nth power of F. 

There are basically two types of orbits in any dynamical system: those that 
behave stably or tamely, and those that behave unstably or chaotically. To be 
imprecise for the moment, an orbit is chaotic if there are nearby points whose 
orbits tend to diverge far from the given orbit. This means that a chaotic orbit does 
not lend itself to numerical computation: any small error due to round-off or other 
numerical approximation may throw us off the original orbit and onto another one 
whose fate is radically different. The stable orbits, on the other hand, behave more 
tamely: an orbit is stable if nearby points have orbits which remain nearby for all 
iterations. 

In complex dynamics, the set of all chaotic orbits is called the Julia set, after the 
French mathematician Gaston Julia who, along with Fatou, first studied the 
iteration of complex analytic functions in the 1920’s [J], [F]. The Julia set consists 
of all points that have the property that, if U is any neighborhood of z, then any 
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point z* in C (with at most one exception) has at least one preimage in U for some 
F". That is, the iterates of F tend to smear arbitrarily small neighborhoods of 
points in the Julia set over almost the entire complex plane. In this sense J(F) is a 
“strange” repeller. It is a fact that the Julia set contains all of the “interesting” 
orbits of a complex dynamical system. For this reason, its structure is the subject of 
much contemporary research. 


In this paper we will concentrate on the Julia sets of entire transcendental 
functions like A exp z where A > 0 is a real parameter. Entire functions share 


many of the dynamical properties of polynomials or rational maps [B], [DH], but 
there are some major differences. For example, the Julia sets of these maps have 
the interesting property that they may explode as the parameter A is varied. That 
is, the Julia set undergoes sudden, dramatic changes at certain critical parameter 
values. In the exponential case, one such explosion occurs when A = 1/e. For 
A-values less than this critical value, we will show that the Julia set occupies a 
relatively small, nowhere dense subset of the right half plane, but when A > 1 /e, 
the Julia set is the whole plane! This means that the set of chaotic orbits changes 
discontinuously at this parameter value. This type of bifurcation cannot happen in 
families of polynomials or rational maps. One of our major goals in this paper is to 
describe this explosion in detail. 

In §1 of this paper we describe some computer experiments which foreshad- 
owed these results. In §2 we present some preliminary notions from complex 
analysis and dynamical systems theory. We also give a precise definition of the 
Julia set and formulate some of its basic properties in this section. The goal of §3 is 
to describe the Julia set of Ae* before the explosion occurs. We prove here that 
the Julia set is a nowhere dense subset of the right half plane when A < 1/e. In §4 
we concentrate on the case A > 1/e where the Julia set is the whole plane. We 
also present several alternative characterizations of the Julia set in these two 
sections, first as the closure of the set of repelling periodic cycles and second as 
the closure of the set of escaping orbits. This last characterization gives an efficient 
algorithm for plotting the Julia sets of these functions. Finally, in §5, we indicate 
how similar analysis may be used to describe explosions in other families of entire 
transcendental functions. 


1. Computer Experiments. The explosions in the Julia set of an entire function 
are easy to discover experimentally using a computer. The reason for this is that 
there is an alternative characterization of the set of chaotic orbits of entire 
functions which makes computation of the Julia set quite easy. Later in this paper 
we will show that the Julia set of Ae” may also be defined as the closure of the set 
of points whose orbits tend to © in the right half plane. While at this stage there 
appears to be nothing chaotic, about an orbit tending to , we will see later that the 
essential singularity at © for Ae* introduces a tremendous amount of chaotic 
behavior into the dynamics nearby. 

Using this alternative characterization of the Julia set, the algorithm to compute 
pictures is quite simple: 


1. Select a square in the plane and construct a k X k grid in this square. 

2. For each grid point, compute the orbit of this point up to a maximum of N 
iterations. 

3. If, at iteration i < N, the real part of the orbit is larger than some bound, B, 
stop iterating and color the original grid point black. 

4. If no point in the orbit ever enters Re z > B, leave the original point white. 
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In the output generated by this algorithm, we think of the black points as 
constituting the Julia set and the white points as lying in the complement of the 
Julia set. This is not quite accurate, as we discuss below, but serves as a good 
guideline for experimental observations. 

In FiGuRE 1, we have displayed the Julia sets for several exponentials computed 
using this algorithm. In each case we have used the bound B = 50. One might 
object that escaping past Re z = 50 hardly constitutes escaping to ». However, if 
Re z > 50, then |Ae7| > |Ale°? which is quite large. Moreover, as we will see 
below, the pictures we obtain with this bound are quite accurate. 

FiGureE la displays the Julia set of Ae’ with A = .3 < 1/e. Note that there is a 
large white area in this figure. The points in this area do not lie in the Julia set. 
This figure was generated using N = 25, but the reader may check that the same 
figure 1s obtained using any larger N. 

FiGuRE 1b displays the Julia set of Ae’ with A = .4 > 1/e. Note the dramatic 
change in the size of the Julia set for this value of A! This is the explosion in the 
exponential family. 

FiGURE 1b was computed using N = 25. If we increase N to 50, we obtain 
FiGuRE lc. Note that the black region has apparently increased in size. Actually, as 
we discuss below, there should be no white whatsoever in these pictures: the Julia 
set of Ae* when A > 1/e is the entire complex plane. The reason we see white 
points when using the above algorithm is that it takes some points longer than 25 
or 50 iterations to escape. 


Fic. la. The Julia set of 3e2% 


ic. 1b. Julia set of .4e%. Fic. lc. Fig. 1b. recomputed with N = SO. 
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Fic. 2a. The Julia set of .66i cos z. Fic. 2b. The Julia set of .687 cos z. 


Explosions occur in other families of transcendental functions too. For example, 
the Julia sets of the complex sine or cosine functions may be computed using the 
above algorithm with one slight modification. For sine and cosine the Julia set may 
also be defined as the closure of the points whose orbits tend to infinity with the 
imaginary part tending to +. Hence we color a point black if its orbit enters the 
region |Im z| > 50. Figure 2 shows the result of applying this algorithm to 
.66i cos(z) and .68icos(z). Figure 3 displays the Julia set of sin(z) and 
(1 + .1i)sin(z). In both cases we note that the Julia sets undergo dramatic 
changes. These are the explosions which we describe in this paper. 


“ake 


Fic. 3a. The Julia set of sin z. Fic. 3b. The Julia set of (1 + .12)sin z. 


2. Preliminaries. Before looking at the specific case of Ae’, let us first consider 
some basic definitions from dynamical system theory. First, a fixed point for a 
function F, as the name implies, is a point z such that F(z) = z. The point z is 
periodic if there exists an integer n such that F”"(z) = z. We define the smallest 
positive such n as the period of z and note immediately that any fixed point, Zz, is 
periodic with period one, and, in fact, F”(z) = z for all n. 
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Of particular interest to us as we consider Julia sets will be the behavior of 
orbits nearby certain periodic points. We first distinguish several different types of 
periodic points. First, if z,) is a periodic point of period n, and 0 < ((F")'(z))| < 1, 
then z,) is said to be an attracting periodic point. In the special case when 
(F")'(z)| = 0, we call z) a super attracting periodic point. If z, is a periodic point 
of period n and \((F")'(z,)| > 1, then z, is defined to be a repelling periodic point. 
The intermediate case, (F”")(z,)| = 1, gives indifferent periodic points. As we 
would expect, points nearby an attracting fixed point tend, upon iteration, towards 
that point. In the case of a repeller, points nearby the fixed point eventually escape 
from that point upon iteration. 

More specifically, we state 


PROPOSITION 2.1. If Z, is an attracting fixed point, then there is an open disk U 
about z, such that if z is an element of U, then lim,,_,.(F"\(z) = Zo. 


Proof. Since Z, is an attracting fixed point, there exists 6 > 0 and w < 1 such 
that if |z — z,)| < 6, then |F'(z)| < w. Hence it follows that if z,, z, © B;(z,), the 
ball of radius 6 about z,, then 


|F(z,) — F(z2)| < mlz, — 221 < lz) - 2a. 
Therefore, F(B3(z))) C B,3(zy). By the Contraction Mapping Principle, F is a 


contraction on B;(z,), and so all points in the ball tend to the fixed point z, under 
iteration. 


In the above proof, the set D = B;(z,.) — F(B;(z,) is called a fundamental 
domain. This means that, if z € B;(z)), z # Z,, then there is a unique w € D and 
n > 0 such that F”(w) = z. This means that each forward orbit in B;(z,) which is 
not the fixed point passes through D exactly once. 

To see why this is true, we note that, as above, 


F"(Bs(Z)) Cc Bns(Zo)- 


Hence there is a smallest integer j >0 such that z < F/(B,(z,)) but z¢ 
F!*'(B,(z,)). Then we may let w € D be such that F/(w) =z and n =j. See 
FIGURE 4. 


F(B5(Z9)) 


Fic. 4. Bs(zo) — FCBs(z,)) is a fundamental domain. 


We call the set of all points whose orbits tend to a given fixed point the basin of 
attraction of that point. Clearly, the basin of attraction is an open set. 

In the case of a repelling fixed point, nearby points eventually escape from the 
repeller upon iteration. In contrast to the previous proposition, we state 


PROPOSITION 2.2. If Z, is a repelling fixed point, then there exists a disk U about 
Zy such that if z € U(z # 29), then there exists k > 0 such that F*(z) € U. 
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Proof. Since z, is a repelling fixed point, we have |F'(z,)| > 1. Hence, by the 
Inverse Function Theorem, there exists a neighborhood of z,) on which a branch of 
F~' exists. We compute 


\(FO')'(z9)| = 1/|F'(F7'(29))| = I/|F'(20) | < 1. 


We conclude that z, is an attracting fixed point for F~' and hence, by our 
previous proposition, there exists 5 > 0 such that if z © B,(z,) then F-'(z) € 
B,(z,) and (F7')"(z) > Zp. 

Now assume that for any neighborhood U of Zp, there exists z*, z* # z), which 
satisfies F*(z*) € U for all k => 0. We may assume that U = F~'(B,(z,)) for 6 
sufficiently small. As above, B,(z,) — F~'(B;(z,)) is a fundamental domain for 
F~'. Hence there exists w € B,(z)) — F-'(B;(z,)) and n > 0 such that (F7')"(w) 
= z*, But then F"(z*) = w & U. This contradiction establishes the result. q.e.d. 


In order to give a more precise definition of the Julia set, we now shift our 
attention to some important concepts from traditional complex analysis. Let F be 
a family of analytic functions defined on an open set U C C. For our purposes, 
will usually be the family of iterates of a given complex function. The family ¥ is 
said to be normal on U if every sequence in Y has a subsequence that either (1) 
converges uniformly on every compact subset of U, or (2) converges uniformly to 
on U. 

Of particular interest to us later on will be 


THEOREM 2.3 (Montel). Jf Uyeglf(U)] omits two or more points in the 
complex plane, then FY is a normal family on U. 


Equivalently, if Y is not normal on U, then 


U [f)] 


fEF 


omits at most one point in the complex plane. 

In dealing with entire, transcendental functions like Ae* we have access to a 
particularly powerful tool. Recall that such functions have an essential singularity 
at oo, Hence, they satisfy the hypothesis for the Great Picard Theorem. 


THEOREM 2.4. Suppose an analytic function F has an essential singularity at 
z =a. Then for any neighborhood U of a and for all z* © C (with at most one 
exception) there exist infinitely many z € U such that F(z) = z*. 


We are now ready to give a formal definition of the Julia set. Let us consider 
FY = {F"} where F” is the nth iterate of F. We define the Julia set of F, JCF), by 


J(F) = {z|{F”} is not normal on any neighborhood of z}. 


We make the observation here that the Julia set is completely invariant. That is, 
if a point is contained in J(F), then so are all of its images and all of its preimages. 
In addition, from this definition we may immediately conclude that all repelling 
periodic points are contained in J(F). In fact, we can show 


PROPOSITION 2.5. Closure{repelling periodic points} C JCF). 


Proof. Suppose z, belongs to the closure of the repelling periodic points. If U 
is an open neighborhood of z,, then there exists z* € U such that z* is a 
repelling periodic point. Without loss of generality we will assume that z* is a 
repelling fixed point. Hence, there exists a neighborhood V of z* such that for 
z €V (z #2z*) there exists k > 0 such that F*(z) € V. At the same time, we 
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know that F*(z*) = z* © V for all values of k. Hence, {F”} cannot converge 
uniformly on V. It follows that {F”} is not normal on U and, therefore, z,) € J(F). 


In contrast to the above proposition, the attracting fixed points and their basins 
of attraction are never contained in the Julia set. This follows easily from our 
previous work when we note that any point in the basin of attraction of a fixed 
point z, has about it a neighborhood U such that if z € U, then lim, _,,. F(z) = 
Z). Hence, F” converges uniformly to the constant function g(z) = z) on U. We 
conclude that F” is normal at z, and, therefore, z) ¢ J(F). Similar arguments 
give the same result for periodic points. 

We conclude this section by discussing some of the elementary dynamical 
properties of the exponential function E,(z) = Ae* with A > 0. As a dynamical 
system on the real line, this map has two quite distinct dynamical behaviors, 
depending on whether 0 < A < 1/e or A > 1/e. This can be seen immediately by 
looking at the graphs of E, (see Ficure 5). When A < 1/e, E, has a pair of fixed 
points, g, and p,, with 0 <q, < p,. Note that q, is an attracting fixed point since 
E‘(q,) < 1. On the other hand, p, is a repelling fixed point since E{(p,) > 1. If 
x >p,, then E(x) > x and it follows that Ey(x) > © as n > ~. If, however, 
X <p), xX #q,, then 


| E,(«) —4q,|<lx-4@,| (*) 


so E(x) > q, as n > ©, When A > 1/e, the dynamics of FE, are quite different. 
For all x € R, we have E(x) > x, so that E¥(x) > © as n > . Thus all points 
have orbits which tend to © when A > 1 /e. 


la 


0.0 ‘0.0 
Fic. 5. The graphs of Aexpz for A > 1/e and A < Ife. 


The change in the dynamics of FE, which occurs at A = 1/e is an example of a 
bifurcation. In this case it is a relatively simple bifurcation known as a saddle-node 
bifurcation. In this paper we will investigate the effects of this bifurcation in the 
complex plane. We will show that the global ramifications of this bifurcation are 
quite spectacular in C: the Julia set of FE, explodes as A increases through 1 /e. 

Before describing the Julia set of F,, we recall some of the basic mapping 
properties of the exponential function: 


1. E,(x + iy) = Ae*e'” = Ae*(cos y + isin y) (Euler’s formula.) 
2. Consequently, £, maps vertical lines x = c to circles of radius Ae° centered 
at the origin, and horizontal lines y = c to rays 0 = c emanating from the 
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origin. In particular, any rectangle with sides parallel to the axes and vertical 
length 277 is mapped onto an annular region surrounding the origin. 
3. E, 1s 27ri-periodic. 


3. The Tame Exponential. In this section we will describe the Julia set of EF, 
when A < 1/e. We will see that J(E,) is quite small in this case: most points have 
orbits which are stable when A < 1/e. Recall that when A < 1/e, E, has two fixed 
points on R: an attracting fixed point at q, and a repelling fixed point at p,. Since 
EQ) = Ae and E,(—log A) = 1, we have 


O<q, <1< —logaA <p,. ( *) 
Consider the half plane 
H = {z|Re z < —log A}. 
If z € H, then 
|EX(z)| =|£,(z)| = A exp(Re z) < 1. 


Hence £, contracts H into the unit disk, which, since —log A > 1, is completely 
contained in H. Thus, by the Contraction Mapping Theorem, all points in H tend 
under iteration to a fixed point, which must be q,. Let us denote the basin of 
attraction of gq, by W(q,). Thus, H c W(q,). Clearly, all points in W(q,) have 
stable orbits since they all tend asymptotically to the same point. As a conse- 
quence, we will be mainly interested in the complement of W(q,). As we shall see, 
this set is the Julia set, J(E,). 

Let us investigate the dynamics of E, in the complement of W(q,). Using (*), 
we may choose v such that 


1<E,(v) < -logA <v<p,. 


Let H, denote the half plane Re z < v. Arguing as before, E, maps H,, inside H 
and so H, is also contained in W(q,). Note that if Re z > v, then 


|EX(z)| =|E,(z)| > Ae’ > 1. 


This fact is important since it shows that FE, is an expanding map on the 
complement of H,,. 

Before discussing the properties of the complement of W(q,) we first show that, 
for A < 1/e, this set is quite small. To do this we need an important lemma that 
we will use several times in the sequel. (We note that this result holds for all A, not 
just A < 1/e.) Let B;(z,) denote the open ball of radius 6 centered at zp. 


Lemma 3.1 (Expansion Lemma). Suppose |E{(z)| > w for all z © B;(z,.) where 
5 <1. Then there is an open set U C B;(z,) such that Ey: U > B,;(E,(z,)) is a 
homeomorphism. 


Proof. Since 6 <7, E, is one-to-one on B,(z,). Hence we may define the 
inverse map L: E,(B,(z,))) > B;(z,). From the chain rule it follows that |L’(z)| < 
1/p for all z © E,(B;(z,)). Thus, if |z) — z,| = 6, we have 


5 =|z) — z,;|)=|L(E,\(20)) — L(E,(21))| 


lA 


1 
—|E,( Zo) — E,(2,)| 
iv 


It follows that E, maps |z, — z,| = 6 to a curve which contains B,;(E,(z,)) in its 
interior. By the Maximum Principle, E,(B;(zy)) > B,s(E,(z)). 
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Thus we see that, as long as the orbit of a point remains in the complement of 
H,, where |E{(z)| = |E,(z)| = v > 1, successive iterations of E, tend to expand 
neighborhoods of the original point. We may apply this idea repeatedly to prove 
the following result. 


THEOREM 3.2. Suppose Z, lies in the complement of W(q,) and suppose U is an 
open set containing z,. Then U 1 W(q,) # ©. 


Proof. Assume that EY(U) M W(q,) = © for all n. It follows that EX(U) A H, 
= © for all n. Hence we have 


|E{(z)| >> 1 


for all z © U, where » = Ae”. Choose 6 so that B;(z,) C U. Then the Expansion 
Lemma implies that E,(B;(z,)) > B,s(E)(z,)). Now B,5(E,(z,)) does not meet 
H,, for otherwise we have a contradiction. It follows that we may apply the 
Expansion Lemma again to this disk. Continuing in this fashion, and using the fact 
that EX(z,)) € H,, it follows that we may find a disk of radius w”6 about EY(z,) 
which does not meet H,,. If we choose n large enough so that "6 > 277, then this 
disk must meet one of the horizontal lines of the form y = (2k + 1)z, where 
k © Z. But these lines are mapped by EF, onto the negative real axis, which lies in 
H,,. This contradiction establishes the result. q.e.d. 


CorOLLARY 3.3. W(q,) is open and dense in C. 


As a consequence, the complement of W(q,) is a nowhere dense subset of the 
right half plane. We use this fact to prove: 


PROPOSITION 3.4. The complement of W(q,) is precisely JCE,). 


Proof. We show first that the complement of W(q,) CJCE,). Suppose Z, 
belongs to the complement of W(q,). Since the complement of W(q,) is nowhere 
dense we know that for each neighborhood U of z, there exists z © U such that 
z € W(q,). Now, with n sufficiently large, we can make |E%(z) — q,| as small as 
we wish. In particular, we know that there exists N > 0 such that E7'(z) lies to the 
left of —log A for all n > N. At the same time, notice that E{'(z,) must remain to 
the right of —log A for all n. Therefore, the family of functions {£7} is not normal 
at Z,). We conclude that the complement of W(q,) is contained in J(E,). 

For the opposite direction, let z, be in J(E,). By definition, {Ey} is not normal 
on any neighborhood of z,. Let us suppose z, € W(q,). Then there exists a 
neighborhood, U, of z, such that EX(U) > q, since W(q,) is open. Hence, {£7} 
converges to the constant function g(z) = q, on U. Thus, {E{} is normal on U. 
This contradiction shows that z, is contained in the complement of W(q),). 


Note that, as a result of this proposition, all points which escape to «© under 
iteration of E, necessarily lie in J(E,). There are other points, like p,, which 
do not escape, but which nevertheless lie in the Julia set. From the graph of 
E, (Fic. 5), we see that in the case of p, itself, there are points arbitrarily nearby 
which do escape. This is a general phenomenon: the Julia set may also be 
described as the closure of the set of points which escape to © under iteration of 
E,. We stress that this is not the case for all functions. In particular, for 
polynomials, the escaping points are never in the Julia set since © acts as an 
attracting fixed point in C U {o}. For entire functions, however, the essential 
singularity at © makes the situation quite different: Picard’s Theorem implies that 
neighborhoods of % are mapped over all of C — {0} by E,. This suggests that 
escaping points are in the Julia set in this case. 
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Let Esc(A) be the closure of the set of points which escape to © under iteration 
of &,. We now prove 


THEOREM 3.5. If A < 1/e, then J(E,) = Esc(A). 


Proof. Clearly, if z) € Esc(A), then z, € W(q,) so z) € J(E,). Hence, Esc(A) 
Cc J(E,). For the other direction, suppose z, € J(E,). If U is any neighborhood of 
Z) we must produce a z € U such that |E"(z)| > ~., Since {E;"} is not normal on 
any neighborhood of z,, Montel’s Theorem tells us that U ,[ Ey(U)] omits at most 
one point in C. In particular, there is a point z* © R with z* > p, such that 
z*e U,[E"(U)]. Thus, there exists z € U such that E/(z) = z* for some j. 
Consequently, z € Esc A. Hence z, € Esc(A) and J(E,) ¢ Esc(A). 


We now describe the geometry of the Julia set of E, when A < 1/e. Certainly, 
J(E,) contains the interval [ p,,~), since p, is a repelling fixed point and all other 
points in the interval tend to © under iteration. By 27ri-periodicity and invariance 
of J(E,), the Julia set also contains the 27ri translates of this interval as well as all 
preimages of these lines under £;’. These are by no means all of the points in 
J(E,); this set contains what we call a “Cantor bouquet.” We will sketch the 
geometric construction of this set. For more details, we refer the reader to [DT]. 

Let C, denote the half-plane Re z > v. Cy is the complement of H,. Let us 
remove from C, all points which are mapped into H, by E£,. The regions that 
remain are therefore defined by applying various branches of the inverse transfor- 
mation of E, given by 

Ly (2) = log,(z/A) 

to Cy. Here log, is the branch of the logarithm function defined on C — {x € R| 
x < 0} and taking values in the strip Qk — 1)a <Imz < (2k + 1)a. Each L, , 
maps C,-onto a region shaped like a “tongue” as depicted in FiGure 6. Let us 
denote by 7, the image of Cy under L, ,. Let C; = UT,. C,, therefore, consists 
of all points in C) which remain in C, for at least one iteration of E,. We think of 
C, as being obtained from C, by the removal of an open set which consists of 
“fingers” extending to the right and separating the T,’s. Notice that these fingers 
are mapped by E, into H, c W(q,). 


Fic. 6. The tongues in Cp. 
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We may obtain J(E,) by iterating this procedure. Apply each L, , to C). The 
result is an infinite collection of tongues which lie in 7,. Equivalently, E, expands 
T,, over Cy in a one-to-one fashion and so there are regions in T, which are 
mapped onto each 7;. Let us denote the image of C, under all of the L, ,’s by C. 
We again think of C, as being obtained from C, by the removal of a collection of 
fingers in C, which extend to the right. See Figure 7. C, consists of all points in 
Cy) whose first two E,-images lie in C5. 


CY SE ASSES 


: TQ 


x SASS 


AL.’ ISI FSS 
KR, RWW AN 


Fic. 7. The construction of C5. 


Continuing in this fashion, we denote by C, the images of C,_, under all of the 
L, ,. The reader will undoubtedly recognize this construction as similar to that of 
the construction of the Cantor Middle-Thirds set as encountered in elementary 
analysis. Instead of removing open intervals from the middle of closed intervals, we 
are removing open sets which separate C,,_, into an infinite collection of tongues 
which we denote by C,. Clearly, the C,, are nested closed sets. Note that 


(E,) = (Cy. 
n=0 


This follows since any point which does not lie in some C, is mapped into H, 
under Ey’. 

It can be shown [DK] that this intersection consists of a collection of curves in 
C. Each curve, like the interval | pA, ©), is homeomorphic to the non-negative real 
axis. The collection of these curves forms a perfect set, and hence our terminology 
Cantor bouquet. 


Remark. It is not true that all of the nested sequences of tongues described 
above actually yield a non-empty intersection; some of the tongues nest down to 
as n—o, The exact situation is described in [DK]. For our purposes, the 
important thing is the dramatic difference between the structure of this set and its 
counterpart for A > 1/e described in the next section. 


4. The Chaotic Exponential. In this section we will complete the proofs that 
J(E,) is the closure of the set of repelling periodic points as well as the closure of 
the set of escaping points when A > 1/e. Thus far we have shown that 


J(E,) = {zl{E%} is not normal at z} > {repelling periodic points}. 


In addition, for A <1/e, we have seen that J(E,) = Esc(A). As before, the 
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essential singularity at % suggests that we can adopt this third characterization of 
the Julia set when A > 1/e as well. 

Recall that the exponential is an expanding map in the half plane Re z > —log A 
in the sense that |E{(z)| = A exp(Re z) > 1. Geometrically, this means that E, 
expands areas in this region. This is the basic fact which we exploit throughout this 
section. 

If Re(E,(z,)) > log(3/A), then |Es(z,)| > 3. It follows from the Expansion 
Lemma that FE, expands small neighborhoods of z, by a factor of 3. We may apply 
this idea repeatedly to prove: 


PROPOSITION 4.1. Suppose 
Re( E/(z,)) > log(3/A) + 37 


for j = 0,1,2,.... Let U be any open set containing z, and let S,,(E\(z,)) be the 
square with center EX'(z,) and side of length 27. Then there exists N > 0 such that, 
if n > N, there exists an open set U, CU for which Ey: U, > S,,(EX(z))) is a 
homeomorphism. 


Proof. Since Re(E/(z,)) > 3a + log(3/A), it follows that, if z © B,_(Ej(z))), 
then 


| Ex(z)| = Aexp(Re E,(z)) > 3. 


Now suppose B,(z,) C U. Choose N so that 3%~'5 < @ but 3%6 > a. Apply the 
Expansion Lemma repeatedly to find an open set W,, C B,(z,) that is mapped 
homeomorphically onto By E,(z,)) by EW. Now Bz EX(z,)) > BCE (zy). 
Again apply the Expansion Lemma. We find, for each k > 0, an open set 
V, © B_CE(z,)) such that EX: V, > B;,CE.Y**(z,)) is a homeomorphism. 

Since ‘B,,(EN**(z))) D S>,( EY **(z,)), we may restrict EX to V’ C V, so that 
EX: Vi > S,,(E**(z,)) is also a homeomorphism. Then, if n =N +k, set 
U,=E,‘(V,). It follows that E”: U, > S,_CE""(z,)) is a homeomorphism. _ q.e.d. 


Now suppose z, is an escaping point. Since |Ey'(z))| > ~%, it follows that 
Re Ey'(z,) > © as well. Thus all of the escaping orbits tend to © in the “same” 
direction—to the right. 


LemMaA 4.2. Let zy) € Esc(A) and suppose U is an open set containing z,. Then 
there exists z © U and n = 0 such that EX(z) € R. 


Proof. Since zy) escapes, we may assume that Re Ej(z,) > log(3/A) + 3a for 
all j > 0. Hence the previous Proposition applies and there exists an open set 
U, <U such that EX: U, > S,,CE;'(z))) is a homeomorphism. But S,,CE%(z9)) 
must meet some horizontal line of the form y = ka, k € Z. Then E”?*'(U,) AR 
+ d. 

Now we can prove 

THEOREM 4.3. If A > 1/e, then JCE,) = Esc(A). 


Proof. The proof that JCE,) C Esc(A) is exactly the same as in the case 
A < 1/e. Hence we only prove the converse. Let z) € Esc(A). If U is an open set 
containing z,, then we must show {E;’} is not normal on U. By the previous lemma 
and the fact that all real points escape when A > 1/e, we may assume that z) © R 
and z, > log(3/A) + 37. 

For simplicity, we write z, = Ey(z,). Note that z,,, > z,. By Proposition 4.1 
we may choose n > 0 so that E\(U) > S,_(z,). The mapping properties of E, tell 
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us that E,(S,,(z,)) is the annulus given by 


€ "Zn41 <|z| < eZ, 4}. 


Since e~” < 1/10 and e” > 10, it follows that the annulus E,(S,_(z,)) contains 
S,(z,,). Continuing in this fashion, we find that EX(S,_(z,)) contains the 
annulus 


OC "Znap SAZIC Zp ap: 


We denote this annulus by A,; since z,,, 2 © as k — o, it follows that the A, 
grow very large with k. Thus, as we let & increase we can produce points in U 
which get mapped near the left outer radius of A, as well as those that get 
mapped near the right outer radius. These points, in turn, get mapped “very close 
to” 0 and “very close to” %. We conclude that {£7} is not normal on U.  q.e.d. 


With the above proof we have two equivalent definitions of the Julia set of EF), 
A > 0. We now add a third. Recall that in our preliminary section we showed that 
the closure of the set of repelling periodic points was contained in J(F’) for any 
complex function F. We now show inclusion the other way for our function E,. 
We first state a variation of the Schwarz Lemma for use in our proof. 


Lemma 4.4 (Schwarz). Let U be open, bounded and simply connected in C. If 
F: C > C is analytic and if F(U) C U, then there exists a fixed point z, € U, and 
|F'(zo)| < 1. 


We are now ready to prove 
PRopositION 4.5. JCE,) © closure{repelling periodic points}. 


Proof. Let z) € JCE,) and suppose W is an arbitrary neighborhood of z,. We 
will show that there is a repelling periodic point in W. Since p, is a repelling fixed 
point for E,, the Inverse Function Theorem implies that there exists a neighbor- 
hood U, of p, on which E,' exists. In fact, we can find U, such that U, = 
E,'(U,) C U,. Continued iteration produces a nested sequence of neighborhoods 
of p,, U. DU, > U,D °:::. 

Montel’s Theorem implies that since {E,} is not normal at z,, then U,[EX*(W)] 
omits at most one point in C. Since U, [E;(W)] already omits zero (E¥(z) # 0 for 
all z € C), then there exists z, © W and m > 0 such that E\"(z,) = p,. Now since 
p, is a repelling fixed point, {£%"} is not normal on any neighborhood of p, and, by 
the same reasoning as above, there exists z, € U) and n > Osuch that E¥(z,) = z,. 
Therefore, E\"""(z,) = py. 

We know E/(z) = E,(z) # 0 for all z © C. Combining this fact with use of the 
chain rule, we conclude (£{")'(z\) # 0 and (EY)'(z,) # 0 for any m,n € Z. We 
apply the Inverse Function Theorem again to claim that there exists a neighbor- 
hood V of z, which is mapped in a one-to-one fashion onto a neighborhood of z, 
in W. Hence, EX(V) Cc W. Since (E3")(z,) # 0, we may use similar arguments 
(choosing V smaller if necessary) to show that E(%"*" maps V diffeomorphically 
onto U,; for some j. Further, Ey*""(V) = Uy since U, = (E,')(U,). Now 
Ex*+™*ry — U, is one-to-one and onto with local inverse (E7”*"*/)~'!. Using the 
Schwarz Lemma we conclude (E™*"*/)~' has an attracting fixed point in V. See 
Ficure 8. Equivalently, E"*”"*! has a repelling fixed point in V which implies E, 
has a repelling periodic point, z*, in V. But, EY maps V diffeomorphically onto a 
neighborhood of z, © W. Therefore, part of the orbit of z* passes through W. 
Now, since any point in a repelling periodic orbit is itself a repelling periodic point, 
we conclude that there is a repelling periodic point in W. 
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Fic. 8 


We comment here that although our proof specifically deals with E,(z), this 
argument may be adapted to show that J(F’) is the closure of the set of repelling 
periodic points for any function with at least one repelling periodic orbit. 

What does the Julia set of E, look like when A > 1/e? Recall from section 2 
that when A > 1/e, the function E,(x), x © R, has no fixed points. In fact, 
EX(x) — © for all values of x. Hence, in immediate contrast to the A > 1/e case, 
we have R CJ(E,). We can show that there are many other points in J(E,) as 
well. 

Let be a vertical line segment of length 27 in the left half plane (.e., far 
removed from the Cantor bouquet described in the previous section). Recall that 
E, maps this segment to a circle about the origin. This circle necessarily crosses 
the real axis twice. Hence, there are two points which now, upon further applica- 
tion of E,, tend to ». If we call these two crossing points x, and x,, we see that 
there must be two points, z, and z,, in - such that E,(z,) =x, and E,(z,) =x,. 
We now have Ex(z,) > © and Ex(z,) > © as n > «©, We continue choosing 
arbitrary line segments in this manner to produce pairs of points in the Julia set 
throughout the complex plane. In fact, we see that all horizontal lines of the form 
Im z = n7 where n © Z are contained in J(E,). 

Notice that we cannot use this argument when A < 1/e since in that case both 
z, and z, would be attracted to the fixed point q, instead of escaping to ». This 
roadblock is lifted precisely when A values increase through 1/e—hence, the 
explosion. 

With the above reasoning, we have shown that J(E,) for A > 1/e admits many 
more points than in the A < 1/e case. In fact, for these higher A values, the Julia 
set is the whole plane! \ 

Fatou first conjectured this in the early 1900’s and in 1981 Misiurewicz an- 
swered it by proving that the Julia set of E(z) = e? is the entire complex plane 
[M]. Recently, Sullivan [S] proved the No Wandering Domains Theorem which can 
be used to show J(E,) = C for all A > 1/e. He showed that we need only look at 
the behavior of certain “singular” orbits of a function to learn about its Julia set. 

There are two distinct types of singular points for a complex analytic function. 
One type is a critical point—as in calculus, the point Zz, is a critical point if 
F'(z,) = 0. Its image F(z,) is called a critical value. The other type of singular 
point is an asymptotic value. A point is called an asymptotic value if there exists a 
curve y(t) which tends to ~ as t — ft, and which satisfies 


lim F(y(t)) = Zp. 
tly 
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Clearly, 0 is an asymptotic value for E, (take any curve tending to with real part 
tending to — ©). 
A version of Sullivan’s Theorem states: 


THEOREM 4.6. Let F be an entire function with only finitely many critical and 
asymptotic values. Suppose the orbits of all critical points and asymptotic values of F 
tend to ©. Then JCF) = C. 


CorRo.LuarRy 4.7. If A > 1/e, JCE,) = C. 


Let us briefly summarize the explosion in J(E,). When A < 1/e, E, admits two 
fixed points, gq, and p,. The repelling fixed point p, has a “hair” attached: the line 
segment [ p,,%) lies in the Julia set. In this case, the singular value tends to q, 
under iteration and thus is blocked from escaping to ©. 

As A approaches 1 /e, the two fixed points coalesce at g, = p, = 1, with the line 
[1, ©) still lying in J(E,). 0 still does not reach this curve under iteration. 

When A > 1/e, two things occur. First, g, and p, disappear from the real line. 
Actually, one may check that the two fixed points appear as complex conjugate 
fixed points in the exterior of the unit disk. Hence these points have complex 
derivatives with modulus > 1, so nearby points tend to spiral away from them. In 
particular, this separation now allows 0 to hop onto the hair that formerly was 
attached to p,, and so E"(0) — . See Ficure 9. 


Fic. 9. E\(0) — © when A > 1/e. 


Although E, no longer has fixed points on the real line, there are two repelling 
fixed points in the complex plane. Hence, we may apply Theorem 4.5 to conclude 
that J(E,) is the closure of the set of repelling periodic points for A > 1/e as well 
as A < 1/e. 


5. Other Chaotic Bursts. Our work in the preceding sections suggests that 
similar explosions occur if we vary parameters in other transcendental functions. It 
turns out that this is indeed the case. For example, let us consider the function 
Cz) = iA cos(z) where A € R. We recall from elementary complex analysis that 


iA 
C,(z) = z fe" +e '*), 
If we initially restrict ourselves to the imaginary axis, we have 


iA 
C,(iy) = z fe? +e”) = iAcosh(y). 
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Hence C,(z) preserves the imaginary axis just as E,(z) preserved the real axis. 

As in the case of E,, this family of maps admits a saddle-node bifurcation when 
A =v = .66274... . See Figure 10. If we consider the case when A < v, we again 
have an attractor, q,, and a repeller, p, which now appear on the imaginary axis. 
Through analysis similar to that done previously we conclude that [ p,, 0) Cc J(C,). 
We can continue as before and construct a Cantor bouquet centered about this 
hair. In addition, due to the fact that C, is even and 277-periodic, we see that the 
Julia set consists of two rows of Cantor bouquet “fingers” which extend towards 
each other from the upper and lower half planes. The tips of these fingers can be 
seen in FIGURE 2a. We note that as long as A < pv, there will always be a strip 
containing the real axis which is contained in the basin of attraction of q, and that 
J(C,) will be a closed, nowhere dense subset of the regions above and below this 
strip. 


Fic. 10. The graphs of A cosh(y) for A < v and A > v. 


Now suppose A > v. We see from FiGure 9 that all points on the imaginary axis 
or its 27 translates now escape to © upon iteration of C,. Hence, each of these 
vertical lines is contained in J(C,). Further, we notice that the real axis is mapped 
onto the imaginary axis making it a part of the Julia set as well. Consequently, both 
critical values of C,, +iA, escape and Sullivan’s Theorem guarantees that J(C,) = 
C in this case. Thus we have 


THEOREM 5.1. Let v = .66274... denote the A-value for which iA cos z admits a 
saddle-node bifurcation. If 0 <A <v, then J(C,) is nowhere dense. If A > v, 
WC) =C€. 
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LETTERS TO THE EDITOR 


To the Editor: 


I came out of reading the article A New Derivation of Stirling’s Approximation to n! 
in the November, 1990, MONTHLY with the definite feeling that the derivation is 
not new. I’ve recently delivered classroom lectures on the derivation of the 
asymptotic development of I'(z), following Copson’s Asymptotic Expansions, §25. 
The Marsaglias follow almost the same path, only using the integer m and n! where 
Copson uses the complex variable p and I'(p). Their equation (3) for determining 
the coefficients a, is essentially that presented by Copson. While their ultimate 
expression of n! as a “series” is not in Asymptotic Expansions, it can be viewed in 
Copson’s Theory of Functions at the end of essentially the same derivation. 

What they have left out is any reason why their calculation should have anything 
more than a formal significance. They don’t prove that the expansion is asymptotic, 
although they do mention that property, but it is the asymptoticity and the 
possibility of obtaining error estimates that is the goal of teaching about Stirling’s 
formula in this century. After all, 


eT ~ 0 + Ox + Ox? + Ox + +++ +0" + 0( x") 


as x > 0+, but we'll not get any idea of the value of e~!/** by evaluating this 


expansion for any x > 0, no matter how large an n we choose. Even in a Calculus 
II course, where Watson’s Lemma would not be at home, quotations of error 
bounds and illustrations of their use are appropriate and desirable. 

To sum up: the authors’ derivation is not new, and what they have derived is 
insufficient for honest numerical calculation. 


Sincerely yours, 


Nathaniel Grossman 
Professor of Mathematics 
UCLA 
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To the Editor: 


Some readers have asked for references pertaining to my paper “A birthday 
present for Ramanujan,” this Monthly 97 (1990) 398-400. The most accessible 
single reference is Ramanujan Revisited, Proceedings of the Centenary Confer- 
ence, University of Illinois, Academic Press, 1988. The chapters by Dyson and by 
Garvan make wonderful reading and contain additional references. 


Michael D. Hirschhorn 
University of New South Wales 
New South Wales, Australia 


To the Editor: 


In my article on the Teaching of Mathematics entitled “The /, matrix norm and 
the spectral radius” (this MONTHLY, April, 1990) there is an error which has been 
pointed out to me by Prof. A. K. Yeylos of Ioannina, Greece. The statement of 
part 1 of Theorem 1 must be changed to read as follows: 

1. max (x‘Ax) = 4 maximum eigenvalue of A' + A. 

\|x|]=1 

As a result, the application of this theorem in the proof of Corollary 1 must be 

changed also so that the second line of the proof is replaced by: 


+ maximum eigenvalue of | A‘A + (A‘A)'| 


= + maximum eigenvalue of [2.A'A] = p( A‘A), 
where the final equality results from the fact that A'A has no negative eigenvalues. 


There is also a typographical error in the proof of Corollary 2: ||x|| was printed 
instead of ||x|] = 1, as in the statement of Theorem 1. 
Sincerely, 


Robert A. Leslie 
Agnes Scott College 


To the Editor: 


I wish to add a footnote to Paul Halmos’ charming review article on Progress in 
Mathematics, a panorama of pure and applied mathematics in the last 75 years. In 
his brief report on the Bieberbach conjecture, Halmos credits Loéwner with having 
proved the conjecture for n = 3. Lowner has done much more than that; he has 
shown that every schlicht function can be embedded in a one parameter family and 
has given an explicit description of all these families. Bieberbach, the editor of the 
Zeitschrift where Lowner’s article appeared, was so enthusiastic about Lowner’s 
idea and so convinced that it would lead to a solution in the general case 
that, without consulting the author, marked the paper I, the first of a series. 
Bierberbach was over-optimistic; it took more than 60 years before the conjecture 
was settled, but LOwner’s representation and point of view played a decisive part 
in de Branges’s proof. 


With best regards, 


Peter D. Lax 
New York University 
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EpItED BY DENNIS DETuRCK, DAvip J. HALLENBECK, AND RODICA SIMON 


Algebraic Characterization of Compact Abelian Groups 


M. V. DESHPANDE 
Department of Mathematics, Indian Institute of Technology, Bombay-400076, India 


SUDHIR R. GHORPADE 
Department of Mathematics, Purdue University, West Lafayette, IN 47907 


The well-known Banach-Stone theorem gives a nice algebraic characterization 
of compact Hausdorff topological spaces. One might wonder if the topological 
spaces are also endowed with a (compatible) group structure whether one can 
obtain a similar characterization for them to be the “‘same’”’, i.e., to be topologically 
isomorphic. The purpose of this note is to give one such characterization which 
doesn’t seem to have made its way into the literature yet. We begin by stating the 
result of Banach [1, p. 170] and Stone [7, p. 469] in a somewhat different form than 
the traditional version. 


THEOREM 1. Let X and Y be compact Hausdorff topological spaces. Then the 
following statements are equivalent. 


(1) X and Y are homeomorphic. 
(2) There exists an algebra isomorphism of C(Y ) onto C(X). 


Here CCX) (resp: C(Y)) denotes the space of complex valued continuous 
functions on X(resp: Y ). These spaces clearly form an algebra (i.e., a vector space 
which is also a ring such that the scalar multiplication behaves nicely with the ring 
multiplication) over C. In the traditional version the condition (2) requires a linear 
isometric isomorphism instead of an algebra isomorphism and proofs of this 
version could be found in [2, p. 442] or [5, p. 153]. Our proof of the result for 
topological groups would include a proof of Theorem 1 that is essentially as in [4] 
but does not use the language of Banach algebras and instead depends only on the 
following elementary fact, the proof of which is left as an exercise. 


Lemma A. If X is any compact topological space and M is any maximal ideal of 
the ring C(X), then there exists a unique u € X such that M = {f © CCX): flu) = O}. 


We now recall the notion of ‘a character on a topological group G which 
by definition is a continuous homeomorphism from G into the “circle group” T° = 
{z © C: |z| = 1}. The characters of G form a subgroup of C(G) which is usually 
denoted by G. We will use the following basic 


THEOREM 2. Characters of a locally compact (Hausdorff) abelian group G 
separate the points of G, 1.e., 


x€G,y€G,x #y = there exists xy € G such that x(x) # x(y). 


For a proof of this theorem we refer to ((4], p. 120). Let us also remark that if G 
is a compact group then G is discrete. Without further ado let us state and prove 
our main 
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THEOREM 3. Let G and H be compact (Hausdorff) abelian groups. Then the 
following statements are equivalent. 


(1) G and H are topologically isomorphic. 


(2) There exists a character preserving algebra isomorphism of C (H) onto C(G) 
(i.e., H is mapped into G). 


Proof. If h: G > H is a topological isomorphism (resp: homeomorphism) then 
it is easy to see that the map T: C(H) — C(G) defined by T( gx) = g(h(x)) for 
g © C(A), and x €G gives a character preserving algebra isomorphism (resp: 
algebra isomorphism). Conversely if T: CCH) — C(G) is any algebra isomorphism 
then we will show that T is given by g~>geh for some homeomorphism h: 
G — H which is a topological isomorphism if 7 is character preserving. Given any 
x €G let M, denote the ideal {g € C(H): T(g)(x) = 0}. Now M, is clearly a 
maximal ideal being the kernel of the homomorphism g — 7T(g)(x) whose image 
is the field C. Hence by Lemma A there exists a unique y = A(x) € H such that 
M, ={g © C(H): g(y) = O}. In fact T( gx) = g(y) = g(A(x)) for every g € 
CH ) since g — g(y)1,, is always in M,. Similarly, given any y © A the ideal 
M,={f © C(G): T~'(f Xy) = O}, is a maximal ideal and we can find a unique 


x=h(ye G such that M,= {f < C(G): f(x) = O}. It is easy to verify that the 
maps / and fh defined this way are inverses of each other, 1e., hoh = id,,, and 
hoh = id. Thus h is bijective. Now given any x, © G, by Urysohn’s lemma, there 
exists a neighbourhood V of h(x,) and some g € C(H#) such that g(hA(x,)) = 1 
and g(u) = 0 for all u © H\ V (note that H is normal being compact and 
Hausdorff). Since the set U = {x € G: T(g\Xx) # 0} is an open subset of G such 
that x, € Uand A(U) CY, it follows that / is continuous. Now both G and H are 
compact Hausdorff, and h: G — H is a continuous bijection, hence we conclude 
that 4 is a homeomorphism. This proves Theorem 1. Now assume that 7 preserves 
characters. Let x, € G,x, © G be given. We have to show that h(x, + x,) = 
h(x,) + h(x,), but this is quite easy—thanks to Theorem 2. So let y € H be any 
character. Then 


x(h(x, + x2)) = (veh)(x, + x2) = T(x) (x, + x2) 
and since T(y) is a character, the expression on the left equals 


T(X)(*1)T(X) (42) = x(h(x«1))x(h(x2)) = y(h(x,) + h(x>)). 
This proves that # is a homomorphism and hence a topological isomorphism. O 


REMARK. Note that an argument similar to the one used above together with 
Theorem 8 on page 442 of [2] would show that if G and H are as in Theorem 3 
and if T is a linear isometry of C(H) onto C(G) which preserves characters then 
G and dH are topologically isomorphic and in fact, there exists a topological 
isomorphism h: G — H and a character y: G — TI such that T(g) = y(g eh) for 
every g © C(#7). Conversely any such T is clearly a character preserving linear 
isometry of C(H) onto C(G). 


Example. Let us consider one of the simplest examples of a compact abelian 
group, namely the circle group [ = {z € C: |z| = 1}. So we are taking G = H =T. 
Now it is easy to see that the characters of I are precisely the functions z > z” 
for n € Z and the only topological isomorphisms among these are those functions 
for which n = +1. Hence by the above theorem it follows that the only algebra 
automorphisms of C(T) which preserve these countably many characters are the 
identity and the “conjugation” f(z) > f(Z). Also in view of the above remark we 
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see that the only character preserving linear isometries of C(I) onto itself are 
f(z) > z"f(z) and f(z) — z"f(Zz) where n varies over the set of all integers. 


Remark. If G and H are locally compact (Hausdorff) abelian groups then we 
can obtain a result similar to Theorem 3 with C(G) and C(H) replaced by C,(G) 
(viz., the algebra of continuous functions on G which “vanish at infinity”) and 
C,(H) respectively. 


REFERENCES 


1. S. Banach, Theorie des Opérations Linéaires, 2nd ed., Chelsea Pub. Co., New York, 1963. 
2. N. Dunford and J. T. Schwartz, Linear Operators, Part 1: General Theory, Interscience, New York, 
1958. 
3. S. R. Ghorpade, Compact Groups and Almost Periodic Functions, M.Sc. Dissertation, Indian 
Institute of Technology, Bombay, 1984. 
4, R. Larsen, Banach Algebras, Marcel Dekker Inc., New York, 1973. 
. J. Lindenstrauss and L. Tzafriri, Classical Banach Spaces, Springer-Verlag, Lecture Notes in 
Mathematics no. 338, New York, 1973. 
. L. Loomis, An Introduction to Abstract Harmonic Analysis, D. Van Nostrand, Princeton, N.J., 1953. 
. M. H. Stone, Applications of the theory of Boolean rings to general topology, Trans. Amer. Math. 
Society, 41 (1937). 


N 


TN 


First-Class Functions 


G. MYERSON 
Department of Mathematics, Macquarie University, New South Wales, Australia 2109 


In a recent issue of this MONTHLY, J. Fabrykowski [1] posed the problem of 
finding a sequence of functions continuous on [0, 1] whose pointwise limit is finite 
on the rationals and infinite on the irrationals. To analyze a proposed solution to 
the problem, he required several facts from number theory; in particular, facts 
about continued fractions and diophantine approximation. In fact, there are simple 
solutions to Fabrykowski’s problem which use no number theory; we give one 
below. Having removed all number theory from the answer, we proceed to banish 
it from the question, replacing the rationals with an arbitrary F, (that is, a 
countable union of closed sets). The main theorem is the following. 


THEOREM 1. There is a sequence of functions continuous on [0, 1] whose pointwise 
limit is finite on S and infinite on the complement of S if and only if S is an F,. 


I would like to thank my colleague, Béla Brindza, for insisting that we look for 
such a generalization, and for many fruitful discussions on the topic. 

Let us first take care of the rationals. Gelbaum and Olmsted, in Example 4 of 
Chapter 7 of [2], give a sequence of continuous functions converging pointwise to 
the function f given by 


f(x) = Oif x is irrational, 
f(p/a) = 1/q if p and q are relatively prime integers, q> 0. 


Taking reciprocals yields a sequence of continuous functions solving Fabrykowski’s 
problem. The construction in [2] makes no use of continued fractions, diophantine 
approximation, or other number-theoretical devices. The following construction is 
similar to that of [2], and arguably even a bit simpler: Enumerate the rationals in 
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(0,1) as r,,r5,..., and let f, be the piecewise linear function whose graph passes 
through the points (0,0), (7,,1),...,(7,, 2), and (1,0). If x is rational then x is 0 
or 1 or r, for some k, and lim, _,,, f,(x) is 0, 0, or k, respectively—in any event, 
lim, .. f,(x) is finite. If x is irrational, then for n sufficiently large the set 
{r,,1,---,¥,} contains elements less than x and elements greater than x (since 
the rationals are dense in [0, 1]). Let r,,,. be the greatest of the elements less than 
x, and r,,,, the least of the elements greater than x; then f(x) > min{/(n), u(n)}. 
Now /(n) and u(n) both go to infinity with n (again, because the rationals are 
dense in [0, 1]), so lim, ,.. f,(x) = @. 

We believe that the problem of the preceding paragraph was first posed and 
solved over fifteen years ago, but we have not been able to locate a reference. 

We now turn to the generalization. A function on an interval J is said to be of 
the first (Baire) class on J if it is the pointwise limit of a sequence of continuous 
functions on J. It is easy to see that there is a sequence of continuous functions 
whose pointwise limit is finite on a given subset S of [0,1] and infinite on the 
complement of S if and only if there is a function of the first class on [0, 1] whose 
zero-set is the complement of S. For suppose f,,n = 1,2,..., satisfies 
lim, +. f,(x) finite for x is S, infinite otherwise; then lim, _,,.1/(. + |f,|) is the 
desired function of the first class. Conversely, if f is of the first class, and S$ = 
{x: f(x) # O}, let f,,n = 1,2,..., be a sequence of continuous functions converg- 
ing pointwise to f; then the sequence 1/(|f,| + 1/n), n = 1,2,..., gets the job 
done. Thus Theorem 1 is equivalent to 


THEOREM 2. A necessary and sufficient condition for the existence of a function f 
of the first Baire class on [0,1] such that {x: f(x) # 0} = S is that S be an F, set. 


Proof (of the necessity part of Theorem 2). Let f be of the first class on [0, 1], 
and let f,, f,,... be a sequence of continuous functions converging pointwise to f. 
Let x be such that f(x) > 0; then f(x) > 1/t for some positive integer t, whence 
f, (x) > 1/2t for all n sufficiently large. Conversely, if there is a positive integer ft 
such that f(x) > 1/2¢ for all n sufficiently large, then f(x) > 0. So, 


(x: f(2) > 0} = UU 


which is a F... Similarly {x: f(x) < 0} is an F,, so {x: f(x) # 0} is an F,. 
The sufficiency part of Theorem 2 follows from a theorem attributed by van 
Rooij and Schikhof [3, Theorem 11.12] to Lebesgue: 


THEOREM 3. Let I be an interval and let f: I > R. Then f is of the first Baire class 
on I if and only if f~‘(U) is an F,, for every open U in R. 


A proof can be found in [3]; it is too long to reproduce here. 


Proof (of the sufficiency part of Theorem 2). Let S be an F,, set. We write 
S =A,UA,U... where the A, are closed sets; we may assume ¢ = Ay CA, C 
A,c::-, Define f by 


1/k ifx € A, —A,_,,k =1,2,..., 


1 
0) fxEésS. (1) 


Fx) = { 


Note that f-'(1/k) = A, 1 A,_, is the intersection of a closed set and an open 
set, hence both an F, and a G, (a G; set is a countable intersection of open sets). 
We are done if we can prove that f is of the first class. 
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Let U be open in R. If U does not contain 0, then f~'(U) is a countable 
(possibly finite, possibly empty) union of sets of the form f~'(1/k), each of which 
is an F,; hence, f~'(U) is an F,. 

If U does contain 0, then U contains all but finitely many of the numbers 
1/k,k =1,2,.... Thus f~'(U) is the complement of a finite union of sets of the 
form f-'(1/k). Each f~'(1/k) is a G;, so the finite union is a G,, so again 
f-'(W) is an F,. Thus, by Theorem 3, f is of the first class. 

We note that the necessity part of Theorem 2 follows immediately from 
applying the forward implication of Theorem 3 in the case U = R — {0}; in fact our 
proof of necessity is essentially the proof given in [3] for the forward implication of 
Theorem 3. 


Theorem 3 is quite elegant, but there are two drawbacks to using it here. One, if 
we rely on it, the proof of Theorem 2 is not self-contained. Two, it does not 
construct the sequence of functions converging to the first-class function in 
question. To overcome these drawbacks, we show how, given an arbitrary F, set S, 
we can construct the function f of Theorem 2 from scratch. 

As before, let S =A, UA, U-:: bean F,, with 6 #A, CA, C °::: . For x 
in [0,1] and A closed and non-empty we write d(x, A) for min, ,|x — y|, the 
distance from x to A. 

Now let 
— nd(x,A,,)], 


fin,n(*) = max 0, 


m,n positive integers. We note that f,,,, is continuous, bounded above by 
1/m(m + 1) and below by 0, and equal to 1/m(m + 1) on A,,. AS n > &, fi, 
approaches (1/m(m + 1))y,,, where 


1 ifx eA, 
0 else 


inl) = { 
is the characteristic function of A,,. 
Let fy = Um-ilm.n The series converges uniformly in x, so f, is continuous. 
Let g = lim, _,., f,. We will show that g is in fact the function f defined by (1), 
thus giving f explicitly as a pointwise limit of continuous functions. 
Let x be in [0,1]. Then 


g(x) = lim f,(x) = lim YO frp, a(X). 
n->% NO 4 
Now ; 


oO, 


1 % 1 
<— CS d ——_—_—_—— < 
fm nl 2) m(m + 1) an 2» m(m + 1) 


so, by the dominated convergence theorem, we may interchange limit and sum; 
thus, 
o@) o@) 1 


g(x) = Elim fy(*) = LD 


——— Xm{ x). 
mol mop mm + 1) 


Now if x is not in S then y,,(x) = 0 for all m so g(x) = 0 = f(x). If x is in S 
then let x be in A, (and thus in A,, for all m > k) but not in A,, for any m < k; 
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then 

° 1 ° 1 1 

g(x) 22 m(m + 1) Xm(%) 2 mm+1)  k P(x). 
Hence, g is identical to the function f given by (1). 

So we have removed a problem from the realm of number theory, and posi- 
tioned it in real analysis, where it belongs. Or does it? The concepts central to 
Theorem 2—first Baire class and F, sets—are purely topological. There ought, 
then, to be a theorem of this type: let A and B be topological spaces with such 
and such properties, and let p be a point in B. Then a necessary and sufficient 
condition for the existence of a function f: A — B of the first Baire class such that 
S = {x € A: f(x) # p} is that S be an F, set. We leave it to the reader to explore 
the possibilities. 
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On Series of Partial Fractions 


P. D. BARRY AND D. J. HURLEY 
Department of Mathematics, University College, Cork, Ireland 


Examples of explicit expansions in complex Fourier series 


f(o)= db k,e'" (1) 
n=—-o 
are commonly obtained in one of two ways, either 1(a) by putting z = e”® in an 
explicit expansion 


F(z)= > k,z" 
n=—o 
in a Laurent or power series that we know; or 1(b) by using the Euler coefficient 
formula 


‘ 


1 27 
k, = >= aye" dé. 
n= arf f(8)e 


For examples of (1a) see e.g. Knopp [2, pp. 454-5]. For examples of 1(b) see, 
e.g., Hsu [1, pp. 54-56], O’Neil [3; p. 1057], or Pipes [4, pp. 55-58]. From the many 
examples in books of the more common Fourier series of the form 


f(0) =4a,+ Ya, cos né + b, sin nd, 


n=1 


expansions of the type (1) can be deduced on using the formulae 


| | 1. | 
cosné = 5(e'"? +e°'"*), — sinné = 5, ¢e" — eine), 
i 
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Suitable functions f for use in (1b) are polynomials, exponentials e%°, finite series 
formed from sums and products of these, and sectionally continuous functions 
which are of the latter form in each interval of continuity. 

Examples of explicit expansions in series of partial fractions of the form 


oO 


k 
sf= LY — (2) 


n--oS TH 
Or 
2  (-1)"k 
h(g) = dL em (3) 


can be obtained by methods involving the calculus of residues; see, e.g., 
Titchmarsh [5, pp. 110-113] or Whittaker and Watson [6, pp. 134-136]. 

As usually treated, 1(a), (2) and (3) on the one hand, and (1b) on the other seem 
to belong to different areas of mathematics, the former to complex analysis and 
the latter to real analysis. We provide a link whereby expansions in (1b) yield ones 
of type (2) and (3). 


THEOREM. If 


fo) = XL ke 


then 
Wdg= 4 
eorit — |] is fe O= ben’ ( ) 
and 
*  (-1)'k, 


—— | faye de= ¥ , (5) 


n=—o 


oe C$+n 


when the relevant termwise integrations, over [0,27] and [—7,7], respectively, are 
valid. 


These conclusions will become apparent on performing the termwise integra- 
tions. For examples, we note that the integrations on the left-hand sides in (4) and 
(5) can be performed for the class of functions f that we described above as being 
suitable for (1b). 

As a first example we take f ‘with period 27 and satisfying 


f(0)=7-6, for0 <6 < 2m. (6) 
Then we find that for 0 < 0 < 27, 


00 1 . 
f(@) — » — ein, 
now in 

where the prime denotes that the term corresponding to n = 0 is omitted, and so 
(4) yields 

i T 1 1 

—-— +z - a cotiTs = — . 
an 4 é noon ot+n 


This gives, completely elementarily and by a general method, Euler’s celebrated 
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expansion of 7 cot 7 in partial fractions (see [5, p. 113], [2, p. 207]). As for (6), 
f(@) = -(@+7), for -7<6<0, 

(5) gives the corresponding expansion for 7 cosec 7é (see [5, p. 113]). 
As a second example, we let0 <A <p <7, 
f(@) = 1, A<O0<yp, 
f(@) be 0 otherwise in (0,27), and f have period 277. Then we find, by (4) and (5), 
that 


eths — ert w—ar in 1 | | 
a 7 ae yy —— [ein _ eine) 
f[e?7* — 1] 27g na-«0 27M Ct+n 
ets — ei yy — = (-1)" 1 


a [ei _ ein | 
qin C+n 


de 
n=—o 


ImilsingTl 20 


Clearly we can do an unlimited number of examples of this type. Both (4) and 
(5) can be subsumed under a more general formula if we integrate over [a, a + 277]. 
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A Note on Dixon’s Proof of Cauchy’s Integral Theorem 


PETER A. LogesB* 
Department of Mathematics, University of Illinois, Urbana, IL 61801 


In this note, we establish an elementary result in complex function theory and 
apply that result to simplify John Dixon’s elegant proof [2] (also see [1] and [3]) of 
the general Cauchy integral theorem and formula. Dixon’s proof uses local Cauchy 
theory. It is based on the fact that if f is analytic on a region G and y 1s a closed 
rectifiable curve in G, then the integral 


i fiw) -f() 
——_—__—— dw 
(with the integrand replaced by f'(w) when z = w) is an analytic function of z on 


all of G. A simple proof of this fact is given here, and we outline its application in 
Dixon’s proof of Cauchy’s integral formula and theorem. 


*The author’s work was supported in part by a grant from the U.S. National Science Foundation 
(DMS 87-02064). 
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Throughout this note, G will be an open set in the complex plane C, and y will 
be a closed rectifiable curve in G. We write f © H(G) if f is holomorphic, Le., 
analytic, on G. We use the notation D(z, r) for the disk {w € C: |w — z| <r}; we 
write C(z,r) for the circle {w € C: |w — z| =r}. The trace of the curve y in the 
complex plane is denoted by {y}. We begin with our elementary result; it is an 
application of the Cauchy integral theorem for disks. The author is indebted to 
Professor John B. Conway for a simplification of its proof. 


PROPOSITION 1. Given z € {y}, there is a closed curve o in G with z € {oa} such 


that f,f = [,f for all f © H(G). 


Proof. We assume that there is a point ¢ # z with ¢ € {y}; otherwise the result 
is trivial. Pick r > 0 so that D(z,2r) C G and ¢ € D(z,r). We will assume that y 
is given by y(t) for t € [0,1] and y(O) = y() = ¢. By the uniform continuity of the 
mapping y, 46 > 0 such that if s,¢ € [0,1] and |t — s| < 6, then |y(t) — y(s)| <r. 
Let U be the inverse image under y of the disk D(z, r). Each component of U is 
an open interval in (0,1). If (a, b) is a component of U, then by continuity, y(a) 
and y(b) are points on C(z,r). Moreover, if (a, b) contains a point ¢ such that 
y(t) = z, then b — a > 26 since |y(b) — y(t)| =r and |y(t) — y(a)| = r. Thus, 
there are only a finite number of components (a,, b;), 1 <i<n, of U containing 
points ¢ with y(t) = z. We replace the curve y on the closure [a,, b;] of each such 
component with the arc on the circle C(z,r) going from y(a;) to y(b;) in the 
positive direction. By Cauchy’s integral theorem, applied to the disk D(z, 2r), this 
replacement does not change the value of the integral for any f © H(G). The new 
path o avoids z. O 


The next result, Proposition 2, contains the application of Proposition 1 to 
Dixon’s proof: Unlike published proofs of this result, the proof here does not need 
the joint continuity of the function ~ defined below nor does it employ Fubini’s 
theorem. 


PROPOSITION 2. Let f be analytic on G, and let ¢— be the mapping of G X G into 
C defined by setting p(w, z) = (f(w) — f(z))/(w — z) for w #z and g(w,w) = 
f'(w). Then for each z € G, g(-,z) is analytic on G. Moreover, the function g 
defined by setting g(z) = [,e(w, z) dw is analytic on G. 


Proof. Given z € G, the function g(-, z) is analytic on G — {z} and continuous 
at z. From local Cauchy theory (e.g., [3], Theorem 10.14), it follows that o(-, z) € 
H(G). If z € {y}, then 


g(z) = Jo(w,z) dw = [fw /( ~z) dw — f(z) f/m ~ z) dw. 


It is easy to see that g is analytic on G — {y}. Now fix a point z € {y}. By 
Proposition 1, we can replace y with another curve o that misses z, and therefore 
an open disk D(z,«) about z, without changing the values of g on D(z, «). It 
follows that g is analytic on D(z,¢), and thus on G. O 


To outline Dixon’s proof of Cauchy’s integral theorem and formula, we need to 
recall that the index n(y,a) of y at any complex number a ¢ {y} is given by 
ny, a) = (1/27i)f{,1/(w — a) dw. The index is an integer valued function that is 
constant on the components of the complement of {y} and 0 on the unbounded 
component. A cycle [ is a finite set of closed, rectifiable curves {y,;: 1 <i <n}. 
The trace of T, {T}, is U Ay,}. For any function f continuous on {IT}, the integral 
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Irf = U;f,,f. Moreover, for a € {T}, nW, a) = L,n(y;, a). We say that T is a cycle 
in an open set G if {T} CG. 


THEOREM. Let G be an open subset of C, and let T be a cycle in G such that 
n(T,a) = 0 for each point a € G. Then we have Cauchy’s Integral Formula 


f(w) 
ami J 


W—Zz 


Vf <= H(G), Vz eG — {JT}, f(z) -n(T,z) = 


W, 
and Cauchy’s Integral Theorem 
Vf Ee H(G), [ f(») aw = 0, 
r 
Proof. For the Integral Formula, let f@H(G), and let go and g be the 
mappings of Proposition 2. Let H = {z = C — {I}: n(7, z) = 0}. Then A is an 


open subset of C such that G U H = C. Let h be the analytic function defined on 
H by setting 


nz) = [dw wen. 
At all points z € GN H, we have 
f(z) 
g(z) = A(z) ~ f —— 
Therefore, we may extend the function g to an entire function by setting g(z) = 


h(z) for z € G. Since lim, _,,, g(z) = 0, g = 0 on C by Liouville’s theorem. It 
follows that for all points z € G — {T}, 


> dw = h(z) — f(z): 2mi-n(T,z) =h(z). 


t . f(z) 1 , f(w) 
-n(T,z) = — dw = —— d 
M2) (V2) = 5 Jaw Ini sew oe " 
To establish Cauchy’s Integral Theorem, we fix a € G — {T} and note that 
w 

[f) dw = [er ow = 27i-n(T,a):f(a):(a-—a)=0. O 

r w-a 
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THE TEACHING OF MATHEMATICS 


EpItEp BY MELVIN HENRIKSEN AND STAN WAGON 


The Self-Paced Calculus Program at Berkeley 


Murray H. PROTTER 
Department of Mathematics, University of California, Berkeley, CA 94720 


I. Introduction. About twenty years ago there were many experiments with 
self-paced courses at the beginning college level. The University of California was 
no exception; in 1971 physicist Robert Karplus and I introduced a self-paced 
course that combined first-year physics and first-year calculus. We wanted to 
remedy the justifiable complaint of many physics teachers that their students never 
seem to learn the mathematics needed for understanding principles of physics until 
after these principles are taught in the basic physics course. Physics instructors 
frequently must teach bits of mathematics to students who otherwise would be 
bewildered by the physics being presented. 

By interweaving mathematics and physics we achieved in our interdisciplinary 
self-paced course a program that taught both subjects in a logical and consistent 
manner. The course was taught for two years and then dropped because combining 
physics and mathematics in one course put too heavy a burden on the students. 

As I was preparing the materials for the joint mathematics-physics course, I 
realized that there might also be a place in the mathematics department for a 
self-paced alternative to the regular first-year calculus course. It was not difficult 
to adapt the mathematics portion of the joint mathematics-physics course into a 
straight calculus sequence, and I announced an experimental self-paced one-year 
calculus and analytic geometry course to begin in the fall term of 1972. The initial 
enrollment was 85. Students were enthusiastic about this form of instruction; word 
spread and within a few years enrollment grew to approximately 500, the maximum 
we can handle with the facilities we have available. Now, after 18 years the 
self-paced calculus course is firmly established in the Berkeley curriculum with 
about one-third of the students opting for the self-paced course and two-thirds 
taking the usual lecture-discussion section type of course. 


2. How the self-paced course works. The school year at Berkeley consists of 30 
weeks of instruction plus examination periods. I decided to divide the syllabus for 
first-year calculus with analytic geometry into 30 equal parts, called segments. A 
student proceeding at a normal pace would complete one segment each week, of 
course. An institution with an academic year with length different from ours could 
easily add or subtract segments without disturbing the continuity of the course. At 
Berkeley we use a standard calculus text and a specially prepared pamphlet which 
contains 30 Study Guides,* one for each of the 30 segments. 

One faculty member is in charge of the course; however, he is aided by several 
teaching assistants and undergraduate tutors. There are no lectures or section 


*Between 1972 and 1988, the text used in the self-paced course was Protter and Morrey, College 
Calculus with Analytic Geometry, second and third editions. Beginning in the fall of 1988 the text was 
Protter and Protter, Calculus with Analytic Geometry, fourth edition. The pamphlet is Student Study 
Guide to Accompany the Self-Paced Program of Calculus with Analytic Geometry, Jones and Bartlett, 
publishers (Boston). 
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meetings. Each study guide has (i) a set of objectives, (ii) a reading assignment in 
the text, (iii) one or two paragraphs providing the student with study hints, (iv) a 
set of required problems in the text, (v) additional average problems for the 
student who needs them, and (vi) additional challenging problems for the above 
average student. 

The course requires the use of two fairly large classrooms for most of the day at 
least three days a week. One classroom is used exclusively for test-taking and the 
other for tutoring. Typical students start by reading the assignment described in 
Study Guide 1. After they believe they have mastered the objectives and solved 
enough of the problems to be convinced that they are prepared, they go to the 
testing room, where a teaching assistant gives them a test for Study Guide 1. Each 
student takes the test with normally no time constraints on how long he or she 
works. After finishing the test he* sits down with a teaching assistant to go over 
the examination. Typically there are three problems on a test; getting one problem 
wrong constitutes failure. (Passing the test means that, except for minor errors, all 
the problems are done correctly.) If the student passes he goes on to Study Guide 
2. If he fails, he and the grader discuss both the reasons for failure and in what 
way the objectives of that study guide have not been met. Next, the student studies 
the material again, returning on a subsequent day to take an equivalent test for 
Study Guide 1. (There are at least four equivalent tests available for each Study 
Guide. Also, a key to every test is available to help the teaching assistant when he 
grades a test together with the student.) Thus a student can proceed to the next 
Study Guide only when he has mastered completely all preceding Study Guides. 

The tutoring room is just that. Any student can drop in at any time the room is 
open and get individual help from a tutor. Tutors encourage the students to read 
the text carefully and to discuss with the tutor passages they find difficult. Of 
course, tutors help students with homework problems when such help is requested. 
Since homework is not graded, the students ask questions about such problems 
only when they need help in achieving the objectives of the Study Guide. 


3. Grading and units credit. At Berkeley the regular first-year calculus course 
provides four units credit each semester. Thus, if a student completes 15 segments 
of the self-paced course each semester, he is credited with four units. However, the 
course actually is denoted a “variable credit course” and, through an arrangement 
with the registrar, the instructor assigns the number of credits a student receives 
(as well as a grade) at the end of each semester. There is a minimum of two units 
credit for which a student must enroll at the beginning of the term; thus he is 
required to complete seven or eight segments during the semester.** However, if 
he completes more than the minimum, he will receive credit for the actual number 
of segments he has completed. 

The final examination, which every student is required to take, does pose a 
problem. Each term the faculty member must make up an examination with thirty 
problems—one corresponding to each segment. In fact, we announce that the 
number of the problem on the final examination identifies the segment from which 
the material has been selected. Then each student is required to answer only those 
problems which cover the segments he has completed during the term. However, 
since the final examination is restricted to three hours, there remains a difficulty 


*Simply for convenience we shall, henceforth, use the term fe rather than he or she. 
**We make an exception for students who have already completed seven units of credit (26 or 27 
segments). They are allowed to register for the one remaining unit of credit. 
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for those students who have completed more than ten segments during the term. 
For this contingency, we announce that if a student has completed eleven or more 
segments, he is required to answer only the ten problems which correspond to the 
most recent ten segments he has completed. The letter grade the student receives 
in the course is based on the scores on the segment tests (which are graded 
numerically) and the score on the final examination. 


4. Record keeping. Good records are essential for a self-paced course. When 
the course began in 1972 we used a simple technique for keeping the extensive 
records we need. At the beginning of the term we assigned about 75 students to 
each teaching assistant. This assignment provided each student with a contact for 
answering questions about the course and for help with logistical problems. In 
addition to keeping a class list of his students, the TA recorded every segment test 
the students took and the grade received on each test. Each segment test is written 
on a sheet of typewriter-sized paper containing the examination questions; the 
student works the problems on that sheet (using additional sheets, if necessary). 
Everything is turned in to the TA so that no copies of the test are left in the hands 
of students. In a storage room we have a set of filing cabinets in which all student 
records are kept (including their tests). Since some students take several semesters 
to complete the course, we keep the records for a few years. 

We began the use of computers for keeping records in the mid-seventies. Since 
each student entering Berkeley receives an identification number by the registrar, 
we use these numbers as the basis for our data bank. Currently we have a 
computer program and a terminal in the test-taking room so that when a student 
wishes to take a test he identifies himself; the TA then brings up his record on the 
computer screen. The record shows which segment tests the student has taken and 
his grades on all of them. Then the TA enters into the computer which test he is 
now giving to the student. After the test is completed, the grade is recorded and 
the examination paper is kept for the files. Periodically a clerk files all the test 
papers in the storage room. 


5. Who should take self-paced calculus? It is easy to describe the students who 
should not take a self-paced course. We are all familiar with those students who 
don’t begin to study until the instructor in the course announces an hour exam. 
Since in a self-paced course there is never any external pressure to do anything, a 
student who needs a deadline to get started in his work will wait until the last two 
or three weeks of the semester before doing anything at all. Such students 
invariably fail the course. Since it is difficult to prevent all students of this type 
from registering in the course we therefore instituted what is known as the 
“three-week rule.” This rule restricts the number of segment tests a student may 
take during the last three weeks of the semester. The rule has a practical purpose 
as well as a psychological one, since without it the TAs and tutors would be greatly 
overworked near the end of the semester. While self-pacing allows the student the 
freedom to work in an irregular way, it does not allow him to bunch all of his 
test-taking toward the end of the term. 

On the other hand, self-pacing is suitable for many kinds of students. In 
addition to those students who are turned off by large lectures and who wish to 
work at their own speed, there are also two other groups of students who profit 
from self-pacing. First, there are those students who have had some calculus in 
high school but who have either failed the AP test or believe that they don’t know 
the subject well enough to skip first-year calculus entirely. Such students are bored 
in the large calculus lectures and believe (often mistakenly) that they already know 
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much of the material. If they really do know the subject, they can go through the 
self-paced course quickly and, as they pass all 30 tests, acquire the confidence and 
understanding they need in order to go on. If they don’t understand the subject, 
they find that out quickly as they fail the tests and have to repeat them. Because 
many of these students are bright but are the victims of their high school courses, 
self-pacing is a boon to them. The second class of students for whom self-pacing is 
useful are those who have the ability to learn mathematics but are slow at it. In a 
regular course such students will get about a C and thus learn a random 70% of 
the material. However, in a self-paced course, the same students will learn 
thoroughly the first 70% of the course, get a good grade, but earn three units 
credit instead of four. As a bonus the student learns to read mathematics, gains 
confidence, and comes to appreciate the positive reinforcement that goes with 
passing the segment tests. 

There are additional reasons for the great appeal of the self-paced program. 
For example, if an instructor in another course announces a term paper or an hour 
exam, the student can easily plan to do his math at times that don’t interfere with 
his other deadlines. Perhaps the most remarkable difference between the standard 
lecture course and the self-paced course is the student reaction. A large majority 
of students dislike the lecture-discussion type of course; however, almost all 
students who choose the self-paced course are enthusiastic about it. 

All students who take sophomore mathematics at Berkeley do so in the 
lecture-discussion format. From time to time we have compared the performance 
in sophomore mathematics of students who took the first-year lecture classes with 
those who took the self-paced course. Statistically, the two groups do equally well. 

There is another way in which the self-paced course is helpful. Freshmen 
entering Berkeley can choose to take a weaker calculus course, one designed for 
students planning to study economics, business administration, or biology. This 
course, called Math 16AB, usually has an enrollment of more than 1,000 students, 
but it has the feature that once a student completes this course he is not qualified 
to go on and take any additional mathematics courses. As one would expect, each 
year some of these students wish to go on in mathematics. They have two choices: 
they can go back and take the entire regular calculus course (called Math 1AB) 
over again, or they can register for the self-paced course (Math 1S) and pass a 
modest number of segment tests selected by the mathematics department so that 
their background for second-year mathematics will be adequate. The students who 
select this last option are grateful indeed, for the self-paced course saves them 
considerable time and work. 

Since there are many students for whom a self-paced course would be disas- 
trous, it is important to emphasize that self-pacing cannot replace entirely the 
regular lecture-discussion section course. However, institutions that have several 
sections of calculus are in a good position to offer students the choice of two ways 
to learn the subject; small colleges with fewer than 50 students taking calculus 
should not attempt self-pacing. 


6. Cost. It is difficult to assess whether or not a self-paced calculus course is 
more expensive than a regular lecture-discussion section course. At Berkeley we 
use a regular faculty member (usually a full professor) to lecture to 250 students. 
There is the expense of involving four faculty members to staff the first-year 
lecture-discussion course, which has an enrollment of 1,000 students. The self-paced 
course requires one faculty member for 500 students. However, all the students in 
the lecture-discussion section course get four units credit while the credit for the 
students in the self-paced course averages between three and four units. Both 
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courses use about the same number of teaching assistants per student. In the 
lecture-discussion section course we employ readers (upper-division students and 
some graduate students) to grade the homework that all students are required to 
turn in. In the self-paced course we employ tutors (upper-division students and 
some graduate students) on an hourly basis to help students, as described earlier. 

There is also the problem of the efficient use of space. For the lecture course 
we use four large lecture halls three days a week for one hour and dozens of small 
rooms twice a week for one hour. In the self-paced course, two rooms are used all 
day at least three days a week. There is also a storage room. Several people here 
have tried to analyze this information with respect to cost and have decided that 
self-pacing is slightly more expensive but worth the outlay. Each institution would 
have to examine its own data to decide which method is more costly. 


7. Other courses. Every beginning student at Berkeley who wishes to elect a 
mathematics course must take a diagnostic test in elementary algebra and 
trigonometry. Those who pass the test can choose the standard calculus course, 
Math 1AB (lecture-discussion section), Math 1S (self-paced), or Math 16AB, the 
weaker course described earlier. Those who fail the test must take a one-semester 
precalculus course, called Math P. After the success of the self-paced Math 1S was 
fully recognized, we introduced self-paced versions of Math P and Math 16AB. 
These courses have been operating successfully for several years, so that all our 
beginning mathematics students have the option of taking a lecture-discussion 
section course or a self-paced course. I believe that all lower division courses with 
large enrollments should give each student the option of attending large lectures 
and small discussion sections or of learning the subject at his own speed. I 
anticipate that the second-year mathematics course at Berkeley, which has large 
enrollments every term, will soon offer students these two options. It may develop 
that the self-paced second-year course will be even more popular than the 
first-year, self-paced courses are. 


Number Systems With a Complex Base: a Fractal Tool 
for Teaching Topology 


DANIEL GOFFINET 
9 Rue de la Republique, St. Etienne, 42100 France 


It so happens that I teach a first course in topology and a course in program- 
ming to the same students. Number systems other than the decimal one have 
proven to be a useful tool in the two areas. The number system with a complex 
base and digits 0 and 1 that is described in this note illustrates many topological 
notions and enthralled most of my students. An historical survey of the idea of 
using nonreal bases can be found in [1, pp. 188-190]. 

I gave them the program to play with, without any specific questions, in the 
hope of helping them to formulate and answer their own questions. Because they 
were only beginners in topology they mostly asked about programming details 
rather than the questions on connectivity that I had in mind. Still, I think it would 
make students with some topological background understand the necessity of the 
so-called “‘abstract”’ definitions. 
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b = 0.65 — 0.31 b = —0.697e27'/5 
Fic. 5. Fic. 6. 


b=0.8 + 0.21 0.5 + 0.51 
Fic. 7. Fic. 8. 


b = 0.2 + 0.61 
Fic. 9. b = —0.62e°7'/7 


Fic. 1-10. Some examples of %8, where the images are produced by considering only sets A 
arising as subsets of {1,..., 15} (except for Fic. 9 which uses subsets of {1,..., 13}). 
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F is complete. If A, is a Cauchy sequence in & then for every p © N* the 
sets A, M [1, p] are eventually constant, which yields the limit. 


F is totally bounded. For every p, # is a finite union of sets with diameter 
smaller than 2°”, namely the 2” sets obtained by considering, for each subset of 
[1,p], all its supersets in YH. Alternatively, compactness follows from the fact that 
the symmetric-difference topology coincides with the product topology obtained by 
considering # as {0,1} (where {0, 1} is given the discrete topology). 

If we define 9,: A> C by o,(A) = Lib": n € A}, then g, is continuous; 
hence, for each b, 8 = o,(#) is compact. Another proof of compactness appears 
at the beginning of §3. 

We will use the compactness of 88 to prove that C is the union of copies of the 
set 8 in the case that b = (1 + 1)/2 (See Fic. 8). The easy way to see this is to 
look at [1, §4.1]. But you can do it yourself with a pencil and graph paper. Because 
the set is in some ways a counterpart to [0, 1] (which is 8 for b = 1/2), let us first 
look at the counterpart of N (which is obtained from binary expansions in [0, 1] by 
replacing each 1/2 by 2 and considering only finite expansions). Because b is now 
(1+i)/2, b=1/B with B = 1 —i. Taking this B as base, and observing that 
B® = 16, leads us to look at @ = {X{B*: k © C}: C € {0, 1, 2, 3, 4, 5, 6, TH}. 

We start with the baby dragon @, = {LX{b*: k © C}: C C {0, 1, 2, 3}}. When 4 is 
added to the set of admissible exponents, @, becomes %,, which consists of two 
copies of @ 3. Continue until reaching ~@,, which consists of 16 @,s (see Figs. 
11-12). Now it is easy to check that another copy of @, translated 16 units 
rightward will fit with the first copy as in a jigsaw puzzle. Repeating yields a strip 
of @.s (see Fig. 13). Again, inspection shows that translating this doubly infinite 
strip by 167 yields a strip that fits with the first one. Thus every Gaussian integer 
(i.e., element of G = Z + Zi) may be written uniquely in the form c + 16(p + gi) 
with c © @, and p,q € Z. Inshort: G = 16G + @.. 

Now, let us return to proving that C can be covered by G-translations of 
B= {r,,,4a,b*: a, € {0, 1}. Let B, = (Ly ce 9,4,b*: a, € {0, 1}}, with By = 
{0}. Then 8, = b°¢, = @,/16. Because 8, /16 = {Lo 2, <154,b*: a, € {0, 1}}, we 
have that 6, = 8, + 1/16)8, = 8, + (1/167)@,. Similarly 6, = 8, , + 
(1/16")¢,. Assume inductively that 8, _, + G = G/16”"~' (clearly, 8, + G = G). 
Then 6, + G = (%,_, + G) + 1/16"), = 1/16""')G + 1/16")4, = 
(1/16")\16G + @,) = G/16". Since 8 + GDB, + G, B + G contains all points 
in a grid of mesh (1/16)”. It follows that for any square there is a finite union of 
G-translates of 8 (a compact set) which is dense in the square. Hence, 6 + G 
contains the square, and so 8 + G=C. 

As with the usual base-10 representations, we lose uniqueness when we pass to 
the limit; for example: b = 1 + (67+ b° +---), 


3. Going deeper into the topology of 8. I owe the following fixed-point idea to 
[5]. If .% is the set of nonempty, compact subsets of the plane, the Hausdorff 
distance is defined as 6C(X,Y) = max{sup d(x, Y), sup d(y, X)}; then (%, 8) is a 
complete space [2, p. 56]. I now define T,) by T)(z) = bz and T, by T(z) = b + bz. 
The functions T, and 7, are contractions of C, and also clearly induce contractions 
of (¥%, 5). If T = T, U T, is defined by TX) = T,CX) U T,CX), then T is also a 
contraction of (%, 6), because we can easily prove that 


6(X, UX,,Y, UY) < max{6(X,,Y,), 6(X,,Y,)}. 


Therefore T has a unique fixed point in %, and it is the limit of the sequence of 
the T"(K,) for any Ky. Taking K, = {0} we have that T"( Ky) equals those points 
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~]Q sccccces 5 10 


Fic. 11. Two copies of the baby dragon, Fic. 12. The set @ , which is built 
63, yields &. up from 8 copies of @. 


~ 10 gti aS 


Fic. 13. The Gaussian integers can be tiled with copies of the set @. 
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in 8 with at most n “digits” in their representation. So %8, the limit of the 


sequence, is in .% (i.e., is compact) and is a fixed point of T. 


4. Connectivity. I now claim that %8 is either connected or totally disconnected. 


Set 6° = 7,(%) and B' = T,(%); then B = T(B) = Bu B!. 


Case 1. 8° 71 B' = @. If m € B', then .4, its connected component in %, is 
contained in 8‘ (8° and %! are compact as images of 8 by the continuous 
functions T, and T,, respectively). But ®' itself is a disjoint union of compact sets 
so .#@ is included in some 7;7;(8). By induction we can prove that .4 is a subset of 
the intersection of a decreasing sequence of compact sets whose diameters tend to 


zero. It follows that 4 is simply {m}. 


Case 2. 8° Nn 8! # W. In this case B is connected. Choose a € 8° N B!, say, 
a= ba, =b + ba,. Because 8 is compact, to prove that it is connected it is 
enough to check that for every z € % and «e > 0, there is an e-chain in %8 from z 
to a. If € is greater than the diameter of 8 then (z, a) is such a chain. Let us show 
that from a-chains we can build Aa-chains where A < 1 is the modulus of b. Since 


z € 8 we have two cases: 


If z € B° then z = bd, and we can “a-link” d to a, then a to ay. Now take the 
images by 7, of these chains (i.e., simply multiply by b) to Aa-link bd to ba to 


bay = a. 


If z € %' then z = b + bd, and we can a-link d to a to a,. Then take the 


images of these chains by T),. 
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“digits” are needed to make %8 look whole. Another example: ‘8, is disconnected, 
although one cannot really tell from the image—a proof is needed. The four upper 
right blobs can be separated from the four bottom left ones by a strip that can be 
physically measured and seen to be greater than 1/50. The modulus of b is smaller 
than 0.64, so with the powers greater than 13 a gap wider than 0.64'4/(1 — 0.64) 
(which is well under 1/50) cannot be bridged. Therefore 8, is indeed discon- 
nected. 

The set of those b for which % is connected is described in [3], where, among 
other things, it is proven that 8 is connected if |b] > V2 /2. Among the Bs shown 
in Figures 1-10, only 8, and %8,, are disconnected. 


8. Miscellaneous. A lot of things are left to be explored. When % is connected 
and simply connected (see Fics. 7 and 8), is the boundary of fractal dimension 
strictly greater than 1 (as it seems)? Is that boundary the image of a continuous 
map from [0, 1] in R? (as it seems)? Why does 8. have %.-like eggs inside it (dark 
gray)? 

I tried to replace C by the algebra with (a, b) - (c,d) = (ac + bd, ad + bc), but 
got nothing worth showing. Why not? 


Acknowledgement. I am grateful to the referees for many suggestions. 
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A Characterization of the Cantor Function 


DONALD R. CHALICE 
Department of Mathematics, Western Washington University, Bellingham, WA 98225 


This note presents a characterization of the Cantor function that might be used 
as an alternative or addition to the development usually presented in real analysis 
courses. It may also be utilized to give a short program in Mathematica that easily 
generates the Cantor function artd other similar functions which we call “devil’s 
staircases” (see [4]). 


THEOREM. Any real-valued function F(x) on [0,1] that is monotone increasing 
and satisfies (a) F(O) = 0, (b) F(x/3) = F(x)/2, and (c) FQ — x) = 1 — F(x), is 
the Cantor function. 


Before presenting the proof, recall (see [1]) that if we consider the closed 
interval [0, 1] and remove the open middle third, (1/3, 2/3), and next remove the 
open middle thirds (1/9, 2/9) and (7/9, 8/9) of the two remaining intervals, and 
then remove the open middle thirds of the remaining four intervals and so on, 
indefinitely, what remains is the Cantor set C. Alternatively, x is in C iff x has a 
base-3 expansion consisting only of the digits 0 and 2. 
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A Characterization of the Cantor Function 


DONALD R. CHALICE 
Department of Mathematics, Western Washington University, Bellingham, WA 98225 


This note presents a characterization of the Cantor function that might be used 
as an alternative or addition to the development usually presented in real analysis 
courses. It may also be utilized to give a short program in Mathematica that easily 
generates the Cantor function artd other similar functions which we call “devil’s 
staircases’’ (see [4]). 


THEOREM. Any real-valued function F(x) on [0,1] that is monotone increasing 
and satisfies (a) F(O) = 0, (b) F(x /3) = F(x)/2, and (c) FU — x) = 1 — F(x), is 
the Cantor function. 


Before presenting the proof, recall (see [1]) that if we consider the closed 
interval [0,1] and remove the open middle third, (1/3, 2/3), and next remove the 
open middle thirds (1/9, 2/9) and (7/9, 8/9) of the two remaining intervals, and 
then remove the open middle thirds of the remaining four intervals and so on, 
indefinitely, what remains is the Cantor set C. Alternatively, x is in C iff x has a 
base-3 expansion consisting only of the digits 0 and 2. 
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The Cantor function G(x) may be defined as follows. First define it on C: if 
x= ),2-37™, then G(x) = u;27". The function G is monotone and has the 
same values at the endpoints of each removed interval, so G extends to a 
continuous function on [0, 1] (see Figure 3(a)). 

On [1/3, 2/3], G has value 1/2, on [1/9, 2/9] and [7/9, 8/9], G has values 
1/4 and 3/4, respectively. On the intervals [1/27, 2/27], [7/27, 8/27], [19/27, 
20/27], and [25/27, 26/27], the values of G are 1/8, 3/8, 5/8, and 7/8, 
respectively. Since G is (locally) constant on some neighborhood of every point in 
[0,1] \ C, G(x) = 0 almost everywhere on [0, 1]. (See [1] or [3].) 


Proof of the theorem. First observe that in constructing the Cantor set, the 
removed intervals (in base 3) are as given in TaBLe 1. The endpoints of any 
removed interval at the nth stage are found by either multiplying those from the 
previous stage by .1 (base 3), or by multiplying by .1 and adding .2. 


TABLE 1 


Step Values of G Removed Intervals (closures) 
1/2 [.1, .2] 
1/4, 3/4 [.01, .02], [.21, .22] 
1/8, 3/8 [.001, .002], [.021, .022], 


5/8, 7/8 [.201, .202], [.221, .222] 

1/16, 3/16 [.0001, .0002], [.0021, .0022], 

5/16, 7/16 [.0201, .0202], [.0221, .0222] 

9/16, 11/16 [.2001, .2002], [.2021, .2022], 
13/16, 15/16 [.2201, .2202], [.2221, .2222], etc. 


Alternatively, any permutation of m — 1 Os and 2s after the ternary point 
followed by a 1 or a 2 gives a removed interval at the nth stage. 

Recursion is now used to characterize F: By (a) and (c), FQ) = 1, so FC.1) = 
1/2. By (b) and (c), FC2) = 1 — FC1) = 1/2; (b) implies FC01) = FC02) = 1/4 
and (c) implies F(.22) = F(.21) = 3/4; (b) then yields the correct values of 1/8, 
3/8 on the firsts two intervals at stage 3, while (c) yields the value F(.201) = 1 — 
F(.022) = 5/8. It follows by induction that F and G agree on a dense subset of 
[0,1]. Since F is monotone increasing and has no jump discontinuities, F' is 
continuous (because any discontinuity of a monotone function is a jump; see 
[1, p. 129]). Thus since F' is continuous and agrees with the Cantor function on a 
dense set, F is the Cantor function, and the proof is complete. 

As an alternative proof of continuity, F may be exhibited as a uniform limit of a 
sequence of monotone increasing step functions {s,} on [0,1] with the jumps 
converging to 0. For example, define 


0 if x < .1 
s(x) =(1/2 ifl<x< 2 
1 if .2<x 


and inductively, define 
S, (3x) /2 ifx < 1 
s(x) = 41/2 ifl<x< 2 
1-s(1-—x) if. 2<x 


(assuming s, as defined on the first line). The continuity of F is then a conse- 
quence of the following lemma, which may be of independent interest. 
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1 (+e 


l 
Figure 1(a) Graph of s,. 


Figure 1(b) Graph of 55. Figure 1(c) Graph of s3. 


Lemma. Let {s,} be a Cauchy sequence in uniform norm of step functions on 
[0, 1], each with a finite number of jumps such that the heights of the jumps for s,, 
converge uniformly to 0. Then s,, converges uniformly to a continuous function F on 
[0, 1]. 


Proof. Pick x in [0,1] and e > 0. There exists an integer N such that n,m > N 
implies |s,(x) — s,,(«)| < ¢/4 for all x in [0, 1], and so that the jumps of s, are all 
less than ¢/4. Now any s, is uniformly continuous on each interval on which there 
is not a jump, so we can pick 6 so that |x — y| < 6 implies |s,(x) — sy(y)| < 2€/4. 
Hence, |x — y| <6 implies |F(x) — FCy)| < |F(x) — sy(x)| + |sylx) — syCy)| + 
Isv(y) — F(y)| < e. Thus F is continuous. QED 


Since n < m implies |s,(x) = 5,,(0)| < |s,_,(y) — s,,_-,(y)|72 where y is one of 
3x or 1 — 3x, it then follows inductively that |s,(x) — s,,(x)| < 1/2” when n < m. 
Thus {s,} is uniformly Cauchy and the continuity of F follows from the lemma. 

Note also that / may be generated by a geometric algorithm, where we begin by 
defining F(x) = 1/2 on the interval [.1,.2] and F(O) = 0, F(1) = 1, and where at 
each stage of the algorithm we shrink the x axis by a factor of 1/3 and the y axis 
by a factor of 1/2 and then flip the resultant graph across the line x = 1/2, and 
then again across the line y = 1/2. We see F as coming more into focus at each 
stage. 

The preceding characterization was used (together with the recursive feature of 
Mathematica) to generate FiGures 3(a)—(b) (programmed by Gerald Harnett). 
Other “devil’s staircases” were obtained by taking any p> O and changing 
condition (c) to (c’): Fd — x) = 1 — pF(x), for x < 1/3 and condition (b) to (b’): 
F(x /3) = F(x)/(p + 1). These generalizations arise as the cumulative distribu- 
tion functions of some of the probability measures invariant under the “inverted 
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Figure 2(a). Figure 2(b) 
Shrink 2(a) and superimpose on 2(a). 


o—_——_———* 
I I 
Figure 2(c) Figure 2(d) 
Double flip 2(b) and superimpose on 2(b). Shrink 2(c) and superimpose on 2(c). 


().25 0.5 ().75 l. 0.25 0.5 ().75 
The Cantor function (p = 1). The Cantor function ( p = 2). 
Fic. 3a Fic. 3b 


V” transformation discussed by Mandelbrot (see [2, 4]). Finally, by letting p take 
on, for example, the values .01, .1, .3, .5, 1, 4, 10, and 100, one can create a movie 
in Mathematica that shows the Cantor function “stressed” by the varying of the 
parameter p (see FIGURE 3). 
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Parametric Differentiation Revisited 


L. R. BRAGG 
Department of Mathematical Sciences, Oakland University, Rochester, MI 48309 


A number of now-ancient texts in advanced calculus employed the method of 
parametric differentiation to deduce evaluations for a variety of complicated 
definite integrals (e.g., [4, 7]). In his autobiography [3], R. P. Feynman mentioned 
how he frequently used this approach when confronted with difficult integrations 
associated with mathematical and physical problems. He referred to that method 
as “a different box of tools.” Most modern texts either ignore the subject or 
provide only one or two examples to illustrate theorems on the uniform conver- 
gence of improper integrals. (See [8] for a more recent treatment of integrals.) 
However, the method of parametric differentiation has a much broader scope than 
this. It can, in fact, be introduced into the elementary calculus to simplify the 
treatment of partial fractions with repeated factors, to carry out integrations 
ordinarily handled by parts, and to treat integrals that use substitutions. When tied 
in with other mathematical machinery (e.g., complex variables), it can be used to 
deduce formulas for special functions and to solve problems in differential equa- 
tions that involve iterated operators. Finally, the method can be conveniently used 
in conjunction with mathematical software packages that handle symbolic differen- 
tiations. 

In this note, we give three examples from calculus to illustrate the flexibility of 
this approach at the elementary level. One of the integrals treated is connected 
with the ¢t-distribution. We also note, without providing a complete discussion, how 
this technique can be used to evaluate the Riemann zeta function at 2p, p = 
1,2,..., and how it can be used in differential equations. A more extensive set of 
applications is available from the author. In the work to follow, we call upon the 
Leibniz product rule for differentiation as well as theorems that permit interchang- 
ing orders of differentiation and integration (see [5,6]). We also need to make 
frequent use of the formula 


D?-"x +a) °=(-1)" ‘cele +1) -+- (ec +n — 2)(x + a)7"*"~?. (1) 


When c = 1, the right-hand side reduces to (—1)”"'(n — 1)!(x + a)~”". We omit 
constants of integration from the indefinite integrations. 


A Partial Fraction Decomposition. Suppose we wish to obtain a partial fraction 
decomposition of the rational function (x + 2)~’(x + 1)~" where m and n are 
positive integers greater than 1. We associate with this the parameterized function 
h(x) = (x +a)" (x + b)-!,a>0,b> 0, a # b. By the usual algebraic decompo- 
sition method, we find 


h(x) =(b-—a) ‘(x +a)°'+(a-—b) ‘(x +5). (2) 


Upon differentiating (2) m — 1 times with respect to a and n — 1 times with 
respect to b and applying the Leibniz rule and (1) to each of the summands on 
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(2)’s right-hand side, we can show, after simplification, that 
(x+a) "(x+b)" 


1 +j-—-1)! . 
om, = cy (b= aya +a) 


n—l +j-1)! 
‘Gacpi 2 E (-1 py a bye BO (3) 


The required decomposition follows by replacing a in (3) by 2 and b by 1 

A brief test was carried out on Maple 4.2 for the Macintosh to compare the 
speed of Maple’s built in partial fraction decomposition procedure with the speed 
of the parametric method. The preceding example was selected with m and n 
approximately 20. The differentiation method ran somewhat faster with less 
garbage collecting. Programs are now being developed for implementing this 
parametric decomposition approach on the HP-28S calculator. 


An Indefinite Integral. The indefinite integral 
I(x,a,p +n) = [{(at+x?) "de, (4) 


a> 0, p > 0, n a positive integer, appears frequently in applications, in particular 
in probability and statistics. Calling upon (1), it follows that 


(—1)" 
P(pt+1)°::(pt+n-1) 


If the integral [(x, a, p) can be evaluated, then (5) provides a convenient way for 
computing I(x, a, p + n). For example, if p = 3/2, then 


I(x,a,p+n) = D"I(x,a,p). (5) 


3 _ 
if. 4,5} =x(a +x?) 1+ a7! (6) 


and the right-hand side is well suited for applying the Leibniz rule and (1) to 
evaluate (4). 

This can be used to quickly obtain a finite sum formula for the ¢-distribution 
with 2m degrees of freedom. That distribution is defined by the improper definite 
integral 


where C,,, = ['(m + 3)/(V2mazrT(m)). Analogous to (6), we have 


a) —-1/2 
f 


~3/2 


2 
a+ S| da = ya 


y 
“'la+ — + ¥2ma_'/?. 
~ 2m 2m 


If we differentiate both sides of this m — 1 times with respect to a, evaluate at 


a = 1, and take into account the constant C,,,, we can establish (see [1]) 


1 fy "STU +2) 
2V2m | vr jy I! 


The case of the f¢-distribution with 2m + 1 degrees of freedom can be treated 
similarly. 


2 \ ~U+9) 
T(2m,y) = 1 | +m) (7) 
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An Integration by Parts. We wish to evaluate /x”e* cos x dx. This quadrature is 


usually handled by a repeated integration by parts. Let us, however, associate with 


this the complex antiderivative fe“ dx = ec” ' where c = a + ib. Then 


[aren dx = Di fee dx = D"(e°*c"') 
n . ani 
—_ dh (1) : x2 Ipcxe-GUtl). (8) 


Writing c in the polar form c = pe” with p = (a* + b*)!/* and 6 = tan” '(b/a), 
(8) becomes 

yn iela tb - G+ DOIx 
(n Ty (a2 + b2)07P? | 


Finally, if we replace a and b by 1 in (9) and equate the real parts of its two 
members, we find 


[xtc de = LO Gy (9) 


n—-Jj 


nx x : J nix v 
iE: e*~cos xdx =e CY (n —j) 1204072 x- (+17 . (10) 


Other Uses. 
(i) Evaluation of ¢(2p). The following equality for meromorphic functions is 
developed in complex variables: 
dL, (n* + 2°) (11) 
n=1 
From the fact that cosh 7z/sinh 7z = 1 + 2/(e*"? — 1), the right-hand side of 
(11) can be replaced by 7/2z ~— 1/(2z”) + (/2z)L7_)B,Qaz)/j! in which B 
denotes the jth Bernoulli number [1] with B,,,,, = 0, m > 1. If we replace z* in 


(11) by a positive parameter a and then differentiate (11) p times with respect to a 
at a = 0, we obtain the familiar formula 


, ; 1 (-1)? (27) 
Gi) Differential equations with repeated factors. Let L(x, D,) denote a linear 
ordinary differential operator in x and D, and suppose that y(x, a) is a solution 
of the nonhomogeneous equation (L(x, D,) — a)y(x, a) = f(x). Then, with a little 
work, the reader can show that\the function y,(x) = (D”~'y(x, a@|q=o)/(n — 1)! 
satisfies the iterated nonhomogeneous differential equation [L(x, D,)I"y,(x) = 
f(x). For example, if L(x, D,) = D,—A,A a constant, then y(x, a) = 
[xeA tPC—O Fo) do and 


y,(x) =((n-1)!)7! [iG ~ oo)" 1e-9F(G) do. 


If the q operators L,(x, D,),..., L(x, D,(L; # L;) commute, then the above 
method can be extended to construct a solution (x) of the “iterated”? nonhomo- 
geneous equation LT'(x, D,) +++ Lax, D,)9(x) = f(x). To obtain 9(x), we solve 
the parameterized problem 


[ (L(x, D.) - 7) ese = f(x), (13) 


1 a cosh az 1 


Qzsinhaz 222° 


B,,, p=1,2,.... (12) 
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and then evaluate the following expression 


q 
1 _ - 
[1 [(m, - )!] Dy bones Dia 'y(X,@y,-..,@,)la=ay= =a,=0> 


n=1 


This method can also be applied to partial differential equations. 
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ELEMENTARY PROBLEMS 


E 3427. Proposed by R. Padmanabhan, N. S. Mendelsohn, and B. Wolk, 
University of Manitoba, Winnipeg, Canada. 


Let S denote the set obtained by formally adjoining an element e to the set Q 
of rational numbers. On S define a binary operation © as follows: 


peq=(3+pq)/(p+q) if pE€Q,qeQ,pF# —4|, 

p°q=e if pEQ,qeQ,p= —g|, 

x°e=x forall xeES. 
(a) Prove that (S,°) is an abelian group isomorphic to a subgroup of the multi- 
plicative group of non-zero real numbers. 
(b) If p is a positive rational number, put p, =p, p» =p°p, p3 =P°P°P,.... 
Show that lim, _,.. p, exists and find the limit. For which values of p is the 
sequence {p,}”°_, monotonic? 


E 3428. Proposed by Artin B. Boghossian, Aramex Investments Inc., Willowdale, 
Ontario, Canada 


Let S be a non-empty interval on the real line. Let f: S — S be a continuous 
function having the property that for each x € S there exists a positive integer 
n = n(x) with f"(x) =x, where f” denotes the n-th iterate of f. For given S 
characterize all such functions. 
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Part (a) was solved also by N. K. Artemiadis (Greece), G. Boros, D. Callan, S$. Gaignoux (France), 
R. A. Groenveld, P. L. Hon (Canada), W. Janous (Austria), S. C. Kwok, L. Kuipers (Switzerland), 
K.-W. Lau (Hong Kong), I. E. Leonard (Canada), R. Lee, I. A. Sakmar (Turkey), V. Schindler 
(Germany), M. Saravi (Iran), P. Severa & O. Such (Czechoslovakia), and the Lamar University Problem 
Solving Group, which also solved part (b). Two (incorrect) counterexamples were received. 


Small Arcs 


E 3324 [1989, 357]. Proposed by Mark D. Meyerson and R. Bruce Richter, U.S. 
Naval Academy, Annapolis, MD. 


Suppose that a and b are two closed arcs (homeomorphs of [0, 1]) in the plane, 
each of which contains the origin O but not as an endpoint. Prove or disprove: 
(a) there are arbitrarily small neighborhoods N of O such that Na is con- 
nected. (b) there are arbitrarily small neighborhoods N of O such that both NN a 
and NO b are connected. 


Solution by Richard Stong, Harvard University, Cambridge, MA. Statement (a) is 
true and statement (b) is false. 

For (a), let f: [0,1] — a be an arbitrary homeomorphism and let U be any open 
neighborhood of O small enough to miss the endpoints of a. Then Ufa is 
open in a, so f (Ua) is a disjoint union of open intervals. Let (s,t) be the 
open interval containing f-'(O). Then N = U — (f[0,s]) U f(t, 1)) is an open 
neighborhood of O inside U such that N Ma = f(s, t)) is connected. 

For (b), let {a,;} and {b,} be sequences converging to 0 such that 1 = a, > b, > 
a,>b,> -:: >0. Let f be a continuous nonnegative function on [—1, 1] with 
(1) f(x) = 0 for x < 0, (2) f(a;) = 0 for all i, and (3) f(b,) > b;_, for i > 3. Let 

= {(x,y): ~l1<x<1 and y = f(x)}, and let b= {(x, y): -1<y <1 and 
x = f(y)}. Suppose N is a neighborhood of O such that NM a and N1Mb are 
connected. Then (a,,0) € N for some n. Hence the arc in a from (a,,0) to 
(b,, f(b,)) is in N. This arc meets the arc in b from (f(b,_.), b,_,) to ©, a, _,), 
because f(b;) > b;_,. Therefore, (0, a,,_,) € N. Iterating this argument shows that 
(a,,,9) and (0, a,) belong to N for n > 2. Thus it is impossible for arbitrarily small 
neighborhoods of O to have a connected intersection with both a and b. 


Solved also by A. W. Schurle, J. H. Steelman, and the proposers. One incorrect solution was 
received. 


Must a Product of Derivatives Be a Derivative? 


E 3329 [1989, 446]. Proposed by Michel Balazard, Faculté des Sciences, Limoges, 
France. 


Suppose f and g are differentiable real-valued functions defined on (— ©, +), 
Must there exist a differentiable real-valued function 4 defined on (—«, +) such 
that h’ = f’g'? 


Solution found independently by Michael W. Botsko, St. Vincent College, Latrobe, 
PA, Jean-Pierre Grivaux, Paris, and Marcin E. Kuczma, University of Warsaw, 
Poland. The answer is no. Let o(x) = cos(x~') if x # 0, p(0) = 0. We will show 
that » has a primitive, but that y* does not. First, let w(x) = x sin(x~') for x # 0 
and wW(0)=0. Then w is continuous everywhere, and the function x-w is 
differentiable everywhere. The derivative of the latter is 2y% — gy, which thus has a 
primitive. Since % is continuous, g must have some primitive f. If go? =h' 


268 PROBLEMS AND SOLUTIONS [March 


everywhere, for some h, then 
d 
a (Me) ~ FO/2)) = @°(x) — 20(4/2) 


_ {cost(/a) ~}cos(2/x) = 1 ifx#0 
0 ifx=0. 


One can show directly that no differentiable function can have such a derivative, or 
one can appeal to the Darboux Theorem on intermediate values of derivatives. 
See, e.g., §129 of G. H. Hardy, Pure Mathematics, Cambridge University Press, or 
§21 of R. P. Boas, A Primer of Real Functions, Carus Mathematical Monographs 
No. 13. 


Editorial comment. The solution above is essentially the same as an argument 
originally given by Witold Wilkosz, “Some properties of derivative functions,” 
Fund. Math. 2(1921), 145-154. Several readers supplied that reference, as well as 
references to two articles by Andrew Bruckner: “Creating differentiability and 
destroying derivatives,” this Monthly 85(1978), 554-562, and ‘Derivatives: Why 
they elude classification,” Math. Mag. 49(1976), 5-11. Hugh Noland comments 
that in the latter article Bruckner cites Choquet’s result: if # 1s continuous, then 
h(f) is a derivative for every derivative f if and only if A is linear, and then argues 
“Suppose both f and f* are derivatives on [0, 1]; then f*, being non-negative, is 
the derivative of an increasing function F. But that makes f? integrable in the 
Lebesgue sense, and guarantees that f is also Lebesgue integrable. Thus, we need 
only take a derivative which is not Lebesgue integrable to furnish an example of a 
derivative whose square is not a derivative.” Noland concludes by noting that 
f(x) =x7!' sin(x~?) for x # 0 and f(O) = 0 provides such an example; it is the 
derivative of 


i sin(t~*) dt — (x*/2)sin(x~*) 
0 


Solved also by A. A. Jagers (Netherlands), D. J. Johnson, I. Kozma (Israel), O. P. Lossers 
(Netherlands), X. Liu, K. Merryfield, J.-M. Monier (France), R. Mortini (Germany), H. Noland, A. 
Riese, R. R. Rogers, K. Schilling, N. Starr, J.-Y. Thibon (France), and the proposer. 


A Characteristic of Low Degree Polynomials 


E 3338 [1989, 641]. Proposed by Walter Rudin, University of Wisconsin, Madison. 


Let s and ¢ be given real numbers. Find all differentiable functions f on the 
real line that satisfy 
f(y) — f(x) 


f'(sx + ty) = = 
y-x 


for all real x, y with x #y. 


Solution by Mordechai Falkowitz, Tel-Aviv, Israel. The only such functions are 
f(x) = ax + b, except that when s = t = $ any polynomial of degree 2 also works. 

If s+t+#0, then setting x =(z-1f)/(s+t) and y=(z+s)/(s4+t) 
for arbitrary real z yields f(z) = f(y) — f(x), so that f’ is differentia- 
ble. Since s+¢+#0, we may assume ¢ #0 and differentiate the relationship 
(y —x)f'(sx + ty) = f(y) — f(x) with respect to x to obtain 


—f'(sx + ty) + Cy — x) sf"(sx + ty) = —f'(x). 
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For the value y = —(s/t)x, we obtain f'(x) = f'(0) + (s + tXs/t)f"O)x and 
conclude that f is a polynomial of degree at most 2. Testing f(x) = ax? + Bx + y 
shows that a can be nonzero if and only if s = t = 5. The computation for any 
y#x is 2a(sx + ty) + B =[aly’? — x7) + Bly — x)I/(y — x), or 2a(sx + ty) = 
a(x + y). 

If s+t=0, put y=x+u for some u #0. We have f'(tu) =[f(x + u) - 
f(x)|/u for all x. Differentiating with respect to x yields f'(x + u) — f(x) = 0. 
Setting x = 0 yields f’(u) = f'(O). Since this holds for arbitrary u, f’ is constant 
and the proof is complete. 


Editorial comment. Many would-be solvers assumed without proof that in all 
cases of s,t, the function f must be twice differentiable everywhere. 
John A. Baker proved the following generalization: 


THEOREM. If s,t are given real numbers, then the necessary and _ sufficient 
condition for f,g to satisfy f(y) — f(x) = (y — x)g(sx + ty) for all x,y is as 
follows: If s =t =0 or s* # t*, then f is a polynomial of degree at most 1 and 
g=f'. Ifs =t #0, then fis a polynomial of degree at most 2 and g(x) = f'(x/2t). 
Ifs = —t # 0, then f(x) =a + A(x) and g(x) = A(x/t)/(x/t) for x # 0, where a 
is a real constant and A is an additive function (i.e., a function on the real line which 
satisfies Au + v) = A(u) + A(v) for all (u, v)). 


M. S. Jacobson & P. K. Sahoo provided references for published solutions of 
the problem s=t = 5 (J. Aczel, “A mean value property of the derivative of 
quadratic polynomials—without mean values and derivatives,” Math. Mag. 
58(1985), 42-45, and S. Haruki, “A property of quadratic polynomials,” this 
Monthly 86(1979), 577-579), and for a generalization of the problem (G. E. Cross 
and Pl. Kannappan, ‘‘A functional identity characterizing polynomials,” Aequat. 
Math. 34(1987), 147-152). 


Solved also by the proposer and 22 other readers. Several further solutions received contained 
errors, omissions, or hidden assumptions. 


A Sequence of Polynomials Related to the Roots of Unity 


E 3339 [1989, 641]. Proposed by Antonio J. Duran, University of Seville, Spain. 


If k is a given positive integer, prove that we can find a polynomial g, with 
rational coefficients and degree at most k such that 


SE (ernie = 1)* = ayn) 
j=1 
for any positive integer n. For example, 
g(n) = —(n-1)/2, — g,(n) = —(n = 1)(n — 5)/12, 
83(n) = (n — 1)(n — 3) /8. 


Solution by R. J. Chapman and R. W. K. Odoni, University of Exeter, UK. 
Consider the formal power series in ¢ 


fn.) = YS aeln)p 
k=1 
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For the value y = —(s/t)x, we obtain f(x) = f'0) + (s + tXs/t)f"(O)x and 
conclude that f is a polynomial of degree at most 2. Testing f(x) = ax? + Bx + y 
shows that a can be nonzero if and only if s = t = 5. The computation for any 
y#x is 2a(sx + ty) + B =[aly? — x7) + Bly —x)|/(y — x), or 2a(sx + ty) = 
a(x + y). 

If s+t=0, put y=x+u for some u #0. We have f'(tu) =[f(x + u) - 
f(x)|/u for all x. Differentiating with respect to x yields f’(x + u) — f(x) = 0. 
Setting x = 0 yields f’(u) = f'(O). Since this holds for arbitrary u, f’ is constant 
and the proof is complete. 


Editorial comment. Many would-be solvers assumed without proof that in all 
cases of s,t, the function f must be twice differentiable everywhere. 
John A. Baker proved the following generalization: 


THEOREM. If s,t are given real numbers, then the necessary and sufficient 
condition for f,g to satisfy f(y) — f(x) = (y — x)g(sx + ty) for all x,y is as 
follows: If s =t =0 or s* # t’, then f is a polynomial of degree at most 1 and 
g=f'. Ifs =t #0, then f is a polynomial of degree at most 2 and g(x) = f'(x/2t). 
Ifs = —t #0, then f(x) =a + A(x) and g(x) = A(x/t)/(x/t) for x # 0, where a 
is a real constant and A is an additive function (i.e., a function on the real line which 
satisfies ACu + v) = A(u) + A(v) for all (u, v)). 


M. S. Jacobson & P. K. Sahoo provided references for published solutions of 
the problem s=t = 5 (J. Aczel, “A mean value property of the derivative of 
quadratic polynomials—without mean values and derivatives,” Math. Mag. 
58(1985), 42~45, and S. Haruki, “A property of quadratic polynomials,” this 
Monthly 86(1979), 577-579), and for a generalization of the problem (G. E. Cross 
and Pl. Kannappan, ‘A functional identity characterizing polynomials,’ Aequat. 
Math. 34(1987), 147-152). 


Solved also by the proposer and 22 other readers. Several further solutions received contained 
errors, omissions, or hidden assumptions. 


A Sequence of Polynomials Related to the Roots of Unity 


E 3339 [1989, 641]. Proposed by Antonio J. Duran, University of Seville, Spain. 


If k is a given positive integer, prove that we can find a polynomial g, with 
rational coefficients and degree at most k such that 


(etme = 1)* = y(n) 
j=l 
for any positive integer n. For example, 
g(n) = —(n—1)/2, — g,(n) = —(n — 1)(n — 5)/12, 
83(n) = (n — 1)(n — 3) /8. 


Solution by R. J. Chapman and R. W. K. Odoni, University of Exeter, UK. 
Consider the formal power series in f 


00 th 
f(n,t) = XB) 
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Richard Guy provided a table computed by H. K. Farahat. The next values of 
g,(n) are 


g,(n) = (n — 1)(n? + n? — 109n + 251) /(2* - 3% - 5) 

g,(n) = —(n — 1)(n — 5)(n* + 6n — 19) /(2? - 37) 

g.(n) = —(n — 1)(2n° + 2n* — 355n? ~ 355n? + 11153n — 19087) / 
(2°-33-5-7) 

g4(n) = (n — 1)(n — 7)(2n* + 16n? ~— 33n* ~ 376n + 751) /(2’ - 3° + 5) 

gg(n) = (n — 1)(3n’ + 3n® — 917n? — 917n4* + 39697n? + 39697n? 
— 744383n + 1070017) /(28 - 3% - 52 - 7) 

go(n) = —(n — 1)(n — 3)(n — 9) (n° + 13n* + 10n? — 350n? — 851n + 2857) 
/(2?:3-+5*-7). 

Most solvers noted that the numbers (e27”/" — 1)—! (1 <j <n — 1) are the 
roots of (w + 1)” — w” = 0 and so their k-th elementary symmetric function is 
(—1)* ("> /(k + 1) for k =1,2,.... The power sums g,(”) can then be 
calculated recursively by using Newton’s formulas. 

Solved also by S. J. Bang (Korea), D. Callan, J. Ferrer (Spain), S. M. Gagola, Jr., I. Gessel, A. A. 
Jagers (The Netherlands), W. Janous (Austria), H. Ki (Korea), E. Levine, O. P. Lossers (The 


Netherlands), J. McHugh, H. C. Morris, J. H. Nieto (Venezuela), A. Nijenhuis, H. C. Wei (China), and 
the proposer. 


A Transcendental Sum 


E 3353 [1989, 838]. Proposed by H. G. Diamond, University of Illinois, Urbana, 
IL. 


For each positive integer n, let G(n) be the number of digits in the decimal 
expansion of 2”. Is it true that 
° G 1169 
$ Gln) _ 1169, 
2” 1023 


n=1 


Solution I by Kee-Wai Lau, Hong Kong. We show that the desired sum S exceeds 
1169/1023. Using log x for log,, x, we have S = L*_,|1 + n log 2|/2”, or 
10 |[nlog2| ~ [(n + 10)log 2] 


S=1+ Lan t > 


n+10 
n=1 n=1 2 


1167 1 2 |(n + 10)log 2] 
~ joo * Gooa 
Since |[(n + 10)log 2] > [ndog 2) + 3], with strict inequality at n = 93, we have 
1167 1 2 |n(log2) +3] 1167 1 
S> 034 * toa a ona * Tone * 
which implies S > 1169/1023. 


Solution II by Reiner Martin, Karlsruhe, Germany. If n, is the least positive 
integer with G(n,) =k, then l=n,<n,< -*- and S=YR_,UR_,2°" = 
712. "*. Since 2'% < 10°! < 2'% < 10°, we have n3, = 103. Hence the 102nd 
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digit in the binary expansion of S is 1, but the 102nd digit in the binary expansion 
of 


1169/1023 = 1+ 146 > 9 —10n —~ f+ y (27~ 10n 4 94—10n 4 2!—10n) 


n=1 n=1 
is zero. 


Solution III by B. M. M. de Weger, University of Twente, Enschede, The 
Netherlands. William W. Adams and J. L. Davison, in “A remarkable class of 
continued fractions,” Proc. Amer. Math. Soc. 65(1977) 194-198 showed that if a 
is irrational, then T(a@) = LZ_,27'**! is transcendental. Now it is easy to see that 
(in the notation of Solution IT) 


n, =1+ |(k — 1)log 10/log 2]. 


Thus § = 1 + T(Ulog 10/log 2). Since log 10 /log 2 is irrational by the unique factor- 
ization theorem, S is transcendental and cannot equal the rational number 
1169/1023. 


Editorial Comment. Adams and Davison proved their result by showing that the 
partial quotients in the continued fraction expansion of 7(a@) increase more 
rapidly than is possible for an algebraic number (by Roth’s Theorem). In the case 
of S the first few partial quotients are 


1,7, 146, [2°°/1023| = 23(2° ~ 1)/(2'° — 1),.... 


Thus, 1+ 1/(7 + 1/146) = 1169/1023 is a very good rational approximation 
to S. 

Michael Filaseta solved the problem by using Roth’s Theorem to prove that if a 
is irrational, then L*_,|naJ2~" is transcendental. Now, if @ is an irrational 
number between 0 and 1, Doug Bowman remarked that 


(1-x) ¥ [nalx" =x Vxl’/ ({x| <1). 
n=1 n=1 
Thus Filaseta’s result is equivalent to the theorem of Adams and Davison quoted 


in Solution III. 
The smallest positive integer N such that 


N 
Y* G(n) + 27" > 1169/1023 is N = 109. 


n=1 
Cf. the solution of Problem 6609 in this issue. 


Thirty-three correct solutions ‘and three incorrect solutions were received. 


ADVANCED PROBLEMS 


6652. Proposed by D. M. Bloom, Brooklyn College of the City University of 
New York. 


For x a positive integer put 


~1)' —— 
E(x)= > | a (x —i)'e*"'. 


O<i<x 
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Evaluate 
lim {E(x) — 2x}. 


6653. Proposed by D. K. Lee, University of Ulsan, Korea, and B. A. Murray, 
University of Newcastle-upon-Tyne, UK. 


For non-negative real a evaluate the integral 
I(a) = ["p(8)sin 6 a0, 
0 
where (6) = arccos({a — cos 6}/ V1 + a? — 2acos@),0 < W(0) < 7. 
6654. Proposed by W. O. Egerland and C. E. Hansen, Aberdeen Proving 
Ground, Aberdeen, MD. 


Suppose w is real, n is a positive integer greater than 1, and a,,a,,...,a, are 
complex numbers with |a,| < 1 for k = 1,2,...,n. Prove that the equation 
e'°(z —a,)(z — ay) +++ (2 -4,) =2Z1 — Gz) — a,z) ++: (1 — G,z) 


has at least m — 1 roots on the unit circle. 


SOLUTIONS OF ADVANCED PROBLEMS 
Successive Divisor Sums 


6592 [1989, 165]. Proposed by Dietrich Haase, Free University of Berlin, and 
Adolf Hildebrand, University of Illinois at Urbana-Champaign. 


For real k let o,(n) denote the sum of the k-th powers of the (positive) divisors 
of the natural number n. Show that there exists a number k, satisfying 1 < ky < 2 
such that 

(i) if k > ky, then {o,(n + 1) — o,(n)}(—1)"*' > 0 for all sufficiently large n, 

(ii) if 0< k < ko, then {o,(n + 1) — o,(n)}(—-1)"*?* changes sign infinitely of- 
ten. (Cf. Problem 6555.) 


Editorial remark. Through an unfortunate editorial blunder, Hildebrand was 
listed as the sole proposer in the February, 1989 issue. The assertion of part (i) is 
due to Haase; the assertion of part (ii) is due to Hildebrand. 


Composite solution, based on the solutions of the proposers and Eugene Levine, 
Adelphi University, Garden City, NY. Put 


S,(n) = (-1)"""{o,(n + 1) — o(n)}. 
Let f: (1,%) — (0, ©) be the decreasing continuous function defined by 


f(x) = E+ 1y2)™ (x > 1). 


In terms of the Riemann zeta-function we have f(x) = (2* — 1)g(x) — 2*. Let k, 
be the unique real number greater than 1 such that f(k,) = 1, that is, ky = 
1.9401... . With this choice of k, we strengthen (i) and (ii) by proving the 
following assertions. 
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Evaluate 
lim {E(x) — 2x}. 


6653. Proposed by D. K. Lee, University of Ulsan, Korea, and B. A. Murray, 
University of Newcastle-upon-Tyne, UK. 


For non-negative real a evaluate the integral 
I(a) = ["(6)sin 6 do, 
0 
where (0) = arccos({a — cos 0}/ V1 + a* — 2acos@),0 < W(@) <7. 
6654. Proposed by W. O. Egerland and C. E. Hansen, Aberdeen Proving 
Ground, Aberdeen, MD. 


Suppose w is real, n is a positive integer greater than 1, and a,,a,,...,a, are 
complex numbers with |a,| < 1 for k = 1,2,...,n. Prove that the equation 
e'°(z—a,)(zZ — a2) +++ (2 -a,) =z. ~ &z)(1 — az) ++ 1 ~ Gz) 


has at least nm — 1 roots on the unit circle. 


SOLUTIONS OF ADVANCED PROBLEMS 
Successive Divisor Sums 


6592 [1989, 165]. Proposed by Dietrich Haase, Free University of Berlin, and 
Adolf Hildebrand, University of Illinois at Urbana-Champaign. 


For real k let o,(n) denote the sum of the k-th powers of the (positive) divisors 
of the natural number n. Show that there exists a number k, satisfying 1 < ky < 2 
such that 

(i) if k > ky, then {o,(n + 1) — o,(n)}(—1)"*' > 0 for all sufficiently large n, 

(ii) if 0< k < ko, then {o,(n + 1) — o,(n)}(—-1)"** changes sign infinitely of- 
ten. (Cf. Problem 6555.) 


Editorial remark. Through an unfortunate editorial blunder, Hildebrand was 
listed as the sole proposer in the February, 1989 issue. The assertion of part (i) is 
due to Haase; the assertion of part (ii) is due to Hildebrand. 


Composite solution, based on the solutions of the proposers and Eugene Levine, 
Adelphi University, Garden City, NY. Put 


S,(n) = (-1)"*"{o,(n + 1) — &(n)}. 
Let f: (1,2) — (0,0) be the decreasing continuous function defined by 


f(x) = EU 41/2)" (x>1). 


In terms of the Riemann zeta-function we have f(x) = (2* — 1I)g(x) — 2%. Let k, 
be the unique real number greater than 1 such that f(k,) = 1, that is, ky = 
1.9401... . With this choice of k, we strengthen (i) and (ii) by proving the 
following assertions. 
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I. If |k| > k,, then S,(m) > 0 for all sufficiently large positive integers n. 

II. If |k| < k,, then 

(a) S,(n) > 0 for infinitely many odd values of n, 

(b) S,(n) > 0 for infinitely many even values of n, 

(c) §,(n) < 0 for infinitely many odd values of n, 

(d) S,(n) < 0 for infinitely many even values of n. 

(The method of solution of 6555 [1987, 800; 1990, 351] would probably make it 
possible to replace (a), (b), (c), (d) by positive density assertions.) 

The extension of (i) and (ii) to non-positive values of k is straightforward, but 
the extension of (i) to the case |k| = k, requires the following lemma. 


LEMMA. If n is a sufficiently large positive integer, there exists a prime number q 
not exceeding n'/* and not dividing n + 1. 


Proof of Lemma. We use Bertrand’s postulate (Niven and Zuckerman, An 
Introduction to the Theory of Numbers, §8.3). Suppose n > 256 and r = [n'/2/8]. 
Bertrand’s postulate asserts the existence of primes p,, Py, p3 in the intervals 
(r,2r), 2r,4r), and (4r, 8r) respectively. Then 


3 
P1P2p3 > |n'/7/8| >n+1 


for n > 10°, and so at least one of the primes p,, po, p3 does not divide n + 1. 
Thus the lemma is proved. 


Proof of I. We begin by noting that if 0, = 1 + 2~* and if 


A, = ¥ (2t4+1)-*=14+2-*f(k) fork > 1, 
t=0 


then 6, >A, for k>ky, 0,=A, for k=ky, and 0, <A, for 1<k<kp. 
Further we have 


o,(n)/n* =oa_,(n) > 96, for even n, 
o,(n)/n* =oa_,(n) <A, forn odd, k > 1, 
o,(n)/n* =o_,(n) <A, —n-* forn odd, k > 1, 


where m is any odd integer not dividing n. 
If k > k, and n is odd, then 


o(nt+1) (nt+i)"o_,(n+1)_ o,(n+1)_ 4, 


=? COO hm ll. 
o,(n) n“o_,(n) o_,(n) Ax 
Thus S,(n) > 0 for |k| > ky and n odd. 
If k > k, and n is even, we have 
o_,(n) (n + 1)“o,(n) o,.(n) 0,.n* 5 
o_,n+1)  nko,(n +1) o(nt+1)~ an+ ko” 


providing n is sufficiently large. Thus S,(n) > 0 for |k| > k, and n a sufficiently 
large even number. 
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provided p is sufficiently large. Also 
o(2p) —  (2p)" ao _, (2p) 
o.(2p-1) (2p —1)* o_,(2p ~ 1) 
if p is sufficiently large. Thus if 0 < |k| < ky, we have S,(Q2p — 1) < 0 for any 
sufficiently large prime p satisfying the congruence p = (b + 1)/2 (mod b). Hence 
assertion (c) holds for 0 < |k| < ko. 

If 0<k<k,, we may again choose N so large that LN (t + 5) *=1+4+¢, 
where c > (0. By Dirichlet’s theorem there are infinitely many primes p such that 
2p+1 is divisible by b=3-5-7...(2N +1), ie., such that p = (b — 1)/2 
(mod b). Then 

a(2p+1) (2p+1)*o_,(2p+1) o_,(2p +1) 
— a Se. > ——— 
o,(2p) (2p)“o_,(2p) o_,(2p) 
14+3-*45-*4---4(2N4+ 1)“ 
14+27*+p-*4+(2p)“ 

14+2°*+2-*e 

~~ (14+ 27*)(1 + p-*) 
provided p is sufficiently large. Thus, if 0 < |k| < ky, we have S,(2p) < 0 for any 


sufficiently large prime p satisfying the congruence p = (b — 1)/2 (mod b). Hence 
assertion (d) holds for 0 < |k| < kp. 


Solved also by Paul Erdos (Hungary) and Joachim Herzog (Germany). 


WV 


> 1, 


Circles Passing Through an Odd Number of Lattice Points 


6605 [1989, 533]. Proposed by E. Ehrhart, Université de Strasbourg, France. 


We use the term “lattice point” to mean a point with integer coordinates. 
If k is a positive integer, Schinzel [Enseignement Math. (2) 4 (1958) 71~72] 
proved that the circle 
2 


[x - | + 2 = (5*/3)° 


passes through exactly 2k + 1 lattice points; clearly the two coordinates of any one 
of these 2k + 1 lattice points are of like parity. Thus by making the substitution 
X=xt+y,Y=x-—~y we see that the smaller circle 


[x- = + (»- 2) = (N55) («) 


also passes through exactly 2k + 1 lattice points. 

Gi) Show that when k = 1 no circle smaller than (*) passes through exactly 
three lattice points. 

(ii) Show that when k = 2 no circle smaller than (*) passes through exactly five 
lattice points. 

(ii) Show that when 2k + 1 is composite there is a circle smaller than (*) which 
passes through exactly 2k + 1 lattice points. 


2 


Solution by David Borwein, University of Western Ontario, London, Ontario. For 
(i) it is readily seen that the smallest circle passing through exactly three lattice 
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points must circumscribe a triangle congruent to (0, — 1), (—1,0), (1,1) and so 
have radius 5V2 /6. Any smaller circle passes through either an even number of 
lattice points or no lattice points. We do not have a simple deductive argument for 
the truth of (ii), but a computer search failed to yield any smaller circle passing 
through exactly five lattice points. For (ili), suppose 2k + 1 = r+ 1)Qs + 1) 
where r and s are positive integers with r < s. By standard number theory the 
equation 


u* + v* =2- 5771375 
has exactly 42r + 1)2s + 1) = 42k + 1) integer solutions. Further, since 
2 - 5*"13*5 = 2 (mod6), 
all solutions must be of the form u = +1 (mod6), v = +1 (mod 6); all other 


possibilities are easily excluded. Hence all solutions have the form u = +(6x — 1), 
vu = +(6y — 1) where x and y are integers. Therefore the equation 


(6x — 1)? + (6y — 1)? = 2 - 57"1325 


has exactly 2k + 1 integer solutions, and so the circle 


1 \? 
! 


+ 
passes through exactly 2k + 1 lattice points. But 5"13° < 5* since 13° < 5*7" = 
581 +2r)" and so this circle has smaller radius than (*). This completes the proof; 
we note that it is straightforward to show, along the lines of the argument used to 
establish (iii), that when 2k + 1 is prime (*) is the smallest circle with center 
(1/6, 1/6) that passes through exactly 2k + 1 lattice points. 


1 2 
y- 5 = (5"13°V2 /6) 


Solved also by the proposer. 
An Application of the Artin-Schreier Theorem 


6606 [1989, 652]. Proposed by Daniel B. Shapiro, Ohio State University, Columbus. 


Suppose k is a field containing an element i with i? = —1 and suppose k is the 
algebraic closure of k. Suppose F and K are fields with k CF CK and [K: F] 
finite. 


(1) If k CK, prove that in fact k CF. 
(2) More generally, if L is an algebraically closed field with k C L C K, does it 
follow that L Cc F? 


Solution by I. Martin Isaacs, University of Wisconsin, Madison. We shall prove 
(1) and then construct for (2) an example in which L is not contained in F. For (1) 
we may assume that k =k OF. Since i € k, it follows via the Artin-Schreier 
theorem that if k properly contains k, then there exists a © k with minimal 
polynomial g (over k) of arbitrarily large degree. The minimal polynomial f of a 
over F’, however, divides this polynomial, and so all of its roots (in an appropriate 
extension field) are algebraic over k and so lie in k. It follows that all the 
coefficients of f lie in k and so f divides g in the ring k[x]. As g is irreducible, 
we deduce that f = g and so deg(g) = deg(f) < [K: F]. This contradicts the fact 
that deg(g) can be made arbitrarily large. We deduce that k = k. 

For (2), suppose k is any algebraically closed field of characteristic zero and let 
L >k be an algebraic closure of a simple transcendental extension of k. Write 
R=L®, L so that R is a k-algebra. Note that R has subrings L, = L ® 1 and 
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L, = 1 ® L, each isomorphic to the algebraically closed field L, and also R has an 
automorphism o mapping x ® y to y ® x, so that o has order 2 and interchanges 
L, and L,. Since k is algebraically closed in L and has characteristic zero, 
Theorem 4.5.2 of Nagata’s book Field Theory guarantees that R is an integral 
domain. Let K be its quotient field. Then the unique extension of a to K has 
order 2. If F is the fixed field of this extension, then k C F CK and F does not 
contain the algebraically closed field L although |K : F| = 2. 


Editorial comment. A good reference for the Artin-Schreier theorem is the last 
section of Volume 3 of Nathan Jacobson’s Lectures in Abstract Algebra, Van 
Nostrand, Princeton, 1964 or Section 12 of Irving Kaplansky’s Fields and Rings, 
University of Chicago Press, 1972. In a paper of S. Abhyankar, “On the composi- 
tum of algebraically closed subfields,” Proc. Amer. Math. Soc. 711956) 905-907, it 
is shown that (2) is true when K is finitely generated over k. In fact, L ¢ k in that 
case. 


Solved also by R. J. Chapman & R. W. K. Odoni (United Kingdom) and by the proposer. Two 
incorrect solutions were received. 


Polynomials Related to the Laguerre Polynomials 


6607 [1989, 652]. Proposed by Ira Gessel, Brandeis University, Waltham, MA. 


The Laguerre polynomials are defined by 
k(nt+a xk 

L(x) =(-)"E(-y*(" 7 “*)-— 
P= DE eo" op 


where a is an arbitrary parameter and n is a non-negative integer. Show that for 
each non-negative integer k there exists a polynomial q, of degree |k /2] depend- 
ing only on k& such that 
n xnwk 
L(x +n+a) =(-1)" ¥ a,(n + a) ———. 
kno (n—k)! 


For example g,(u) = 1, qu) =0, qu) = -u/2, qu) = —u/3, qu) = 
u*/8 — u/4. 


Solution by Mourad E. H. Ismail, University of South Florida, Tampa. Replace x 
by x +n +a in the definition of a Laguerre polynomial and apply the binomial 
theorem. We obtain 


n n—k J n-k-j 
LO(x+nt+a)= y (" ~1)"" TF 
k ( aX i\(n —k-—j)! 


k=0 
= (-1)" J nt+a _ ; 
yd dnl ° Th )! 
where 
m ry\(—1)\u™* 
Ink u) = X (x) (m—k)! 
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Clearly q,,(u) depends only on m. Since q,,(u) is a Cauchy product, we see that 


x q,,(u)t™ = (1 — t) e” = exp {u[t + In(1 — ¢)]} 
m=0 


CO t” 
exp ur > | 


noon t+2 


By expanding the right-hand side we find that q,,(u) has precise degree |m/2] and 
that the coefficient of u!’”/7! is 


(-1/2)"""7/([m/2])!. 


Solved also by Seung-Jin Bang (South Korea), David Callan, Michael Golomb, Bruno Haible 
(Germany), A. A. Jagers (The Netherlands), and the proposer. 


A Rational Sum 


6609 [1989, 743]. Proposed by Doug Bowman and Tad White, University of 
California at Los Angeles. 


For each positive integer n let g(n) be the number of digits greater than 4 in 
the decimal expansion of 2”. Is it true that 


Editorial remark. Exactly the same result was obtained previously by Eugene 
Levine, who published it in the College Mathematics Journal as Problem 386 [1988, 
448; 1990, 151]. Levine also generalized the result to even bases b > 2, with sum 
2/(b — 1) rather than 2/9. There were eight CMJ solvers, one of whom (Dave 
Ohlsen) also proposed a generalization; among the nineteen AMM solvers several 
proposed generalizations (not all different). 

Bowman and White actually provided the editors with a variety of related 
questions, only one of which duplicated Levine’s problem. With hindsight the 
following would have been a better choice: 

Define the Fibonacci sequence by F, = F,=1 and Fi. =F,_,+F,_, for 
n > 3. For each positive integer n let f(n) be the sum of the Fibonacci numbers F; 
over all integers i such that the ith digit of 2” (counting from the right) exceeds 4. 
Show that 

° f(n) 20 
XL 2” 89 


n=1 


(*) 


Solution by Doug Bowman and Tad White (of both the proposed problem and (*). 
Let 79,171,12,..., be a sequence of integers with r, > 1, and define another 
sequence a, inductively by a, = 1 and a, ,, =a, r,. Let the expansion in base b 
of a, be 

ay, = » d,, ;b'. 


iz0 
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Clearly q,,(u) depends only on m. Since q,,(u) is a Cauchy product, we see that 


Y dy(u)t™ = (1 —t)"e" = exp {ult + In(1 — £)])} 
m=0 


CO t” 
exp | ut > | 
n=0 


n+2 


By expanding the right-hand side we find that q,,(u) has precise degree |m/2] and 
that the coefficient of u!”/*! is 


(-1/2)'"/717([m/2])!. 


Solved also by Seung-Jin Bang (South Korea), David Callan, Michael Golomb, Bruno Haible 
(Germany), A. A. Jagers (The Netherlands), and the proposer. 


A Rational Sum 


6609 [1989, 743]. Proposed by Doug Bowman and Tad White, University of 
California at Los Angeles. 


For each positive integer n let g(n) be the number of digits greater than 4 in 
the decimal expansion of 2”. Is it true that 


Editorial remark. Exactly the same result was obtained previously by Eugene 
Levine, who published it in the College Mathematics Journal as Problem 386 [1988, 
448; 1990, 151]. Levine also generalized the result to even bases b > 2, with sum 
2/(b — 1) rather than 2/9. There were eight CMJ solvers, one of whom (Dave 
Ohlsen) also proposed a generalization; among the nineteen AMM solvers several 
proposed generalizations (not all different). 

Bowman and White actually provided the editors with a variety of related 
questions, only one of which duplicated Levine’s problem. With hindsight the 
following would have been a better choice: 

Define the Fibonacci sequence by F, = F, = 1 and F,=F,_,+F_, for 
n > 3. For each positive integer n let f(n) be the sum of the Fibonacci numbers F; 
over all integers i such that the ith digit of 2” (counting from the right) exceeds 4. 
Show that 


f(n) — 20 


2X 2 ~ 89" ~) 


Solution by Doug Bowman and Tad White (of both the proposed problem and (* ). 
Let 79,171,12,---, be a sequence of integers with r, > 1, and define another 
sequence a, inductively by a, = 1 and a, ,, = 4,r,. Let the expansion in base b 
of a, be 

ay, = > d,, ;b'. 


iz0 
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The computation of the expansion in base b of a,,,, may be displayed as follows: 


22+ On 2 Choi Cn ,0 
.- dy 9 An | ano 
XT, 
Ans 2 An+i4 An+1.0 
Here the c, ; denote “carries,” c, 9 = 0 for each n, and 0 <d, , < b for all n and 


i. Note that if r, < b, the base b representation of r, is a 1-digit number, and our 
ordinary elementary-school definition of “carry” is valid. If r, > b, we multiply 
nonetheless as the diagram indicates; the multiplication is distributed over the 
base b expansion of a,, but not over the expansion of r,. In other words, the “next 
digit” and ‘‘next carry” are given by the equation 


Mani t Cri = Anat + OCy i+) ( ** ) 
for all n > 0 andi > 0. Now define, for z real or complex, the function 
e(n; 2) = Dear! 
i>0 


Proposition. If |z| <b, then 
= 8(n;z) z 
n=0 An+ b-—z 


Proof. Multiply the “next digit” equation (**) by z'*'/a,,, and collect c’s 
and d’s to obtain 


1 


i+1 _ . wl _ 
(bz Cniitl Zz z'cy i) =z 
an+1 


Since Cc, 9 = 0, summation on 7 yields 


coed 


n+1 iz0O 


ay an+ 


i i 
Z'dy et 


Hence 


3 (b- 2) SO sys y [Set — Set 


O<n<N nt+l iz0 O<n<N ay, an+ 
i i 
z'do j Z'dn4 ti 
i>o0 40 i>d0 4Nn+1 


Since do, = 1 and dy ; = 0 for i > 1 (recall ay = 1), this says that 


g(nsz) Zz fp ~ See 


O<n<N an+1 b-z An+ 


where 
(Qyyi:zZ) = dni iZ's 
i>0 
i.e., (a,,,,:2Z) is the result of writing a,,,, in base b and then replacing b by the 


(smaller in modulus) number z. It is easy to see that (a,,,:2z)/ay,, 20 as 
N — ©, and the result follows. 
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For Levine’s result take a, = 2”, z = 1, and b an even number greater than 2. 
The only possible carries are 1’s: They occur whenever a digit of a, exceeds 
b/2 — 1. As for (*) note that F, = (a” — B”)/(a — B) where a and B are the 
roots of z* — z — 1 = 0. Take as before a, = 2”. The only carries in the computa- 
tion of 2 - 2” occur when a digit of 2” exceeds b/2 — 1, so that 


_ g(nza) — g(n; B) 
[(n) = ~~ 


The proposition yields 


hence the result («) for b = 10. 


Editorial comment. Several solvers remarked that Dg(n)2~" must be £2 - 10~* 
since the Ath decimal digit of 2” exceeds 4 if and only if the mth binary digit of 
2-10-* is 1. B. M. M. de Weger (The Netherlands) wrote out the decimal 
expansion 2” = La,(n)10*, defined A,(n) to be 1 if a,(n) > 5 and 0 otherwise, 
and deduced Levine’s result from the identity 


gasket 4 4Sk14k-1 
4-5k-l_ntk—-1 
——__ = A,(n)2 . 
5k k 
n=k 


We also note that the reader may find it of interest to contrast this “advanced” 
problem with the “elementary” problem E3353 whose solution is also in this issue. 


Solved also by William Armacost & Terence Shore & Douglas B. Tyler, David Callan, Curtis 
Cooper, Alan Frank, Joachim Herzog & Thomas Maxsein & Paul R. Smith (Germany), Ilias Kastanas, 
Kee-Wai Lau (Hong Kong), Eugene Levine, O. P. Lossers (The Netherlands), Reiner Martin (Germany), 
Thomas McDonald, Ian McGee (Canada ) & Cecil Rousseau, Paul Monsky, Allen J. Schwenk, A. C. 
Thompson (Canada), Jack V. Wales, Jr., and B. M. M. de Weger. One incomplete solution and one 
incorrect solution were received. 
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EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


Basic Hypergeometric Series. By George Gasper and Mizan Rahman, Encyclopedia 
of Mathematics and Its Applications, Great Britain, 1990, xx + 287 pp. 


GEORGE E. ANDREWS 
Department of Mathematics, Pennsylvania State University, University Park, PA 16802 


Reviewers for the MonTHLY are told to write “a chatty essay [that]... does not 
have to be closely tied to the book under review.” At the risk of violating this quite 
sensible suggestion, I shall begin by saying, “‘I love this book! It is great!” 

I want to emphasize how strongly I feel about this achievement because both 
the book and, indeed, the entire subject can be misunderstood and dismissed by 
casual and brief perusal. First of all, the title, while fully consistent with historical 
usage, can, perhaps, mislead the outsider. The word “Basic” does not refer to 
fundamental or elementary. Rather it refers to the ubiquitous parameter “q”’ 
which underlies the entire subject. The next casual mistake is to flip through the 
pages of the book (in under five minutes) which are filled with gigantic formulas 
written in unfamiliar code. The natural response here is: “If this is ‘basic’ 
hypergeometric series, please, God, save me from ‘advanced’ hypergeometric 
series!”’ 

What _is being discussed in this book anyway? Recall the classical Gaussian 


hypergeometric series 


“a(atl)-(atn—Db(b+ 1 (b+n-1) ig 4. 
l+ Lae (edna lypo =2F ("2"), 


n=1 


(1) 


These series and their generalizations are characterized by the rising factorial 
ala + 1)...(a +n — 1). The q-analog or basic analog is defined by replacing the 
ordinary rising factorial by the basic or q-rising factorial: 


(1 — a)(1 — aq) ++: (1 — aq""") = (a3. q)n- (2) 
Thus 
a,b;q,t _ — (4; 4) (D3 4) n h 
24)( c it x (q:a)Acia)n @) 


More generally 


(apie 


—— 


baa |" Ey Casaysbaaye (hia), OD" 


Ss 


[“ Agyeess a.,4, | a (a,; q) ,( 4; Q) n a (a,; Q) n 
(4) 
Well you may ask, “What sort of interesting results can you obtain for these 


things?” 
282 
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I’m glad you asked. For example, we find at the bottom of page 42 of the book 
under review 


aq? 
a,qva , — qva,b,c,d,e, f;4q, bcdef 
sP7 
aq aq aq aq aq 
va,-va,—,—, —, —, — 
b c dee f 


_ (aq, aq/de, aq/df , aq/ef ; 4) « F [cae ee Fina) 
(aq/d,aq/e,aq/f,aq/def;q).  * *\aq/b,aq/c, def/a 


(aq, aq/bc, d,e, f, a’q”/bdef , a*q?/cdef; @).. 
(aq/b, aq/c, aq/d, aq/e, aq/f, a’q*/bcdef , def/aq; @)., 


(5) 


aq /de, aq /df , aq/ef ,a*q* /bcdef ; q, 4 
X43 ; 


a’*q? /bdef , a*q?/cdef , aq* /def 


To most mathematicians such a formula is completely opaque. To the deep insider 
marvellous symmetries peep out here and there; however, selling this to the rest of 
the world as a formula whose intrinsic beauty justifies its existence is a bit much. 

However, (5) does contain some special cases that have elicited some of the 
most rhapsodic prose ever used to describe mathematics. In particular, if f is 
replaced by q~* where N is a positive integer and then b, c,d, e and N > © with 
a — 1, we find (after invocation of Jacobi’s triple product, another special case of 
(5)) 


n2 


00 1 “ q 
haa yaa) * Gada) a) 


n=1 


(6) 


Equation (6) and a companion identity were utilized by Ramanujan to deduce 
some amazing continued fraction formulas, e.g. 


1 5+ 75 1+ V5 | 2 : 
em > J ~2 fe ”) 
+ 
e74t 
1+ 
1+" 


This and related formulae were contained in Ramanujan’s first letter to G. H. 
Hardy. Hardy, in his biographical chapter on Ramanujan [7], discusses this letter at 
length. He begins, “I should like you to begin by trying to reconstruct the 
immediate reactions of an ordinary professional mathematician who receives a 
letter like this from an unknown Hindu clerk.” 

Of (7) and two related continued fraction identities he said, “[they] defeated me 
completely; I had never seen anything in the least like them before. A single look 
at them is enough to show that they could only be written down by a mathemati- 
cian of the highest class. They must be true because, if they were not true, no one 
would have had the imagination to invent them. Finally (you must remember that I 
knew nothing whatever about Ramanujan, and had to think of every possibility), 
the writer must be completely honest, because great mathematicians are com- 
moner than thieves or humbugs of such incredible skill.” 
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Abstract and Concrete Categories. By Jiri Adamek, Horst Herrlich and George 
Strecker, John Wiley and Sons, New York, 1990, xii + 482 pp. 


GABRIELE CASTELLINI 
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Category theory is a relatively young area of mathematics that started to 
develop in the 1940’s and early 1950’s as a means of providing a general theory of 
mathematical structures. In doing so, it unified many existing mathematical con- 
cepts (e.g., product) and it made other somewhat hazy concepts (e.g., naturality ) 
more precise. Its first significant uses were in the areas of algebraic topology and 
homological algebra, but soon afterwards its usefulness as a general language 
caused it to be welcomed into other classical areas of mathematics such as 
topology and algebra. By now it has ingressed into virtually all areas, although its 
major applications in analysis have been slower to develop than in many other 
fields. There is also a recent strong tendency to use category theoretical techniques 
in theoretical computer science. 

But just what is category theory? 

To keep this discussion on a conversational level, we will refrain from giving a 
technically accurate and detailed definition of a category. In order to convey some 
general ideas about the topic, we can say that a category consists of a class of 
objects and a (possibly empty) set of morphisms for each pair of objects. We 
usually denote by f: X — Y a morphism with domain the object X and with 
codomain the object Y. An associative operation called composition is defined on 
the class of all pairs of morphisms that satisfy the following matching condition: 
morphisms f and g have a composite (denoted g  f) provided that the codomain 
of the morphism f agrees with the domain of the morphism g. Each object comes 
with an identity morphism that forms composites with other morphisms in the only 
reasonable way that an identity could. 

Objects of different categories can be quite different in nature. The morphisms 
of a given category have usually been chosen in order to interact nicely with its 
objects. The terminology concrete category is used when there is a given “underly- 
ing” category (such as the category of sets) and objects of the category in question 
are “‘based on” objects of the underlying category. Some familiar examples will 
illustrate this idea. Set (resp. Top, Haus, Grp, TopGrp, Vec, Ban, and Ban,) 
usually denotes the category whose objects are sets (resp., topological spaces, 
Hausdorff spaces, groups, topological groups, vector spaces, Banach spaces, and 
Banach spaces) and whose morphisms are functions (resp. continuous functions, 
group homomorphisms, continuous homomorphisms, linear transformations, linear 
contractions, and bounded linear maps). Each of these is an abstract category. But, 
each also can be considered as a concrete category based on Set, since each has 
objects that are sets with structure and in each case the morphisms are functions 
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between the underlying sets that in some sense “preserve” the structure. However, 
notice that TopGrp can also naturally be viewed as a concrete category over Top, 
and as a concrete category over Grp. These three concrete categories of topologi- 
cal groups have quite different properties from one another, and indeed are all 
different from the abstract category of topological groups. Notice also that Banach 
spaces constitute the objects of two quite natural, but different, concrete categories 
based on sets. 

It should also be pointed out that the definition allows categories that are far 
different from these examples. Objects of a category need not have elements, and 
morphisms need not be functions. As an example, any partially ordered set can be 
viewed as an abstract category whose objects are the members of the set, and 
a <b means that there is precisely one morphism from a to b. Indeed, any such 
partially ordered set also can be viewed as a concrete category over the trivial 
category that has just one object and one morphism. This example demonstrates 
the strong connection between order theory and concrete category theory. 

We see, then, that proofs in category theory cannot be obtained by considering 
points within objects. They must be based solely on the morphisms and how they 
are composed. However, this constraint imposed by the abstraction can often be an 
advantage, since it may give special insights and since any categorical result will 
apply in all categories. The abstraction can build quite quickly. For example, 
categories themselves may be viewed as objects of (quasi)categories, where the 
morphisms between them are called functors, and functors can be considered as 
objects, where the morphisms between them are called natural transformations. 

The most apparent feature of category theory is the language that it provides to 
precisely describe mathematical concepts that otherwise would remain quite vague. 
This helps us to understand that, although different areas of mathematics might at 
first look very different from one another, they may share common features. For 
instance, products of topological spaces are technically different from products of 
groups and from products of vector spaces. If CX;);., is a family of topological 
spaces, then their topological product ILX, with projections (7;: ILX; > Xj); <;, 
satisfies the following condition: for any topological space Y and for any family of 
continuous functions (f;: Y > X;);-,, there exists a unique continuous function 
(f,): Y > ILX; such that 7, °¢f,) = f;, for every j € I. 

If, in the above condition, we replace ‘topological spaces” by ‘“‘groups” (resp. 
“vector spaces”) and replace “continuous functions” by ‘“group-homomorphisms” 
(resp. “linear transformations’), we obtain the notion of direct product of groups 
(resp. product of vector spaces). With category theory all of these notions of 
products (and of course many more) can be obtained at once as the categorical 
product by taking the above condition and replacing “topological spaces” by 
“objects” and replacing “‘continuous functions” by “morphisms.” 

In a similar way scores of concepts in individual abstract and concrete cate- 
gories have suitable categorical analogues for which they are special cases—some- 
times even those concepts that seem to be defined entirely in terms of points. 

For instance, we say that a topological space X is Hausdorff if any two of its 
distinct points have disjoint neighborhoods. This formulation clearly makes no 
sense outside of topology. However a characterization of Hausdorff spaces in 
terms of the closed diagonal is suitable for generalization to any sufficiently nice 
category, since it makes use only of categorical concepts such as diagonal mor- 
phisms and closure operators. 

An obvious but useful consequence of this generality is that any (concrete) 
categorical result gives rise to a number of results, one for each particular 
(concrete) category and only the proof of the categorical one is needed. 
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The Duality Principle for categories pushes this idea even further. Any concept 
in a category can be dualized by basically reversing the direction of the morphisms 
in the category. The interesting consequence is that any true statement in category 
theory will yield a true dual statement. In other words, one can obtain two results 
(whose specializations to particular categories might seem quite unrelated) by 
proving just one of them. 

The book Abstract and Concrete Categories by J. Adamek, H. Herrlich and 
G. E. Strecker makes these ideas especially clear. It presents the fundamental 
concepts of category theory by starting with the most basic ones and then gradually 
moving to the more advanced ones. Thus, the chosen approach is to present the 
special instances of abstract concepts first and the more general ones later. Such a 
method is bound to be more pedagogically successful than the opposite approach 
of giving the most general definitions first. For example, although the important, 
yet abstract, notion of adjoint situation comes in the latter half of the book, it 
comes as the culmination of three separate approaches—via reflections, via 
factorization structures, and via limits. Note that categorical product, mentioned 
above, is just one special case of limit. 

This book is different from other category theory books in that approximately 
half of it is devoted to concrete categories, and the theories of abstract categories 
and concrete categories are developed roughly in parallel. Other texts tend to treat 
concrete categories minimally, if at all. 

Besides providing the basic definitions as well as numerous examples that clarify 
them, the text contains many recent developments in category theory—especially 
within the realm of concrete categories. This makes Abstract and Concrete Cate- 
gories suitable both for the student who is looking for a gentle introduction to the 
topic of category theory and for the researcher who needs a good reference work. 
It has been written with the same pleasant style as the previous volume by the last 
two authors, entitled Category Theory. However, it includes many important topics 
that were absent in the earlier text. Some of these are factorization structures for 
sources, injective hulls, equations and implications, cartesian closed categories, 
quasitopoi, topological universes, monads, and essentially algebraic, topologically 
algebraic, topological, and solid categories (together with corresponding functors 
and structure theorems for them). The reader should notice that at the time the 
previous book was published, in 1973, the study of many of these topics either had 
not yet been born, or was still in its infancy. This book, which includes many of the 
latest developments of the discipline previously only accessible in the research 
literature, was certainly needed. 

To conclude, we would like to mention that the text has extensive and useful 
indices and that each section ends with a collection of exercises and a list of 
suggestions for further reading. Worth noticing is the fact that a few humorous 
illustrations appear now and then throughout the book to break the formality and 
to make the reading more light and enjoyable. We would have liked to have seen 
more of them. 
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General, P, L. Liberal Learning and the Arts and 
Sciences Major, Volumes 1 and 2. Association of 
American Colleges. Volume 1: The Challenge of 
Connecting Learning, 1990, xiii + 33 pp, (ISBN: 0- 
911696-51-2]; Volume 2: Reports From the Fields, 
1991, xiii + 226 pp, (P). [ISBN: 0-911696-50-4] Re- 
ports on a study of twelve undergraduate majors fo- 
cused on connections (both within and beyond the 
major), on context (as an essential component of 
liberal education), and on coherence (principles of 
organization that are apparent to students). The 
mathematics chapter is an abridged version of the 
report that appeared in the November 1990 issue of 
Focus. LAS 


General, S. The Mathematical Funfair. Brian Bolt. 
Cambridge University Pr, 1989, vi + 122 pp, $10.95 
(P). (ISBN: 0-521-37743-9] Over a hundred diverse 
puzzles on familiar themes, including matchstick and 
coin puzzles, railway shunting problems, number 
puzzles, chessboard puzzles, magic squares, tricks, 
and games. Includes detailed commentary and solu- 
tions. LCL 

General, P, L. The Popularization of Mathematics. 
Eds: A.G. Howson, J.-P. Kahane. ICMI Study Ser. 
Cambridge University Pr, 1990, xi + 210 pp, (P). 
[ISBN: 0-521-40867-9; 0-521-40319-7] Reports from 
a September 1989 international conference held in 
Leeds, U.K. to study various approaches to popular- 
izing mathematics—in the media, in schools, in mu- 
seums, in books and magazines, in TV and films, in 
competitions. Presentations illustrate various exam- 
ples and diverse objectives from Australia, Canada, 
China, Czechoslovakia, Denmark, France, Hungary, 
Italy, New Zealand, Spain, United Kingdom, and 
United States. Includes description of a major trav- 
eling exhibition, the “Pop Maths Roadshow,” whose 
opening was part of the Leeds workshop. LAS 


Elementary, T(13: 1). Intermediate Algebra: 
Concepts and Applications, Third Edition. Mar- 
vin L. Bittinger, Mervin L. Keedy, David Ellenbo- 
gen. Addison-Wesley, 1990, xvi + 700 pp, $29.56. 
(ISBN: 0-201-17296-8] Features early introduction 
of graphing, functions, systems of equations, and a 


5-step algorithm for problem solving. This edition 
(a revision of A Problem Solving Approach to Inter- 
mediate Algebra, Second Edition, TR, January 1988) 
adds coverage of order of operations, algebra of func- 
tions, determinants, more inequalities, and new ex- 
ercises. JNC 


Precalculus, T(13: 1). Trigonometry for College 
Students, Fifth Edition. Karl J. Smith. Brooks Cole, 
1991, xv + 410 pp, $42.75. (ISBN: 0-534-13728-8] 
This edition includes all the fundamental identities 
in the first chapter, new sections on inverse func- 
tions and conics in polar form, and sets of miscella- 
neous problems at the end of each chapter. (First 
Edition, TR, June-July 1977; Second Edition, TR, 
October 1980; Third Edition, TR, November 1984.) 
INC 


Precalculus, T(13: 1). Fundamentals of College 
Algebra, Third Edition. Charles D. Miller, Margaret 
L. Lial, David I. Schneider. Scott Foresman, 1990, 
xv + 509 pp, $32. [ISBN: 0-673-38638-4] Cover- 
age of traditional topics includes examples and ex- 
ercises designed to show how algebra techniques are 
used in calculus; this edition includes new histori- 
cal comments and “exercises that test understand- 
ing.” (First Edition, TR, October 1982; Second Edi- 
tion, TR, October 1986.) JNC 


Precalculus, T(13: 1), S, C. Graphing Calculator 
and Computer Graphing Laboratory Manual, Second 
Edition. Franklin Demana, et al. Addison-Wesley, 
1991, iv + 118 pp, $6.36 (P). [ISBN: 0-201-55436-4] 
A companion manual to the Demana and Waits series 
of textbooks on precalculus mathematics, which em- 
phasize calculator and computer-produced graphs. 
This manual teaches readers how to use several com- 
mon graphing utilities, including Texas Instruments, 
Casio, Sharp, and Hewlett-Packard graphing calcula- 
tors, and the Apple IJ, IBM, and Macintosh versions 
of Master Grapher. All told, a valuable compendium 
of information, and for students, a useful alternative 
to full owner’s manuals. PZ 


Precalculus, T, P. Approaching Precalculus Math- 
ematics Discretely: Explorations in a Computer En- 
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vironment. Philip G. Lewis. MIT Pr, 1990, xiv + 
557 pp, $24.95 (P). (ISBN: 0-262-12138-7] Devel- 
ops the concepts of functions, linear transformations, 
vectors, recursion, limits and trigonometric functions 
using Logo; an appendix introduces Logo. JNC 


Finite Mathematics, T(13-14), S, C. Problem 
Solver for Finite Mathematics and Calculus. Ken- 
neth L. Wiggins. PWS-Kent, 1991, xiv + 221 pp, 
$16 (P). (ISBN: 0-534-92439-5] Problem Solver is an 
IBM-PC package (supplied on two 5.25 inch disks; 
not tested by this reviewer) of mathematical rou- 
tines: graphics (2-d plots of functions and tables), 
matrix manipulations, statistical computations, nu- 
merical integration, and root-finding, etc. The book 
itself is both a user manual and a compendium of 
problems (on precalculus, finite mathematics, cal- 
culus, elementary linear algebra, and statistics) to 
which the software can be applied. Each section in- 
cludes at least one example worked in detail. PZ 


Education, L. Sourcebook for Science, Mathemat- 
ics, and Technology Education, 1990-1991. Eds: 
Betty J. Calinger, Barbara Walthall. AAAS, 1990, v 
+ 202 pp, $12.95 (P). Information on organizations, 
offices, agencies, and people who are working to im- 
prove science and mathematics education. Includes 
state and national organizations, public and private 
groups, professional associations and education agen- 
cies, books and journals. Indexed by name, organi- 
zation, state, and publication. Coverage of mathe- 
matics is slim: no mention of the NCTM Standards 
nor of NRC’s Everybody Counts. A useful reference 
for names and addresses. LAS 


Education, P, L*. Mathematics and Cognition. 
Eds: Pearla Nesher, Jeremy Kilpatrick. ICMI Study 
Ser. Cambridge University Pr, 1990, 180 pp, $39.50. 
(ISBN: 0-521-36608-9] A concise, well-referenced 
state-of-the-art survey of research in the psychology 
of mathematics education (PME) prepared over a 
two-year period by the International Group for PME 
which is affiliated with the International Commission 
for Mathematics Instruction (ICMI). Five chapters 
that advance by age level, from arithmetic to calcu- 
lus, are enclosed by survey chapters on epistemology 
and psychology and on future perspectives for re- 
search. LAS 


Education, P*, L. They’re Not Dumb, They’re Dif- 
ferent: Stalking the Second Tier. Sheila Tobias. Re- 
search Corp (6840 E. Broadway Blvd., Tucson, AZ 
85710-2815), 1990, 94 pp, (P). Eyewitness accounts 
of typical introductory university physics and chem- 
istry courses prepared by experienced obseryers— 
humanities graduate students who volunteered to 
take the courses and keep detailed notes of their 
experiences, observations, and feelings. Examples: 
“The lack of community ... produces a totally pas- 
sive classroom experience.” “My class is full of in- 
tellectual warriors ...taught to fear cooperation.” 
“Students key into buzz-words and catch phrases 
... They are not turned on by ideas.” “Everyone, 
including the professor, is geared toward exams.” 
“There are lots of atomized individuals ... physics 
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is not a place where people befriend one another.” 
Mathematicians can learn much about how “second 
tier” students—those who enter college not already 
committed to science—react to typical introductory 
courses. LAS 


Education, P. Black Students and the Mathemat- 
ics, Science, and Technology Pipeline: Turning the 
Trickle into a Flood. Howard University Pr. The 
Journal of Negro Education, 59:3 (Summer 1990), 
281 pp, (P). A special “Yearbook Issue” devoted 
to science and mathematics education, particularly 
about mathematics and especially about algebra— 
which for most minority students is the barrier to the 
mathematics and science pipeline. Includes research 
studies (e.g., on “learned helplessness,” on the effects 
of Eurocentrism on motivation of Afro-American and 
Hispanic students), and project reports (including 
one by Jaime Escalante). A rich resource of ideas, 
experience, and people. LAS 


History, P, L*. Regiomontanus: His Life and 
Work. Ernst Zinner. Transl: Ezra Brown. Stud. 
in History & Philos. of Math., V. 1. North-Holland 
(US Distr: Elsevier Science), 1990, x + 402 pp, 
$82. (ISBN: 0-444-88792-X] An extraordinarily de- 
tailed account of the scholarly life of this unher- 
alded precursor to Copernicus, also known as Jo- 
hannes Molitoris (or Miller) of Konigsberg. Dis- 
coverer of the fifth and sixth perfect numbers, Re- 
giomontanus (1436-1476) was the pre-eminent as- 
tronomer of his age, tabulating and correcting as- 
tronomical tables and working with kings and popes 
on calendar corrections. Regiomontanus’ planetary 
observations laid the foundation for De revolution- 
ibus without ever challenging church geocentric doc- 
trine. LAS 


Foundations, P, L**. Brouwer’s Intuitionism. 
Walter P. van Stigt. Stud. in History & Philos. of 
Math., V. 2. North-Holland (US Distr: Elsevier Sci- 
ence), 1990, xxvi + 530 pp, $94.75. (ISBN: 0-444- 
88384-3] A fascinating intellectual biography trac- 
ing the roots of Brouwer’s philosophical views from 
his angry, arrogant youth through a calculated de- 
tour into topology (to establish mathematical bona 
fides), and back to foundational studies. Includes 
carefully documented expositions of Brouwer’s philo- 
sophical views, his analysis of language and logic, 
and his intuitionist mathematics. Appendices pro- 
vide original text and translations of several impor- 
tant unpublished papers and correspondence. Re- 
veals clearly how personalities combined with intel- 
lectual passions to fuel the “foundation wars” in the 
first part of the twentieth century. LAS 


Graph Theory, S, P. Fuzzy Sets in Information 
Retrieval and Cluster Analysis. Sadaaki Miyamoto. 
Theory & Decision Library, Ser. D, V. 4. Kluwer 
Academic, 1990, x + 259 pp, $99. [ISBN: 0-7923- 
0721-6] Introduction to fuzzy set theory for finite 
sets. Connects these ideas to theory of finite, fuzzy 
graphs. Terse but thorough exposition of major re- 
sults. Accessible to readers with no background in 
fuzzy set theory. Remainder of the text divides into 
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independent chapters applying fuzzy set theory to 
information retrieval (in particular, bibliographic re- 
trieval) and hierarchical cluster analysis. Includes 
many simple examples and a large section devoted 
to various algorithms of clustering schemes. MK 


Graph Theory, T(16-17: 2), P. Decompositions 
of Graphs. Juraj] Bosak. Math. & Its Applic., V. 47. 
Kluwer Academic, 1990, xviii + 248 pp, $112. (ISBN: 
0-7923-0747-X] Begins with basic concepts and defi- 
nitions and proceeds rapidly to an in-depth study of 
decompositions of graphs into isomorphic subgraphs. 
While prerequisites are quite minimal, a certain de- 
gree of mathematical sophistication is necessary. Ex- 
tensively referenced, it includes a 36-page bibliog- 
raphy of relevant research papers. Conjectures and 
open problems sprinkled throughout. AD 


Discrete Mathematics, T(13-14: 1, 2). Foun- 
dations of Discrete Mathematics. Peter Fletcher, 
Hughes Hoyle, C. Wayne Patty. PWS-Kent, 1991, 
Xvill + 781 pp, $35. (ISBN: 0-534-92373-9] Begins 
with elementary logic, methods of proof, sets, func- 
tions, induction, and recursion. Other chapters ex- 
amine algorithms, counting, Boolean algebra, equiva- 
lence relations, order, and formal languages. A chap- 
ter on matrices constitutes a brief course in linear 
algebra. Treatment of graph theory and trees is ex- 
tensive (171 pages). Advanced chapter on combi- 
natorics discusses generating functions. No calculus 
background necessary, only two years of high school 
algebra. AD 


Discrete Mathematics, T(13: 1). Discrete Math- 
ematics for Computing. A. Vince, C. Morris. Math. 
& Its Applic. Ellis Horwood (US Distr: Simon & 
Schuster), 1990, 297 pp, (P). (ISBN: 0-13-217522- 
3] Assumes little formal mathematical background. 
Covers sets, logic, data representation and manipu- 
lation, graph theory and associated algorithms, rela- 
tions, functions, combinatorics, probability, and er- 
ror correcting codes, all at a very elementary level. 
Examples of algorithms presented in a language-free 
manner. Answers for exercises supplied. AD 


Number Theory, $(17-18), P, L. Representation 
Theory and Automorphic Functions. I.M. Gel’fand, 
M.I. Graev, J.J. Pyatetskii-Shapiro. Transl: K.A. 
Hirsch. Academic Pr, 1990, xviii + 426 pp, $39.50 
(P). (ISBN: 0-12-279506-7] Reprint of a 1968 trans- 
lation. A valuable resource for anyone venturing into 
the Langlands program. BC 


Linear Algebra, T(16-17). Eztreme Properties of 
Linear Transformations. Ali R. Amir-Moéz. Polyg- 
onal, 1990, vii + 166 pp, $24. [ISBN: 0-936428-12-0] 
Hermitian, positive, unitary properties of transfor- 


mations on inner-product spaces. Exercises; terse 
style. MLR 


Group Theory, P. Combinatorial Group Theory. 
Eds: Benjamin Fine, Anthony Gaglione, Francis 
C.Y. Tang. Contemp. Math., V. 109. AMS, 1990, 
xii + 191 pp, $37 (P). [ISBN: 9-8218-5116-0] The 
proceedings from the AMS special session on combi- 
natorial group theory and infinite groups held at the 
University of Maryland, April 1988. The papers in 
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this volume represent a mix of pure research papers 
and expository papers; together they convey a sense 
of the direction in the field. LCL 


Algebra, S(18), P. Automata and Algebras in Cat- 
egortes. JifiAdamek, Véra Trnkovd. Math. & Its 
Applic. Kluwer Academic, 1990, xiv + 474 pp, $167. 
(ISBN: 0-7923-0010-6] Chapters 1 and 2 introduce 
the basic concepts of sequential and tree automata 
leading into work of Arbib and Manes. Succeed- 
ing chapters treat free algebras, minimal realiza- 
tion and reduction, universal realization, and non- 
deterministic automata. Chapter notes, references, 
index. JS 


Algebra, P. Computers in Algebra. Ed: Martin 
C. Tangora. Lect. Notes in Pure & Appl. Math., 
V. 111. Marcel Dekker, 1988, viii + 162 pp, $89.75 
(P). (ISBN: 0-8247-7975-4] Papers from a confer- 
ence on computers in algebra held at the University 
of Illinois at Chicago in December 1985. Most of 
the papers are on finite groups and such problems 
as enumerating cosets, finding efficient presentations, 
constructing explicit representations, and computing 
cohomology. JAS 


Algebra, T(16-17: 1), L. Steps in Commutative 
Algebra. R.Y. Sharp. London Math. Soc. Stud. 
Texts, V. 19. Cambridge University Pr, 1990, xi + 
321 pp, $49.50; $17.95 (P). (ISBN: 0-521-39338-8; 0- 
521-39732-4] A gentle introduction, assuming only 
basic abstract algebra. Some exercises. JD-B 


Algebra, P. Lecture Notes in Mathematics-1428: 
Blocks of Tame Representation Type and Related Al- 
gebras. Karin Erdmann. Springer-Verlag, 1990, xv + 
312 pp, $32.20 (P). (ISBN: 0-387-52709-5] Studies 
algebras associated to blocks of tame representation 
type. These are the 2-blocks whose defect groups 
are dihedral or semi-dihedral or (generalized) quater- 
nion. LC 


Calculus, T**(13-14: 2, 3). Calculus. Gilbert 
Strang. | Wellesley-Cambridge Pr (Box 82-279, 
Wellesley, MA 02181), 1991, xv + 655 pp, $50 
(Libraries). (ISBN: 0-9614088-2-0] A lively fresh 
look at calculus by a master expositor: conventional 
topics cast in new light and used for important pur- 
poses. Employs several standard examples (e.g., 
speedometer/odometer, electrocardiogram) through- 
out to provide continuous grounding in real prob- 
lems. Introduces computer and calculator methods 
as well as mathematical ideas rooted in these tools 
(e.g., iteration, chaos from Newton’s method, pat- 
terns in the graph of sinn for integral n,0 <n < 
10000). Contains, as expected, several thousand ex- 
ercises, including regular “fill-in-the-blank” passages 
in lieu of chapter summaries. Emphasizes pairs of 
functions f(t) and v(t) (distance and velocity), dif- 
ferential equations (beginning in Chapter 2), and vi- 
sualization (graphs and zooms). The real innovation, 
however, is in the exposition: direct, refreshing, en- 
thusiastic, personal. LAS 

Real Analysis, P. Analysis II: Convez Analysis 
and Approzimation Theory. Ed: R.V. Gamkrelidze. 
Encyclop. of Math. Sci., V. 14. Springer-Verlag, 
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1990, 255 pp, $59.50. (ISBN: 0-387-18179-2] Basic 
ideas of convex analysis (subspaces, convex cones, 
duality, linear and convex programming, etc.) and 
approximation theory (classical problems and meth- 
ods, and “best” approximation by subspace, Fourier, 
linear and nonlinear methods). Extensive reference 
section. MLR 


Complex Analysis, P. Lecture Notes in Mathemat- 
ics-1495: Computational Methods and Function The- 
ory. Eds: St. Ruscheweyh, et al. Springer-Verlag, 
1990, vi + 211 pp, $22 (P). [ISBN: 0-387-52768-0] 
Seventeen papers on aspects of complex functions 
theory—conformal mapping, univalent functions, ex- 
tremal problems—comprise partial proceedings of a 
March 1989 conference held at Valparaiso, Chile. 
Computer and numerical methods appear frequently 
throughout the papers. PZ 


Partial Differential Equations, T(17-18), P. 
The Analysis of Linear Partial Differential Operators 
I: Distribution Theory and Fourier Analysis, Sec- 
ond Edition. Lars Hormander. Grund. der math. 
Wissenschaften, B. 256. Springer-Verlag, 1990, xi + 
440 pp, $69, [ISBN: 0-387-52343-X]; Study Edition, 
$39.50 (P). This edition includes exercises with an- 
swers and hints. (First Edition, TR, March 1984; 
Extended Review, December 1985.) MLR 


Partial Differential Equations, P. Singular Per- 
turbations I: Spaces and Singular Perturbations on 
Manifolds without Boundary. Leonid S. Frank. Stud. 
in Math. & Its Applic., V. 23. North-Holland (US 
Distr: Elsevier Science), 1990, xxiv + 555 pp, $89.75. 
(ISBN: 0-444-88134-4] The first of two volumes de- 
voted to singular perturbations, treating here linear 
perturbations on smeoth manifolds without bound- 
ary. Chapter 1 develops the necessary background 
in functional analysis and distributions; Chapter 2 
discusses Sobolev spaces; and Chapter 3 deals with 
elliptic and hyperbolic perturbations, using a variety 
of techniques. Notes, extensive bibliography. JS 


Partial Differential Equations, P. Nonlinear 
Evolution Equations That Change Type. Eds: Bar- 
bara Lee Keyfitz, Michael Shearer. Inst. for Math. 
& Its Applic., V. 27. Springer-Verlag, 1990, xiv + 
284 pp, $35. (ISBN: 0-387-97353-2] Many simple 
models of fluid flow, including granular flow, multi- 
phase flow, and flow in porous media, involve partial 
differential equations that change type. The papers 
in this volume, based on the 1988-89 IMA Program 
on Nonlinear Waves, address numerical and analyti- 
cal difficulties posed by change of type. JO 


Partial Differential Equations, C. PLTMG: A 
Software Package for Solving Elliptic Partial Differ- 
ential Equations, Users’ Guide 6.0. Randolph E. 
Bank. Frontiers in Appl. Math., V. 7. SIAM, 1990, 
xiv + 164 pp, $22.50 (P). (ISBN: 0-89871-255-6] A 
reference manual for a software package for solving a 
general class of scalar, parameter dependent, elliptic 
partial differential equations in general regions of the 
plane. JNC 


Functional Analysis, T(18: 1, 2), S, P. Func- 
tional Analysis: An Introduction for Physicists. 
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Nino Boccara. Academic Pr, 1990, xiii + 327 pp, 
$44.50. (ISBN: 0-12-108810-3] Burdened by nei- 
ther “paralyzing rigor” or “barren formalism” the au- 
thor’s intent is to make the basic concepts of modern 
functional analysis intelligible and useful for theoret- 
ical physicists. Many (but not all) proofs are given, 
with problem sets (and solutions) to enhance under- 
standing. Chapters treat measure and intregration, 
Lebesgue spaces, Hilbert spaces, distributions, and 
linear operators. References, index. JS 


Functional Analysis, T(17-18), S, P, L. C* 
Algebras and Operator Theory. Gerard J. Murphy. 
Academic Pr, 1990, x + 286 pp, $44.50. (ISBN: 0- 
12-511360-9] Contains an accessible introduction to 
core material (spectral theory, C*-algebras, Hilbert 
space operators, ideals and positive functions, von 
Neumann algebras, representations of C*-algebras), 
plus the treatment of topics of high contemporary 
profile (including C*-tensor products and K-theory). 
The careful presentation, choice of topics, and num- 
ber of exercises make this useful as a course text. KS 


Functional Analysis, P. Analysis of Toeplitz Op- 
erators. Albrecht Bottcher, Bernd Silbermann. 
Springer-Verlag, 1990, 512 pp, $79. (ISBN: 0-387- 
52147-X] A Toeplitz operator T(a) : H? — H? for 
AeL™ and 1 < p < oo is defined by f — P(af) 
where P is the Riesz projection. These operators 
have uses in (for example) physics, probability, and 
information theory as well as in operator theory and 
function theory. The book is intended for both be- 
ginners and specialists. MLR 


Analysis, P. Mathematical Structures of Nonlinear 
Science: An Introduction. Melvyn S. Berger. Non- 
linear Topics in Math. Sci., V. 1. Kluwer Academic, 
1990, xii + 421 pp, $99. [ISBN: 0-7923-0728-3] A 
book that grew out of four lectures the author gave at 
the University of California: New ideas on integrabil- 
ity of nonlinear systems; vortex phenomena in fluids 
(a classic topic that stands to benefit from new math- 
ematical ideas of nonlinear science); new connections 
between global differential geometry and nonlinear 
analysis; superconductivity—the simplest nonlinear 
gauge theory that works. AWR 


Analysis, T(18), S, P, L. The Analytical and 
Topological Theory of Semigroups: Trends and De- 
velopmenis. Eds: Kar! Heinrich Hofmann, Jimmie 
D. Lawson, John S. Pym. Walter de Gruyter, 1990, 
xi + 398 pp, DM 138. (ISBN: 0-89925-698-8] A 
comprehensive survey of the current state of research 
in the theory of semigroups and its applications. This 
volume emphasizes the relationships with topology 
and analysis (functional and harmonic), and is pre- 
sented in four parts: Lie theory and algebraic ge- 
ometry; structure of compact semigroups; functional 
analysis on semigroups; applications to systems the- 
ory, number theory, probability, and topology. KS 


Algebraic Geometry, P. Sheaves on Manifolds. 
Masaki Kashiwara, Pierre Schapira. Grund. der 
math. Wissenschaften, B. 292. Springer-Verlag, 
1990, x + 512 pp, $89. [ISBN: 0-387-51861-4] In- 
tended to present a self-contained exposition of sheaf 


292 


theory in the context of derived categories. Includes 
a short history (in French) of the subject. JO 


Differential Geometry, T(18), S, P. An In- 
troduction to CR Structures. Howard Jacobowitz. 
Math. Surveys & Mono., No. 32. AMS, 1990, x + 
237 pp, $74. [ISBN: 0-8218-1533-4] Very readable 
introduction to the subject of CR structures. Begins 
with counterexamples to the Riemann mapping the- 
orem in higher dimensions involving CR manifolds. 
Focuses on the equivalence problem as solved by Car- 
tan and by Chern and Moser. Concludes by showing 
the nonsolvability of the Lewy operator. Accessible 
to anyone who has taken a graduate course in differ- 
ential geometry. OJ 


Differential Geometry, T(18: 1), S, P. Subman- 
tfolds and Isometric Immersions. Marcos Dajczer, et 
al. Math. Lect. Ser., V. 13. Publish or Perish, 1990, 
vill + 173 pp, $26. (ISBN: 0-914098-22-5] Text for 
advanced graduate course in differential geometry. 
Uses flat bilinear forms to prove rigidity theorems. 
Assumes knowledge of graduate differential geome- 
try, but reviews many of the basic facts and theo- 
rems. Well-written and well-referenced. Exercises, 
bibliography. OJ 

Algebraic Topology, P. Classical Aspherical Man- 
ifolds. F. Thomas Farrell, L. Edwin Jones. CBMS 
Reg. Conf. Ser. in Math., No. 75. AMS, 1990, vii + 54 
pp, $24 (P). [ISBN: 0-8218-0726-9] Lectures from 
the NSF-CBMS Conference on K-theory and Dy- 
namics held in Gainesville, Florida in January 1989. 
Includes an epilogue with results obtained since the 
conference. SP 


Topology, P. Open Problems in Topology. Eds: Jan 
van Mill, George M. Reed. North-Holland (US Distr: 
Elsevier Science), 1990, xiv + 692 pp, $92.25. (ISBN: 
0-444-88768-7] Designed as a source book for thesis 
problems and a challenge to top researchers. Offers 
a collection of unsolved conjectures and problems in 
various areas of topology (e.g., general and point-set, 
computer science oriented, geometric and algebraic, 
etc.) submitted by researchers in each of the fields. 
Description of each problem and of related results 
and conjectures is given. Indexes for easy reference. 
An excellent resource. OJ 


Dynamical Systems, P. Decision Processes in Dy- 
namic Probabilistic Systems. Adrian V. Gheorghe. 
Math. & Its Applic., V. 42. Kluwer Academic, 1990, 
xvii + 354 pp, $112. (ISBN: 0-7923-0544-2] Brings 
together theory of Markov and semi-Markov pro- 
cesses, dynamical and linear programming, and de- 
cision process theory with either complete or partial 
information. Mathematics is advanced, examples are 
scarce, notation is non-standard, and exposition is 
terse. Includes Fortran programs to calculate transi- 
tion probabilities, first passage times, and other char- 
acteristics of Markov processes. MK 


Dynamical Systems, P. Nonlinear Structures in 
Physical Systems: Pattern Formation, Chaos, and 
Waves. Eds: Lui Lam, Hedley C. Morris. Springer- 
Verlag, 1990, x + 331 pp, $49. [ISBN: 0-387-97344-3] 
Collection of thirty-five short papers from the Second 
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Woodward Conference held in November 1989 at San 
Jose State University. Topics include pattern forma- 
tion of bubbles; viscous fingers; chemical systems; 
frustrated systems; chaos; self-organized criticality; 
traveling waves, and various nonlinear structures in 
magnetic systems, plasmas, polymers, thin films, and 
biological systems. SP 

Dynamical Systems, T(17), S*, P, L. Perspec- 
tives of Nonlinear Dynamics, Volume 2. E. Atlee 
Jackson. Cambridge University Pr, 1990, xviii + 
632 pp, $59.50. (ISBN: 0-521-35458-7] Extensive 
introduction written by a physicist for graduate stu- 
dents and researchers in the sciences and engineering. 
Many helpful illustrations, historical notes, exercises, 
and examples. Topics in this volume include maps 
in R? and T?, twist maps, Lorenz, KAM, Lyapunov 
exponents, solitons, KdV, and much more. SP 


Dynamical Systems, P. Mathematics of Nonlinear 
Science. Ed: Melvin S. Berger. Contemp. Math., V. 
108. AMS, 1990, xi + 153 pp, $32 (P). (ISBN: 0-8218- 
5114-4] Proceedings of AMS Special Session held in 
January 1989 at Phoenix, Arizona. Papers deal pri- 
marily with new ideas about non-chaotic phenomena 
in physical science. Topics include instability, global 
extensions of KAM, integrable systems, bifurcation 
analysis in fluids, and gauge theory. SP 


Dynamical Systems, P. Identifiability, Recursive 
Identification and Spaces of Linear Dynamical Sys- 
tems. B. Hanzon. Mathematisch Centrum, 1989, 
(ISBN: 90-6196-371-0]. Part I, CWI Tract, V. 63, 
223 pp, Dfl. 58 (P); Part IJ, CWI Tract, V. 64, 413 
pp, Dfl. 48 (P). Results of the author’s research over 
the past ten years on the problem of mode! identifi- 
cation. More specifically, among all possible models 
that might describe a certain phenomenon, how does 
one choose the most nearly correct one? In Part I the 
author considers representations of linear dynamical 
systems, deterministic and stochastic, and describes 
these as abstract objects. He imposes a topology so 
that one can decide when models are close and when 
far apart. The set of stochastic linear systems of 
a fixed degree form a differentiable manifold under 
certain conditions. Part IJ investigates an algorithm 
that uses overlapping parametrizations to investigate 
this model set. LCL 


Control Theory, P. Nonlinear Controllability and 
Optimal Control. Ed: H.J. Sussmann. Pure & Appl. 
Math., V. 133. Marcel Dekker, 1990, viii + 472 
pp, $115. (ISBN: 0-8247-8258-5] Contributions to 
a 1987 workshop in Finite Dimensional Controllabil- 
ity and Optimal Control held at Rutgers University, 
this volume is intended to serve as an overview of 
recent efforts to put new tools of differential geomet- 
ric control theory to work on such old control theory 
problems as optimal trajectories, system invertabil- 
ity, sampling, and more. AWR 


Probability, P. Lecture Notes in Mathematics- 
1426: Séminaire de Probabilités XXIV, 1988/89. 
Eds: J. Azéma, P.A. Meyer, M. Yor. Springer- 
Verlag, 1990, v + 490 pp, $47.90 (P). [ISBN: 0-387- 
52694-3] Conference proceedings; contains 39 pa- 
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pers, 15 in English, the rest in French. RSK 


Probability, T(18), P. Entropy and Information 
Theory. Robert M. Gray. Springer-Verlag, 1990, 
xxiii + 332 pp, $53.50. (ISBN: 0-387-97371-0] A 
well-written, research-level treatment of the theory 
of probability information measures and their appli- 
cation to coding theorems for information sources 
and noisy channels. Begins with basic definitions 
as originally introduced by Shannon. Covers stan- 
dard stationary ergodic theorems and rates, diver- 
gence, relative entropy, and ergodic theorems for den- 
sities. Many of the examples and applications are to 
Markovian systems. A large section of examples of 
channels and codes, as well as chapters on distor- 
tion, source coding theorems, and coding for noisy 
channels. Large reference list. MK 


Probability, P. Continuous Bivariate Distribu- 
tions, Emphasising Applications. T.P. Hutchinson, 
C.D. Lai. Rumsby Scientific (POB 76, Rundle 
Mall, Adelaide, South Australia 5000), 1990, xxxi 
+ 412 pp, $63. (ISBN: 0-7316-7206-2] Comprehen- 
sive compilation of probability distributions involv- 
ing two variables, including ways they are generated. 
Extensive collection of examples of their use in mod- 
elling data. RSK 


Stochastic Processes, T(16-17), S, L. Proba- 
bility, Statistics, and Queueing Theory With Com- 
puter Sctence Applications, Second Edition. Arnold 
O. Allen. Comput. Sci. & Scientific Comput. Aca- 
demic Pr, 1990, xxi + 740 pp, $49.95. (ISBN: 0- 
12-051051-0] ‘Truly three books in one: probability 
theory, statistical inference, and queueing theory as 
they apply to computer science problems. Sections 
are terse, but cover standard topics, include good 
examples, and contain a wide variety of exercises. 
Queueing theory section covers models for computer 
systems such as Jackson networks and BCMP Queue- 
ing Networks. Makes use of several software pack- 
ages: MINITAB, SAS/STAT, EXPLORE, APL, and 
Mathematica. An appendix contains Mathematica 
programs to assist understanding of queueing mod- 
els. Philosophy is clearly how theorems and the- 
ory are used in practical computer science problems. 
(First Edition, TR, March 1979.) MK 


Computational Statistics, T(16-17), L. An In- 
troduction to Generalized Linear Models. Annette J. 
Dobson. Chapman & Hall, 1990, x + 174 pp, $52.50. 
[ISBN: 0-412-31100-3] An undergraduate-graduate 
level treatment of generalized linear models. Very 
accessible exposition full of examples. Incorporates 
both MINITAB and GLIM examples. Standard ,top- 
ics: exponential families, generalized linear models, 
estimation, inference (mostly likelihood-based), mul- 
tiple regression (includes diagnostics, model selec- 
tion, non-linearity), ANOVA, ANCOVA, logistic re- 
gression, and log-linear models. MK 


Computational Statistics, S. How SAS Works: 
A Comprehensive Introduction to the SAS System, 
Second Edttion. Paul A. Herzberg. Springer-Verlag, 
1990, xvii + 193 pp, $29 (P). [ISBN: 0-387-97291- 
9] Written both as an introduction and as a bridge 
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to the more extensive reference manuals published 
by the SAS Institute. Concentrates on the DATA 
steps used to input, create, and modify data, while 
limiting the discussion of PROC (procedure) steps to 
descriptive statistics. RSK 


Statistics, P*. Asymptotics in Statistics: Some 
Basic Concepts. Lucien Le Cam, Grace Lo Yang. 
Ser. in Stat. Springer-Verlag, 1990, viii + 180 pp, 
$29. (ISBN: 0-387-97372-9] Updated rewriting of 
a set of lectures given by Le Cam in 1968, orig- 
inally published in French. Concentrates on local 
asymptotic normality (LAN) conditions, but now 
also includes a discussion of the locally asymptoti- 
cally mixed normal case (LAMN). Each of the six 
non-introductory chapters concludes with interesting 
historical remarks. RSK 


Statistics, T(16-17: 1, 2), P. Statistical Design 
and Analysis of Experiments with Applications to 
Engineering and Science. Robert L. Mason, Richard 
F. Gunst, James L. Hess. Ser. in Prob. & Math. Stat. 
Wiley, 1989, xvi + 692 pp, $64.95. (ISBN: 0-471- 
85364-X] Nontheoretical practitioner’s guide, assum- 
ing only college algebra. Stresses the strategy of ex- 
perimentation, data analysis, and the interpretation 
of experimental results. Main sections are on experi- 
mental design (presented separately from the anal- 
ysis), analysis of data from designed experiments, 
and regression modeling. Includes such topics as 
fractional factorial experiments, response-surface de- 
signs, multiple comparison procedures, analysis of 
covariance, and outlier detection. RSK 


Statistics, P. Proceedings of the Thirty-Fifth Con- 
ference on the Design of Experiments. US Army Re- 
search Office (POB 12211, Research Triangle Park, 
NC), 1989, xii + 327 pp, (P). Proceedings from 
the thirty-fifth conference on the Design of Exper- 
iments in Army Research, Development and Test- 
ing, which resiliently survived the San Francisco-area 
earthquake in October 1989. Includes a special ses- 
sion on statistical issues related to combat modelling. 
Other invited topics include statistical graphics, an 
overview of statistical methods for categorical data, 
and an appraisal of several multistage selection pro- 
cedures. MK 

Statistics, T*(17: 1), C, L. Regression Analy- 
sis: Theory, Methods, and Applications. Ashish Sen, 
Muni Srivastava. Texts in Stat. Springer-Verlag, 
1990, xv + 347 pp, $39. [ISBN: 0-387-97211-0] Up- 
to-date account of the theory and methods of regres- 
sion analysis, assuming a background of mathemati- 
cal statistics. First part covers least squares regres- 
sion assuming the Gauss-Markov conditions. Second 
part presents methods to alleviate the effects of vio- 
lations of these conditions. Third part discusses mul- 
ticollinearity, variable search, and biased estimation. 
Includes a computer disk which contains most of the 
over 50 data sets used in the text to illustrate the 
material. RSK 

Statistics, P. Statistical Methods in Longitudinal 
Research. Ed: Alexander von Eye. Stat. Mod- 
eling & Decision Sci. Academic Pr, 1990, $74.95 
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each. Volume I: Principles and Structuring Change, 
xiv + 273 pp, (ISBN: 0-12-724960-5]; Volume II: 
Time Series and Categorical Longitudinal Data, xiv 
+ 328 pp. (ISBN: 0-12-724962-1] Collection of con- 
tributed papers from methodological researchers of 
longitudinal analyses in the social sciences. Numer- 
ous examples and applications including discussions 
of computer implementation. Topics: change mea- 
surement, missing data estimation, alternative anal- 
yses of repeated-measure designs, longitudinal factor 
analysis, latent variable structural equation models, 
scaling, analysing event histories, linear and nonlin- 
ear curve fitting, multivariate time series models, de- 
scriptive and associative developmental models, and 
analysis of categorical longitudinal data. MK 


Statistics, P*. U-Statistics: Theory and Practice. 
A.J. Lee. Stat.: Textbooks & Monographs, V. 110. 
Marcel Dekker, 1990, xi + 302 pp, $79.75. (ISBN: 0- 
8247-8253-4] “Reasonably comprehensive” descrip- 
tion of the theory of U-statistics (which include many 
of the common statistics in use today) that has ac- 
cumulated over the past forty years, with particular 
attention to asymptotic theory. Also includes a se- 
lection of applications of this theory. RSK 


Computer Literacy, S. Calculator Logic Systems 
and Mathematical Understandings. Enid R. Bur- 
rows. NCTM, 1990, viii + 133 pp, $22.50 (P). 
[ISBN: 0-87353-295-3] Detailed discussion with nu- 
merous examples of the major calculator operat- 
ing systems—algebraic logic (keys executed in order 
pressed), operator logic (operations preserve logical 
hierarchy), and reverse Polish notation (operators 
act as functions on input stack). Focuses solely on 
button-pushing with little discussion of the impor- 
tant implications of these diverse systems for math- 
ematics or pedagogy. LAS 


Algorithms, T(17-18: 2), S, P. Combinatorial 
Algorithms for Integrated Circuit Layout. Thomas 
Lengauer. Ser. in Comput. Sci. Wiley, 1990, xxviii 
+ 697 pp, $59.95. (ISBN: 0-471-92838-0] Introduc- 
tion to, the theory of, and then latest results on com- 
binatorial problems on integrated circuit layout and 
their algorithmic solutions. Unified theory and re- 
sults for both CAD practitioners and algorithm the- 
oreticians. Includes graph algorithms, optimization 
techniques, operations research, and statistics results 
as needed for layout optimization; also includes ac- 
tual layout problems and solutions like global and 
detailed routing. JPH 

Computer Systems, T**(15-17: 1), L*. Op- 
erating System Concepts, Third Edition. Abraham 
Silberschatz, James L. Peterson, Peter B. Galvin. 
Addison-Wesley, 1991, xviii + 696 pp, $47.50. (ISBN: 
0-201-51379-X] A substantially revised and updated 
edition of a popular textbook that focuses on funda- 
mental concepts. In addition to bringing some of the 
older material up-to-date and adding new exercises, 
the authors have added new chapters on distributed 
file systems and on the Mach operating system. AO 


Artificial Intelligence, $(18), P. Artificial In- 
telligence in Process Engineering. Ed: Michael L. 
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Mavrovouniotis. Academic Pr, 1990, xiv + 367 pp, 
$49.95. [ISBN: 0-12-480575-2] A collection of pa- 
pers illustrating the application of artificial intelli- 
gence methods to chemical and biochemical process 
engineering. The goal of the editor is to acquaint pro- 
cess engineers and artificial intelligence practitioners 
with each others’ tools and problems. The desired re- 
sult is more active collaboration between these two 
communities. SM 


Artificial Intelligence, P, L. Connectionist Robot 
Motion Planning: A Neurally-Inspired Approach to 
Visually-Guided Reaching. Bartlett W. Mel. Per- 
spectives in A.J., V. 7. Academic Pr, 1990, xvi + 
165 pp, $29.95. (ISBN: 0-12-490020-8] This book 
describes MURPHY, a robot arm and camera sys- 
tem whose task is to reach for objects while avoiding 
obstacles. The software controlling MURPHY’s mo- 
tion is based on a connectionist model of learning by 
experience. JO 


Applications, S(15-16). Life Insurance Mathe- 
matics. Hans U. Gerber. ‘Transl: W. Neuhaus. 
Springer-Verlag, 1990, xiii + 131 pp, $49.50. (ISBN: 
0-387-52944-6] ‘Translation of a 1986 Swiss mono- 
graph. Concise introduction to the mathematics of 
life contingencies, assuming some familiarity with 
probability theory. RSK 


Applications (Biological Science), S(18), L. 
Mathematical Ecology of Plant Species Competition: 
A Class of Deterministic Models for Binary Miz- 
tures of Plant Genotypes. Anthony G. Pakes, Ross 
A. Maller. Stud. in Math. Biology. Cambridge Uni- 
versity Pr, 1990, xiii + 193 pp, $49.50. (ISBN: 0- 
521-37388-3] Presents two mathematical (not com- 
puter) models describing the interaction of two 
species of annual plants. The existence of both “soft” 
and “hard” seeds plays an integral role in these mod- 
els. SM 


Applications (Electrical Engineering), T(18: 
2), P, L. Semiconductor Equations. P.A. Marko- 
wich, C.A. Ringhofer, C. Schmeiser. Springer- 
Verlag, 1990, x + 248 pp, $54.50. (ISBN: 0-387- 
82157-0] A text on the drift diffusion equations 
modelling the charge transport in semi-conductors. 
The book arrives at these equations via a unified 
discussion of the hierarchy of semi-conductor models 
starting with the kinetic transport equations. The 
material is addressed to applied mathematics, elec- 
trical engineers, and solid state physicists. Includes 
problems and extensive references. MPR 


Applications (Engineering), P. Proceedings of 
the Third European Conference on Mathematics in 
Industry. Eds: John Manley, Sean McKee, David 
Owens. European Consort. for Math. in Industry, 
V. 5. BG Teubner Stuttgart (US Distr: Kluwer Aca- 
demic), 1990, xii + 564 pp, $127. {ISBN: 0-7923- 
0807-7] Motivated by a desire to acquaint more 
mathematicians with the needs and opportunities for 
people adept at using mathematical models in indus- 
try, the European Consortium for Mathematics in 
Industry was formed. This book reflects the inclu- 
siveness of the organization: vibrations of overhead 
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transmission lines, temperature surges in thermis- 
tors, spiral turns by means of forming cutters, op- 
timization in large gas networks, dairy management, 
and more. There are several general articles on such 
topics as supercomputers, simulation, etc. AWR 


Applications (Engineering), P. Mathematics in 
Industrial Problems, Part 3. Avner Friedman. Math. 
& Its Applic., V. 31. Springer-Verlag, 1990, xiii + 
187 pp, $24. (ISBN: 0-387-97436-9] The third in 
a series produced by the Institute of Mathematics 
and Its Applications at the University of Minnesota. 
The seventeen problems in this volume have been 
collected by the author with two criteria in mind. 
The problems listed have a bona fide application to a 
manufacturing problem, and simultaneously present 
an opportunity for mathematical novelty. Each de- 
scription is short, giving a definition of the problem, a 
brief overview of current techniques with references, 
and a list of relevant open questions. SM 


Applications (Engineering), T(16-17: 1). Vari- 
ational and Finite Element Methods: A Symbolic 
Computation Approach. A.I. Beltzer. Springer- 
Verlag, 1990, xi + 254 pp, $59.50. (ISBN: 0-387- 
51598-4] An introduction to variational and finite 
element methods for undergraduate engineering stu- 
dents. It is distinguished by its use of a dialect of 
MACSYMA as a tool that allows the student to fo- 
cus on the ideas presented. AO 


Applications (Physics), T(17-18: 1), S, P, L. 
The Quantum Statistics of Dynamic Processes. E. 
Fick, G. Sauermann. Transl: W.D. Brewer. Ser. in 
Solid-State Sci., V. 86. Springer-Verlag, 1990, xiii + 
395 pp, $79. [ISBN: 0-387-50824-4] Graduate-level 
introduction to methods and applications of statisti- 
cal physics within a quantum-theoretical framework. 
Covers basic concepts and response and relaxation 
theory for small perturbations from equilibrium. In- 
cludes exercises, with solutions. BC 


Applications (Physics), P. Lecture Notes in 
Physics-359: Problems Involving Change of Type. 
Ed: K. Kirchgassner. Springer-Verlag, 1990, xii + 
207 pp, $32.40. [ISBN: 0-387-52595-5] Sixteen pa- 
pers on variational problems, phase transitions and 
dynamics, viscoelasticity, hyperbolic-elliptic prob- 
lems, and degenerate parabolic equations—in short, 
things that spontaneously change type. BC 


Applications (Physics), $(16-18), P, L. A De- 
velopment of the Equations of Electromagnetism in 
Material Continua. Harry F. Tiersten. Tracts in 
Natural Philos., V. 36. Springer-Verlag, 1990, xi + 
156 pp, $49. [ISBN: 0-387-97241-2] Based on lec- 
ture notes for a graduate course in mechanics. Great 
attention is given to developing the equations of 
electrostatics, magnetostatics, and electromagnetism 
from simply-stated first principles. The interplay 
between the careful mathematics and the concrete 
boundary conditions is very illuminating. A fasci- 
nating excursion for anyone familiar with Maxwell’s 
equations. MU 
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Applications (Physics), T(18: 1, 2). Proba- 
bility and Heat: Fundamentals of Thermostatistics. 
Friedrich Schlogl. Friedr. Vieweg & Sohn (US Distr: 
GLP International Vieweg), 1989, xii + 249 pp, DM 
98. (ISBN: 3-528-06343-2] Derives basic thermody- 
namic theory from a probabilistic analysis of molecu- 
lar microdynamics, and then again from phenomeno- 
logical postulates, in an attempt to resolve some 
of the usual conceptual confusion of thermodynam- 
ics. JO 

Applications (Physics), T(17-18: 1, 2), S, P, L. 
Relativistic Quantum Mechanics: Wave Equations. 
Walter Greiner. Theoretical Phys., V. 3. Springer- 
Verlag, 1990, xvi + 345 pp, $39.50 (P). [ISBN: 0-387- 
50986-0] This fluent translation of the 1987 Ger- 
man original begins with the Klein-Gordon equation 
(for spin-0 particles) and then works through the 
Dirac equation (for spin-1/2). Although most at- 
tention is given to Dirac’s theory, other allied topics 
such as spinors and Lorentz invariance are also cov- 
ered. The final chapters are devoted to the wave 
equations for particles with higher spin. MU 
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Scient. Computing. Academic Pr, 1990, xvii + 558 
pp, $64.50. [ISBN: 0-12-741245-X] A lucid, interdis- 
ciplinary cognitive science approach to vision, with 
extensive material on paralle] and distributed pro- 
cessing and neural networks. Uses techniques from 
mathematics, physics, and neuropsychology of per- 
ception. Reviews many real applications. Extensive 
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1990, vii + 122 pp, $39.50. [ISBN: 0-387-97340-0] 
Developmental systems are models of genetic codes 
containing rules of transformation and “operating 
systems” grouped into a “generating word” that is 
capable of life-like behavior including regeneration 
after trauma, sensitivity to mutation, cloning, and 
reproduction. Offers the opportunity to simulate life 
in computers with a mathematical model. Captures 
the essence of DNA without its enormous complex- 
ity. LAS 
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Andrew Sterrett, Editor 


...the day will come, | believe, when the 
value of writing to learn will be univer- 
Sally acknowledged 


Rueben Hersh in 
Writing to Leam Mathematics and Science 


Need help in getting started as an individ- 
ual or as a member of a department fac- 
ing a Writing Across the Curriculum require- 
ment? Learn how others have made use of 
student assistants, both undergraduate and 
graduate, in ways that benefit students and 
faculty members alike. Read that feedback 
from student journals provide early-warning 
signals for instructors, as well as help stu- 
dents clarify their own thought processes. 


This collection of essays is an outgrowth of 
the widespread interest shown in sessions 
of contributed papers on writing given at 
the 1988 and 1989 Annual Meetings of the 
MAA. Many of the 30 authors of the essays 
included in this volume participated in those 
sessions and each has considerable expe- 
rience in requiring students to write about 
mathematics. 


Included in this volume are essays that: 

M remind professors how frequently math- 
ematicians, regardless of their careers, 
are asked to write, (“Mathematicians 
Write; Mathematics Students Should 
Too"), 
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ot hys 


™@ provide a theoretical framework by which 
to assess writing assignments (“Writing 
for Educational Objectives in a Calculus 
Course”), and 

™ give practical examples of assignments 
that work (“Writing in Mathematics: A 
Plethora of Possibilities”). 


This source book is filled with practical sug- 
gestions. It will enhance the comprehen- 
sion that your students have of mathemat- 
ics. 
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Howard Eves 
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Howard Eves made a valuable contri- 
bution to the Dolciani Mathematical Ex- 
position series. . . The twenty lectures 
included are a delight to read. They 
place each ’great moment” in its his- 
torical context and lay special empha- 
sis on human aspects of each achieve- 
ment. No algebra or geometry teacher 
should be without this book. 


Tom Walsh in The Mathematics Teacher 


He has the knack of bringing ancient 
mathematics alive, at the same time re- 
lating it to modern ideas and develop- 
ments. His work is readable and infor- 
mative and indeed bears reading a sec- 
ond time. His exposition is clear and 
he is not content with generalities, but 
gives actual examples of early meth- 
ods. 


E.H. Lockwood in 
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COMPLEX ANALYSIS: 
THE GEOMETRIC VIEWPOINT 


Steven G. Krantz 


Geometric methods have been used in 
complex analysis since the 1930s when 
Lars Ahlfors discovered that they give a nice 
way to look at the Schwarz lemma. Since 
that time they have become a central part of 
the research activities of complex analysis. 
However these important techniques have 
never found their way into a text accessible 
to a broad audience. 


Steven G. Krantz, a leading worker in com- 
plex analysis and a well-known mathemati- 
cal expositor, has written the first book ex- 
plaining how complex analysis can be stud- 
ied using methods of geometry. Assum- 
ing no background in Riemannian geometry, 
and only one semester of complex analysis, 
Krantz explains the role of Hermitian met- 
rics and of curvature in understanding the 
Schwarz lemma, normal families, Picard’s 
theorems, conformal mappings, and many 
other topics. A minimum of geometric for- 
malism is used to gain a maximum of ge- 
ometric and analytic insight. The climax of 
the book is an introduction to several com- 
plex variables from the geometric viewpoint. 
Poincaré’s theorem, that the ball and bidisc 
are biholomorphically inequivalent, is dis- 
cussed and proved. 
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Except for the minimal background require- 
ments, the book is self-contained. A review 
of relevant topics in the classical theory of 
one complex variable is provided. The style 
is light and inviting. The book is a must for 
anyone with an interest in complex analysis. 
Take a glance at the main chapter headings 
and order your copy today 
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readers in related fields. The purpose of the journal 
is to facilitate cross-fertilization between the theory 
and application of game-theoretic reasoning. 
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All other countries: $133.00 


Volume 3 (1991), 4 issues 
In the U.S.A. and Canada: $110.00 


Under New Editorial Direction 


Journal of 


Approximation Theory 


Editors-in-Chief 


Paul Nevai 
The Ohio State University, Columbus 


Allan Pinkus 
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Theory is devoted to new advances in pure and 
applied approximation theory and related areas, 
The journal covers classical and abstract approxi- 
mation, constructive approximation, harmonic 
analysis and Fourier expansions, function spaces 
and interpolation of operators, real and complex 
function theory, orthogonal polynomials and special 
functions, general orthogonal systems, approxi- 
mation theoretical aspects of difference and 
differential equations, multivariate approximation, 
and spline functions of one and several variables. 
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All other countries: $534.00 
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Editor 
Eberhard Knobloch 
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David E. Rowe 


Pace University, Pleasantville, New York 


Historia Mathematica \s concerned with 
the history of all aspects of the mathematical sciences 
in all parts of the world and all historical periods. The 
Journal publishes occasional biographies of mathe- 
maticians and historians, studies of organizations and 
institutions, essays on historiography, and articles on 
the interactions among all facets of mathematical 
activity and other aspects of culture and society. 


Published under the Auspices of the International Commission on the 
History of Mathematics of the Division of the History of Science of the 
International Union of the History and Philosophy of Science 


ISSN 0315-0860 
All other countries: $105.00 
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In the U.S.A. and Canada: $85.00 
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and 


Updated 


THE LAST PROBLEM 


E. T. Bell 
Revised and updated by Underwood Dudley 


What Eric Temple Bell calls the last prob- 
lem is the problem of showing that Pierre 
Fermat was not mistaken when he wrote 
in the margin of a book, almost 350 years 
ago, that 2” + y” = z” has no solution in 
positive integers when n > 3. The orig- 
inal text of THE LAST PROBLEM traced 
the problem from Babylonia in 2000 B.C. 
to seventeenth-century France. Along the 
way we learn quite a bit about history, and 
just as much about mathematics. Under- 
wood Dudley's notes bring us up-to-date on 
recent attempts to solve the problem. 


The book is unique in that it,is a biogra- 
phy of a famous problem. The book fits 
no categories. It is not a book of mathe- 
matics. Pages go by without an equation 
appearing. It is not a history of number the- 
ory because it includes too much about the 
history of the western world, and it is not 
a history of western civilization because its 
focus is on mathematics. It is too entertain- 
ing to be scholarly and contains too much 
mathematics to be widely popular. It is an 
unusual book. 


What T.A.A. Broadbent said about Bell's 
work applies to THE LAST PROBLEM. 
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His style is clear and exuberant, his 
opinions, whether we agree with them 
or not, are expressed forcefully, often 
with humor and a little gentle malice. 
He was no uncritical hero-worshipper, 
being as quick to mark the opportunity 
lost as the ground gained, so that from 
his books we get a vision of mathemat- 
ics as a high activity of the questing 
human mind, often fallible, but always 
pressing on the neverending search for 
mathematical truth. 


This is a rich and varied, wide-ranging book, 
written with force and vigor by someone with 
a distinctive style and point of view. It will 
provide hours of enjoyable reading for any- 
one interested in mathematics. 
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\\e ge Extensive descriptions of advising programs 

, CONe OT atics that work is included, along with sugges- 

Math he _acne™ tions for teaching that describe a wide range 

of instructional techniques. You will learn 

about how to use computers in your teach- 

ing, and how to evaluate your performance 

Soe MATHEMATICS as well as that of your students. 

TEACHING Every faculty member concerned about teach: 

Alan Schoenfeld, Editor. ing should read this book. Every admin- 
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Undergraduate Teaching of Mathematics mathematics programs should have a copy. 
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The Search for a Finite Projective Plane of Order 10 


C. W. H. Lam,* Computer Science Department, Concordia University, 
Montréal, Québec, Canada H3G 1M8 


CLEMENT Lam received a Ph.D. in mathematics at Caltech under Herbert 
Ryser in 1974. After graduation, he spent a year at the University of 
Calgary. In 1975, he joined the computer science department at Concordia 
University, where he is currently a professor. His research interest is in 
combinatorics, especially in the application of computers to solve mathe- 
matical problems. 


Dedicated to the memory of Herbert J. Ryser 


1. Prologue. When I was a graduate student looking for a thesis topic, Herbert 
Ryser advised me not to work on the projective plane of order 10. Even though he 
was extremely interested in this subject, he believed that it was too difficult and 
that I might get nowhere with it. I took his advice and chose another problem. 
Somehow, this problem has a beauty that fascinates me as well as many other 
mathematicians. Finally in 1980, I succumbed to the temptation and started 
working on it with some of my colleagues. We eventually managed to get some- 
where, but unfortunately, Dr. Ryser is no longer with us to hear of the final result. 
This is an expository article describing the evolution of the problem and how 
computers were used to solve it. 


2. The History of the problem. A finite projective plane of order n, with n > 0, 1s 
a collection of n* + + 1 lines and n* + n + 1 points such that 


1. every line contains n + 1 points, 

2. every point is on n + 1 lines, 

3. any two distinct lines intersect at exactly one point, and 
4. any two distinct points lie on exactly one line. 


There are several different definitions for a finite projective plane and this set 
of axioms is chosen to highlight the striking duality of lines and points. One can 
interchange the words “line” and “point” in the definition and obtain essentially 
the same axioms! The attractiveness of these objects is in their simplicity and their 
reliance on the language of geometry. One is tempted to start drawing lines on 
paper and may soon discover some simple examples. 

The smallest example of a finite projective plane is a triangle, the plane of order 
one. The smallest non-trivial example is of order 2, as shown in Fic. 1. There are 
seven points labelled from 1 to 7. There are also seven lines labelled L1 to L7. Six 
of them are straight lines but L6 is represented by the circle through the points 2, 
6, and 7. The reader is invited to verify that the axioms of a finite projective plane 
are Satisfied. 

An early reference to a finite projective plane is in the paper by Veblen [32], 
which studied the axioms for geometry and used the plane of order 2 as an 


*This work was supported in part by the Natural Sciences and Engineering Research Council of 
Canada under Grants A9373, A9413, 0011 and by the Fonds pour la Formation de Chercheurs et I’Aide 
a la Recherche under Grants EQ2369 and EQ3886. 
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L1 = {1,2,4} 

L2 = {2,3,5} 

L3 = B,4,6} 

2 6 

L4 = {4,5,7} 

L5 = {5,6, 1} 

L6 = (6,7, 2} 

4 7 5 L7 = {7, 1, 3} 


Fic. 1. The finite projective plane of order two. 


example. Veblen also proved that this plane of order 2 cannot be drawn using only 
straight lines. In a series of papers [32, 33, 34], Veblen, Bussey and Wedderburn 
established the existence of most of the planes of small orders, as well as all four 
non-isomorphic planes of order 9. One of the missing orders is n = 6. 

In 1938, Bose [4] explained why there is no projective plane of order 6. He 
related the existence of a finite projective plane of order n to the existence of a 
hyper-Graeco-Latin square. The term “Graeco-Latin square” originated with 
Euler in 1782. In our modern terminology, they are called orthogonal Latin 
squares. Let us define them. 

A Latin square of order n is an n X n matrix satisfying the following properties: 


1. all the entries are integers between 1 and n, 
2. In every row, no entry is repeated, and 
3. in every column, no entry is repeated. 


Let S, =[s] and S, = [s] denote two Latin squares of order n. They are 
said to be orthogonal provided that the n* 2-samples (s{>, s©) for i, j = 1,2,...,n 
are distinct. A simple way to visualize the definition is to put the second square, 
slightly shifted, on top of the first square. The resulting n by n array of 2-samples 
has no repeated entries, and is often referred to as a Graeco-Latin square or an 
Euler square. 

FIGURE 2 contains two examples of Latin squares of order 4. They are orthogo- 
nal and the resulting Graeco-Latin square is shown in Fic. 3. Is there a third Latin 
square orthogonal to both the squares in Fic. 2? Yes, there is one, as shown in 
Fic. 4. Can one find a fourth? The answer is no. One can easily prove the 
following theorem [26, p. 80]. 


Fic. 2. Two orthogonal latin squares of order four. 
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Fic. 3. A Graeco-Latin square of order 4. 


Fic. 4. A Latin square orthogonal to those in Fic. 2. 


THEOREM 1. Let S,,5S,,...,5S, be a set of t mutually orthogonal Latin squares of 
order n => 3. Then 


t<n-—1l. (1) 


If equality holds in (1), then the orthogonal set is said to be complete. Finally, 
we are ready to state Bose’s result [26, p. 92]. 


THEOREM 2. Let n > 3. We may construct a projective plane of order n if and 
only if we may construct a complete set of n — 1 mutually orthogonal Latin Squares 
of order n. 


Why does Bose’s result explain the non-existence of a projective plane of order 
6? It states that such a plane exists if and only if there exists a complete set of 5 
mutually orthogonal Latin squares of order 6. The possible existence of even a pair 
of orthogonal Latin squares of order 6 was a much older problem. 

In a 1782 paper [12], Euler started by stating the problem of the 36 officers. 
This problem asks for an arrangement of 36 officers of 6 ranks and from 6 
regiments in a square formation of size 6 by 6. Each vertical and each horizontal 
line of this formation is to contain one and only one officer of each rank and one 
and only one officer from each regiment. Euler denoted the 6 regiments by the 
Latin letters a, b, c, d, e, f and the 6 ranks by the Greek letters a, B, y, 6, &, ¢. 
He further remarked that the characteristic of an officer was determined by the 
two letters, one Latin and the other Greek, and that the problem consists of 
arranging the 36 combinations of two letters in a square in such manner that every 
row and every column contains the six Latin as well as the six Greek letters. This 
was the origin of the term “Graeco-Latin square.’ Euler observed that the first 
step was to arrange the Latin letters in a square so that no letter was missing either 
from any row or from any column. He called this square a Latin square. If he had 
chosen to arrange the Greek letters instead, then we would probably have Graeco 
squares rather than Latin squares. In any case, if we label both the ranks and the 
regiments from 1 through 6, then Euler’s problem reduces to the construction of a 
pair of orthogonal Latin squares of order 6. 
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Euler found no solution to this particular problem. He then conjectured that no 
solution exists if the order of the square is of the form n = 2 (mod 4). This is the 
famous Euler’s conjecture. The first case nm = 2 is trivially impossible. Tarry 
around 1900 [27] verified by a systematic enumeration that Euler’s conjecture 
holds for n = 6. Since there does not exist even a pair of orthogonal Latin squares, 
Bose’s result implies the non-existence of a projective plane of order 6. 

Yet, there is something unpleasant about a systematic hand enumeration: It is 
messy and it is error prone. Mathematicians did find a better explanation in the 
celebrated Bruck-Ryser theorem [7], which was published in 1949. 


THEOREM 3. (Bruck-Ryser) If n = 1,2 (mod 4), then a necessary condition for 

the existence of a finite projective plane of order n is that integers x, y exist satisfying 
2 2 
n=x°t+y’, 


Since 6 = 2 (mod 4) and it is not a sum of two integer squares, the Bruck-Ryser 
theorem implies the non-existence of a projective plane of order 6. How did they 
prove the theorem? We will not repeat it here, but one of the crucial steps 
involved the use of an incidence matrix. 

The incidence matrix A = [a;;] of a projective plane of order n is an n* +n + 1 
by n* +n+1 matrix where the columns represent the points and the rows 
represent the lines. The entry a,,; is 1 if point j is on line 7; otherwise, it is 0. For 
example, Fic. 5 gives the incidence matrix for the projective plane of order 2. In 


Fic. 5. An incidence matrix for the plane of order two. 


terms of an incidence matrix, the properties of being a projective plane are 
translated into: 


1. A has constant row sum n + 1, 

2. A has constant column sum n + 1, 

3. the inner product of any two distinct rows of A is 1, and 
4. the inner product of any two distinct columns of A is 1. 


The conditions on row sums and row inner products can be encapsuled in the 
following matrix equation: 


AAT =nl + J, (2) 


where A’ denotes the transpose of the matrix A, J denotes the identity matrix, 
and J the matrix of all 1’s. Every diagonal entry on the right-hand side of Eq. (2) is 
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n + 1, which implies that the inner product of any row of A with itself is m + 1 or, 
equivalently, its row sum is n + 1. Every off-diagonal entry is 1, which is the same 
as requiring the inner product of any two distinct rows of A to be 1. Ryser also 
proved that A is a normal matrix [25]; in other words, 


AA! = A‘A, 
Hence, Eq. (2) also implies the conditions regarding column sums and column 
inner products. The Bruck-Ryser theorem starts from this equation and proves 


that it implies n is a sum of two integer squares when n = 1,2 (mod 4). 
It is surprising that the Bruck-Ryser theorem has a partial converse [13]. 


THEOREM 4. If n = 0,3 (mod 4) or if n = 1,2 (mod 4) and n = x? + y’, then 
there exists a rational matrix A satisfying equation (2). 


Of course, if the matrix A is actually the incidence matrix of a projective plane, 
then it must have entries which are either 0 or 1. Although this theorem guaran- 
tees only a matrix with rational entries, it suggests that the necessary part of the 
Bruck-Ryser theorem is very close to being also sufficient. 

Projective planes are special cases of a class of combinatorial objects called 
symmetric block designs. We are not going to discuss block designs, except to 
mention that Chowla and Ryser have generalized the Bruck-Ryser theorem to 
symmetric block designs [10], which it is now known as the Bruck-Ryser-Chowla 
theorem. Here again, a partial converse exists, providing more credence to the 
hope that the conditions in the Bruck-Ryser-Chowla theorem are both necessary 
and sufficient. This hope is now shattered by the non-existence of the finite 
projective plane of order 10. 

Let us return to the history of projective planes. Now that we have a good 
explanation of the nonexistence of a plane of order 6, what is the next unknown 
case? It is n = 10. Since 10 = 1* + 37, a plane of order 10 would exist if the 
necessary condition of the Bruck-Ryser theorem is also sufficient. On the other 
hand, 10 = 2 (mod 4), and so, if one believes Euler’s conjecture, then it does not 
exist. 

First, Euler’s conjecture was shown to be false. In 1959, Bose and Shrikhande 
[5] constructed a pair of orthogonal Latin squares of order 22. Then Parker [23, 24] 
constructed a pair of orthogonal Latin squares of order 10. Together they showed 
that Euler’s conjecture is false for all orders greater than six [6]. This raised the 
hope for the existence of a plane of order 10. 

History indicated that significant advances were made when one branch of 
mathematics was shown to be related to a different branch of mathematics. It is 
not surprising that the beginning of the end of the plane of order 10 occurred 
when people started studying the binary error-correcting code associated with it. 


3. The beginning of the end. Let A be the incidence matrix of a finite 
projective plane of order 10. Let V be the vector space generated by the rows of A 
over F,, the finite field with two elements {0,1}. A vector in V is called a 
codeword. The weight of a codeword is the number of 1’s in the codeword. Let w, 
be the number of codewords of weight i. We define the weight enumerator of V to 


be 
111 


yw, x'. 
i=0 
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In 1970, Assmus gave a talk at Oberwolfach entitled ‘““The Projective Plane of 
Order Ten?” which discussed the properties of V. After the talk, there was a lot of 
anticipation about this new approach. Maybe one could derive a contradiction such 
as showing that one of the weights is non-integral or negative. Unfortunately, no 
such simple contradictions were found. 

However, there were several important advances. Assmus and Mattson showed 
[2] that the weight enumerator is uniquely determined by w,, w,5, and w,.. Since 
their report is not readily available in most libraries, we shall refer to the paper of 
MacWilliams, Sloane, and Thompson [22], which proved many of the same results. 
One of the many innocuous but extremely useful results was: 


THEOREM 5. Let I be any line of the plane, and v any codeword of V. Then 
lu A Z| = |v| (mod 2). 


For example, every line must intersect a codeword of even weight in an even 
number of points. This gives us an extra condition beyond what is available from 
the definition of a projective plane. Roughly speaking, this condition reduces the 
number of possibilities for each line by half. The cumulative effect of this condition 
is tremendous and it is the main reason that makes an exhaustive search possible. 

Furthermore, MacWilliams et al. showed that w,, = 0 after using about 3 hours 
of computer time on a General Electric 635. Bruen and Fisher later showed in [8], 
that w,, = 0 also followed from an earlier computer result by Denniston [11]. 
However, the method of MacWilliams, et al., illustrated how to continue attacking 
the problem. This can be summarized as follows: 


Given any weight i, we assume that a codeword of weight i exists. By 
considering the intersection patterns of a few selected lines with the 7 
points of this codeword, we arrive at a small number of starting 
configurations, each corresponding to a submatrix of the incidence 
matrix. Then, we try to complete the rest of the incidence matrix. If we 
succeed, then it is time to celebrate because we have constructed a 
plane. If none of the starting configurations can be so completed, then 
the plane of order 10 does not contain any codeword of weight i and 
w; =0. 


This method requires first the generation of all the possible starting configura- 
tions. A good reference is the 1980 paper [14] by Marshall Hall, Jr., which analyzed 
in detail the starting configurations for codewords of small weight i < 20. Given a 
starting configuration, the attempt to complete it is basically a backtracking 
process. The term ‘‘backtrack” was coined by D. H. Lehmer in the 1950’s, but 
backtrack techniques have been used to solve puzzles for a long time. It is a 
tedious and lengthy task, one that is best suited for a computer. 


3.1 Backtrack search using a computer. The first description of a generalized 
backtrack search algorithm was in [35]. Let us first define a restricted version of 
the search problem. 


Given a collection of sets of candidates C,,C,,C3,...,C,, and a 
boolean compatibility function P(x, y) defined for all x @C, and 
y &C,, find an m-tuple (x,,..., x,,) with x; © C; such that P(x,, x;) is 


true for all i #/. 
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The m-tuple satisfying the above condition is called a solution. The term compati- 
bility was first introduced by Carter in 1974 [9]. This version is restricted because 
the compatibility function is only defined on ordered pairs rather than on all 
k-tuples, 1 <k < m. 

For example, if we take m = n* + n + 1 and let C, be the set of all candidates 
for column i of the incidence matrix of a projective plane, then P(x, y) can be 
defined as 


P(x,y) = (me if<x,y)=1 
false otherwise, 


where (x, y) denotes the inner product of columns x and y. A solution is then a 
complete incidence matrix. 

In a backtrack approach, we generate k-tuples with k<m. A_ k-tuples 
(x1,...,X,) is a partial solution at level k if P(x;, x,) is true for all i # j < k. The 
basic idea of the backtrack approach is to extend a partial solution at level k to 
one at level k + 1, and if this extension is impossible, then to go back to the 
partial solution at level k — 1 and attempt to generate a different partial solution 
at level k. 

A nice way to organize the information inherent in the partial solutions is the 
backtrack search tree. Here, the empty partial solution ( ) is taken as the root, and 
the partial solution (x,,...,x,_,,X,) is represented as the child of the partial 
solution (x,,...,x,_,). Following the computer science terminology, we often 
called a partial solution a node. It is often true that the computing cost of 
processing a node is independent of its level k. Under this assumption, the total 
computing cost of a search is equal to the number of nodes in the search tree times 
the cost of processing a node. 

The number of nodes in the search tree can often be reduced by using the 
symmetry or property-preserving operations. If A is the incidence matrix of a 
projective plane, then the operations of permuting the rows and permuting the 
columns of A correspond only to reordering the lines and points of the plane. 
These operations preserve the property of being a projective plane. To see how 
they reduce the size of a search tree, consider the plane of order 10. There are 


Ga = 4.7 x 10" choices for the first column, corresponding to the number of 
ways of placing 11 ones in the 111 rows. By using the row permutations, we can 
assume that all these ones are placed on the first 11 rows, reducing the number of 
partial solutions (x,) from 4.7 x 10'* to 1. Mathematicians love to use the phrase 
“without loss of generality” to indicate a simplification by symmetry operations. So 
without loss of generality, the second column has only one choice—with a one in 
the first row and the remaining 10 ones in rows 12 to 21. Now, row 1 has nine 
remaining ones. By permuting columns, we can assume that these remaining ones 
of row 1 are in columns 3 to 11. Next, by row permutation, the remaining 10 ones 
of column 3 can be placed in rows 22 to 31. Continuing in this manner, it is not 
difficult to show that there is only one choice up to column 21. Beyond this point, 
symmetry operations are difficult to visualize, because they often involve combina- 
tions of row and column permutations. 

This process of reducing the search tree by using symmetry operations is called 
isomorph rejection. Two partial solutions are said to be isomorphic if there is a 
symmetry operation mapping one to the other. To determine whether a solution 
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exists, it is only necessary to extend the non-isomorphic partial solutions, because 
if a partial solution can be extended to a complete solution, then every other 
isomorphic partial solution can also be so extended. The testing of whether two 
partial solutions are isomorphic is called isomorphism testing. The set of symmetry 
Operations mapping a partial solution to itself forms an automorphism group. In 
the context of projective planes, an automorphism of a complete solution is also 
called a collineation. 

The efficient use of the symmetry operations is one of the most difficult 
problems in a backtrack search. It is both a blessing and a curse—a blessing 
because there is always hope of further optimization and a curse because it is the 
source Of many programming errors. However, following the method of 
MacWilliams et al., it is clear that the existence question of the plane of order 10 is 
headed towards a computer-based solution and that symmetry consideration is a 
necessary tool for efficiency reasons. 

In his 1974 Ph.D. thesis, Carter picked up where MacWilliams et al. had left off. 
He showed that there are six possible starting configurations associated with a 
codeword of weight 16. By using the computer, he managed to eliminate four of 
the cases, as well as one subcase of the fifth. He used a total of about 100 hours of 
computer time, mostly on a CDC 6600 at the Institute for Defense Analyses (IDA) 
in Princeton, New Jersey, and also on a CDC 7600 at the Lawrence Radiation 
Laboratories in Berkeley, California. His search investigated about 3 x 10° nodes, 
or about 10° nodes per second. 

This was the situation when we entered the picture. While we knew w,, = 0, 
the search for weight 16 codewords was about three-quarters done and the search 
for weight 12 codewords was presumed to be too difficult. The person who 
suggested the problem to us was John Thompson. 


4. The home stretch. Around 1980, we purchased a share in a VAX-11/780 
with the intention of running long mathematical programs during the off-hours 
when it was otherwise unused. In the same year, Thompson sent John McKay two 
papers [28, 29] outlining a connection between a codeword of weight 12 and a set 
of fixed-point free involutions. When McKay showed me the papers, I was finally 
hooked. We had a computer looking for a problem and the search for codewords 
of weight 12 was a problem looking for a computer. I started by writing a simple 
computer program to play with the ideas. Little did I know that it would become a 
9 year undertaking! 

We realized very early that the weight 12 case was going to take a lot of 
computer time. So I showed my simple program to Larry Thiel, who has a 
reputation of making any computer program run faster. Right away, he saw ways of 
speeding it up. He and Stanley Swiercz wrote most of our computer programs. 

We realized also that it is important to estimate how long a computer search 
would take. It would be useless to speed up a program by a factor of ten, if the 
resulting program still requires 100 years to run. However, speeding it up by a 
factor of 1000 would make the search feasible in our environment. By adapting 
Knuth’s Monte Carlo method [16], we estimated that the search tree had about 
4 x 10!' nodes. Hoping that we could process 10° nodes per second, we arrived at 
an estimate of 50 days of computer time. 

I was going to present the final results at the Ninth Australian Conference on 
Combinatorial Mathematics in 1981, but the program development was slower 
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than expected. At the conference, I had to quickly change my plan and talked 
about a feasibility study of such a search [17]. The program was finally developed 
and the search finished late in 1982. We did not find a completed incidence matrix 
and so, w,, = 0 [18]. Ryser was happy with the result and encouraged us to 
continue. Hall was both excited and pessimistic. He wrote, “For the first time, I 
doubt that a plane of order 10 exists.” 

The search for weight 12 codewords took 183 days instead of the predicted 50. 
The main reason for the discrepancy was that we could process only about 3 x 104 
nodes per second instead of 10°. Yet, we learned a lot about the planning of the 
search, the estimation process, and the optimization techniques—knowledge which 
became useful in our later programs. In retrospect, we probably could have 
reduced the size of the search tree by another factor of 10 and come within the 
predicted time. 

Even before the search for the weight 12 codeword was finished, we were 
looking for a way to solve the whole problem. One possibility is to determine w,, 
and then the weight enumerator. Unless the weight enumerator provides a 
contradiction, we still have to perform another search. Hall in [14] proved that if 
W,. < 1211, then there exist primitive weight 20 codewords, which intersect the 
lines of the plane in at most four points. We might be able to skip the step of 
finding w,,. if we could go directly to weight 20. Unfortunately, we obtained an 
estimate of at least 10'° weight 20 starting configurations. Even though, as Hall 
remarked, it required only a few minutes of computer time to try each starting 
configuration, a few minutes times 10'° equaled 10’ years—clearly beyond our 
capability. There was no short cut which could avoid computing the weight 
enumerator. 

During my visit to the University of Cambridge in 1982, Thompson showed me 
Carter’s thesis and suggested that we might be able to finish the remaining cases. 
It was the first time I had read his thesis and I was surprised by how familiar his 
computing techniques were. We had rediscovered a lot of what was already in his 
thesis! Thompson also made me promise to redo Carter’s work on weight 16, just 
to have an independent verification of its correctness. Although I have yet to do it, 
I still intend to fulfil this promise. 

In the mean time, Larry had developed an extremely useful prototyping pro- 
gram, called NPL. It had a very humble beginning. Working with incidence 
matrices meant that we often had to draw them and fill in large numbers of zeros 
and ones. As a labour saving move, we started drawing templates and then making 
photocopies. Larry took this one step further, printing the matrices with a 
computer rather than drawing the templates by hand. Soon we included the 
capability of initializing the starting configurations and later, the ability to back- 
track was also included. Before long, it became a full-blown prototyping program, 
capable of performing estimations and adapting to new configurations. It turned 
out to be an extremely useful tool. We could quickly explore an idea for an 
“improvement” to the search. Ideas that turned out to increase the required CPU 
time were discarded. We could use it to plan our final search in detail, without 
having to write the program first. The most exciting stage was this planning stage. 

Using NPL, we found a feasible way to solve the remaining cases of Carter. 
Compared to Carter’s work, we found a better search order and a better use of the 
symmetry, both reducing the size of the search tree. Carefully optimised programs 
to implement the planned attack were written for each of the remaining cases. 
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Running on two VAX computers, it took the equivalent of 80 days of computing 
on a VAX-11/780 [19]. No completed incidence matrix was found and hence 
Wig = 0. 

Now that w,, w,; and w,, were all known to be zero, we could compute the 
weight enumerator of the binary error-correcting code for the plane of order 10 
[20]. What catches the eye is that w,. = 24,675. If one follows the same method of 
finding starting configurations and trying to extend them, then one will either 
construct the plane or show that it does not exist. 

Even before we finished our search for the weight 16 case, Thompson was 
already busy at work. In a letter dated October 24, 1983, he enclosed a copy of his 
unpublished note [30] with a list of 82 starting configurations for the weight 19 
case. He constructed them by hand and had not checked for pairwise inequiva- 
lence. He ended the letter with “I hope you’ll press on with this question, although 
perhaps I am too greedy.” 

It was a challenge that we could not refuse. I quickly wrote up a program and 
found 65 starting configurations. Larry also upgraded his NPL program to include 
isomorphism testing and found 66 configurations! After cross checking our results, 
I found that his numbers were correct. We also reconciled our numbers with those 
of Thompson. At last, we could see the finish line at a distance. 

By simple counting arguments, 17 of the 66 cases can easily be eliminated. Four 
other cases are eliminated in a more ad hoc manner. Scott Crossfield, working with 
me as an NSERC Summer Undergraduate Assistant, obtained estimates for each 
of the remaining 45 cases. Our preliminary results indicated that the problem 
could be solved with another two years of computing time. These arguments and 
estimation results were presented in [19]. 

We decided to solve these cases by starting with the easy ones and hoping to 
discover better methods for the more difficult ones. We hedged our preliminary 
estimates by saying that it might be too optimistic. Our estimate of how fast the 
computer could test the acceptability of a column was based on our experience of 
the weight 12 and 16 cases. For weight 19, this test was much slower for technical 
reasons related to the fact that 19 is an odd number whereas the other two 
numbers are even. We could proceed only about 60 nodes per second, giving an 
estimate of 100 years to finish the search. To make the search feasible, we needed 
to speed it up by another factor of 100. The only possibility left was to use a faster 
computer, bringing to mind a supercomputer. Fortunately, our method could be 
easily adapted to use the vectoring capability of a supercomputer [31]. It was time 
to ask for help. Both Hall and Thompson suggested that IDA might be willing 
because it had helped Carter previously. After some discussion with Nick 
Patterson, the deputy director of the Communication Research Division at IDA, 
he agreed to run our yet undeveloped program on their CRAY-1A as the lowest 
priority job using up the otherwise idling computer. He and Douglas Wiedemann, 
who was on leave from IDA pursuing graduate studies at the University of 
Waterloo, gave us many suggestions on the efficient usage of the CRAY. In 
addition, Patterson looked after the day-to-day running of the program for over 
two years. He was the unsung hero in the successful completion of our work. 

In the meantime, Concordia had acquired more and more VAX computers. 
Soon, we were running our program on five different computers. We figured that 
there was enough computing resources at Concordia to solve all but a few of the 
most difficult cases. So we targeted the CRAY program for only one type of 
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starting configuration which contained all the most difficult cases. The other 
configurations were left for our VAX computers to solve. All the cases at 
Concordia finished around January 1987 and took the equivalent of 800 days of 
VAX-11/780 computer time. 

There are many considerations that go into developing a computer program 
which runs for months and years. Interruptions, ranging from power failures to 
hardware maintenance, are to be expected. A program should not have to restart 
from the very beginning for every interruption; otherwise, it may never finish. To 
solve the restarting problem, we partitioned our starting configurations into 
smaller cases and a message was output onto a log file every time one of them was 
finished. If there was an interruption, we just looked up the last completed case 
and continued from that point. For our programs, a convenient subdivision was at 
the ““A2” boundary; more details can be found in [19, 21]. The messages written on 
the log file also contained statistics about the run to allow comparison and 
verification of the results. 

In order to avoid any unpleasant surprises involved in running programs at a 
long distance, we also put in a number of safety checks. Most of them were related 
to testing whether the sizes of the internal data structures were large enough. For 
lack of memory space, one could not merely allocate the largest expected size for 
every structure. Instructions were given on how to selectively increase the sizes. 
Some data structures were so specialized that their size could not be increased. We 
temporarily ignored the problem by deciding to handle them later, if necessary. Of 
course, we hoped that it would not be necessary. 

We finished developing our CRAY program in 1986 and we were relieved to 
obtain a final estimate of about 3 months of CRAY computer time. It started 
running at IDA in the fall of 1986 and was to await completion in two to three 
years. 


5. The finish line. On November 11, 1988, Patterson called and said that the 
run was finished. Always careful, he proposed to verify that there was a record of 
each subcase before sending the results on a magnetic tape back to us. Somehow, 
the news of the end of the plane of order 10 was spread around the combinatorics 
community and we were deluged with inquiries. On November 18, Patterson called 
again with some bad news. There was an error number 4 for one of the A2’s. What 
is an error number 4? It had been two years since we last looked at our CRAY 
program. Taking out an old listing, it took us a while to determine that error 
number 4 was a size problem for a data structure that could not be enlarged. This 
A2 could not be solved with the existing CRAY program! If it had been able to 
handle this A2, it would have taken about 30 seconds. The only other program 
around that could handle this case was our slow but adaptable NPL program. 
After further dividing this A2 into about 200 smaller subcases, it took NPL one 
day to solve one of them, giving an estimate of 200 days just to solve this A2! 
Meanwhile, Barry Cipra from Science called and said he would like to write an 
article about the non-existence of a plane of order 10. What should we tell 
everybody? 

We decided that it was prudent to solve the problem first. We found that, with a 
small modification of the CRAY program, it could handle all but one of the 
subcases. So, we ran this offending subcase using NPL and bypassed the CRAY. 
The plane of order 10 was again dead on November 29, 1988. 
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There was a surprising amount of public interest in the non-existence of a finite 
projective plane of order 10. Besides Science, it was also reported in the New York 
Times and in Scientific American. One often asked question is, “How much 
computer time it took on the CRAY?” Unfortunately, we did not keep track of the 
computer time used on the CRAY, because we thought it was a useless figure 
other than demonstrating the difficulty of the problem. We were more interested 
in keeping statistics that might be useful for a future verification by someone else. 
So, we had to make an educated guess and said 3,000 hours. Later, Patterson 
suggested that it was probably closer to 2,000 hours. So, our CRAY program was 
proceeding nodes at a rate of about 2 x 10* nodes per second. 

After the burst of publicity, we finally managed to read the magnetic tape 
containing the statistics. To our horror, we found another A2 with an error 
number 4. However, we knew exactly what to do this time and there was no panic. 
It was handled exactly the same way as the previous one. By the end of January 
1989, the plane of order 10 was dead a third and, hopefully, the final time. 


6. Is this really the end? ‘This is not the first time that a computer has played 
an important role in “proving” a theorem. A notable earlier example is the 
four-color theorem [3]. Yet, these are not proofs in the traditional mathematical 
sense. It is impossible for any human being to check through all the calculations. 
From personal experience, it is extremely easy to make programming mistakes. We 
have taken many precautions, including the use of two different programs to cross 
check selective sample cases and the checking of internal consistency when 
isomorphism testing is performed. Yet, I want to emphasize that this is only an 
experimental result and it desperately needs an independent verification, or better 
still, a theoretical explanation. 

There is, moreover, the possibility of an undetected hardware failure. A 
common error of this type is the random changing of bits in a computer memory, 
which could mean the loss of a branch of the search tree. This is the worst kind of 
hardware error, because we might lose solutions without realizing it. The CRAY-1A 
is reported to have such errors at the rate of about one per one thousand hours of 
computing. At this rate, we expect to encounter two to three errors! We did 
discover one such error by chance. After a hardware problem, Patterson reran the 
1,000 A2’s just before the failure and the statistics have changed for the A2 
processed just prior to the malfunction. How should one receive a “proof” that is 
almost guaranteed to contain several random errors? 

Unfortunately, this is unavoidable in a computer-based proof—it is never 
absolute. However, despite this reservation, we argued in [21] that the possibility of 
hardware errors leading us to a wrong conclusion is extremely small. Since each 
A2 is in a separate run and there are about half a million non-isomorphic A2’s, 
the probability of one random hardware error affecting one specific A2 is about 
2 x 107°. Suppose we accept that the weight enumerator is correct. Then if an 
undiscovered plane of order 10 exists, it would contain 24,675 weight 19 code- 
words. The 19 points in each such codeword give rise to an A2. Since we have 
searched through all non-isomorphic A2’s, we must have encountered these 
special A2’s. If all these 24,675 special A2’s are isomorphic, then only one out of 
about half a million non-isomorphic A2’s can be extended to the undiscovered 
plane. Even under this assumption, the probability of this special A2 being 
affected by two or three undetected hardware errors is less than 107°. Is it likely 


1991] THE SEARCH FOR A FINITE PROJECTIVE PLANE OF ORDER 10 317 


that all 24,675 A2’s arising from an undiscovered plane are isomorphic? Since the 
plane is known to have a trivial collineation group [1, 15, 36], it is more likely that 
there are two or more non-isomorphic A2’s amongst the 24,675 cases. In this 
situation, the probability of hardware errors affecting all of them is infinitesimal. 
The same argument can be used even if we do not assume the correctness of 
previous computer-based results. Basically, the argument depends on the observa- 
tion that if a plane of order 10 exists, then it can be constructed from many 
different starting points. Random hardware failures are unlikely to eliminate all of 
them. In other words, the fact that no one has yet constructed one is a very strong 
indication that it does not exist. 


7. Epilogue. While we were tracing the origin of the existence problem of the 
plane of order 10, we talked to Dan Hughes, who has worked in this area for a 
long time and is famous for the Hughes planes which are named after him. He 
recounted the following story. In about 1957, at a Chinese restaurant in Chicago, 
Reinhold Baer, another mathematician well known for his work in group theory 
and projective planes, was trying to impress the younger Hughes by remarking that 
if the plane of order 10 was settled by a computer, he hoped not to be alive to see 
it. Baer got his wish but I do not think Herb Ryser shared this opinion. Ryser was 
happy that the weight 12 case was settled by a computer. I can only extrapolate 
and hope that he would also be happy that the whole problem has been “‘settled,”’ 
even if by a computer. 
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The Fiftieth William Lowell Putnam Mathematical Competition 


LEONARD F. K.osinsk1, Santa Clara University 
GERALD L. ALEXANDERSON, Santa Clara University 
Loren C. Larson, St. Olaf College 


The following results of the fiftieth William Lowell Putnam Mathematical 
Competition, held on December 2, 1989, have been determined in accordance with 
the governing regulations. This annual contest is supported by the William Lowell 
Putnam Prize Fund for the Promotion of Scholarship, left by Mrs. Putnam in 
memory of her husband, and is held under the auspices of the Mathematical 
Association of America. 

The first prize, $5,000, was awarded to the Department of Mathematics of 
Harvard University. The members of the winning team were: Jeremy A. Kahn, 
Raymond M. Sidney, and Eric K. Wepsic; each was awarded a prize of $250. 

The second prize, $2,500, was awarded to the Department of Mathematics of 
Princeton University. The members of the winning team were: David J. Grabiner, 
Mathew D. Mullin, and Rahul V. Pandharipande; each was awarded a prize of 
$200. 

The third prize, $1,500, was awarded to the Department of Mathematics of the 
University of Waterloo. The members of the winning team were: Grayden 
Hazenberg, Stephen M. Smith, and Colin M. Springer; each was awarded a prize 
of $150. 

The fourth prize, $1,000, was awarded to the Department of Mathematics of 
Yale University. The members of the winning team were: Bruce E. Kaskel, 
Andrew H. Kresch, and Sihao Wu; each was awarded a prize of $100. 

The fifth prize, $500, was awarded to the Department of Mathematics of Rice 
University. The members of the winning team were: Hubert L. Bray, John W. 
McIntosh, and David S. Metzler; each was awarded a prize of $50. 

The six highest ranking individual contestants, in alphabetical order, were 
Christo Athanasiadis, Massachusetts Institute of Technology; William P. Cross, 
California Institute of Technology; Andrew H. Kresch, Yale University; Colin M. 
Springer, University of Waterloo; Ravi D. Vakil, University of Toronto; and Sihao 
Wu, Yale University. Each of these was designated a Putnam Fellow by the 
Mathematical Association of America and awarded a prize of $500 by the Putnam 
Prize Fund. 

The next four highest ranking individuals, in alphabetical order, were William 
Chen, Washington University, St. Louis; Jordan S. Ellenberg, Harvard University; 
David J. Grabiner, Princeton University; and Raymond M. Sidney, Harvard 
University. Each was awarded a prize of $250. 

The following teams, named in alphabetical order, received honorable mention: 
California Institute of Technology, with team members William P. Cross, Robert 
Southworth, and Glenn Tesler; the University of California, Berkeley, with team 
members Warren M. Lam, Jordan Lampe, and Jonathan C. Ruyle; Carnegie 
Mellon University, with team members Petros I. Hadjicostas, Eric Lauer, and Karl 
M. Westerberg; the University of Chicago, with team members Justin Boyan, 
Doyle Tanner, and Andrew Yeh; and Stanford University, with team members 
John C. Loftin, Greg G. Martin, and Christopher S. Wilson. 
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Honorable mention was achieved by the following thirty-five individuals named 
in alphabetical order: Edgar D. Bailey, Jr., University of Texas, Austin; Daniel J. 
Bernstein, New York University; Hubert L. Bray, Rice University; Nicholai I. 
Chaudarov, Brandeis University; Andrew Chou, Massachusetts Institute of Tech- 
nology; Michael Cole, Hofstra University; Matthew M. Cook, University of Illinois, 
Champaign-Urbana; Daniel P. Cory, Stanford University; Kevin B. Ford, Califor- 
nia State University, Chico; Michael L. Hutchings, Harvard University; Jeremy A. 
Kahn, Harvard University; Jordan Lampe, University of California, Berkeley; 
Gregory D. Landweber, Princeton University; Daniel D. Lee, Harvard University; 
Daniel I. Lieberman, Princeton University; Sanjoy S. Mahajan, Stanford Univer- 
sity; Russell A. Manning, California Institute of Technology; David M. Maymudes, 
Harvard University; John W. McIntosh, Rice University; Christopher J. Monsour, 
University of Maryland, College Park; Rahul Vijay Pandharipande, Princeton 
University; Eric M. Rains, Case Western Reserve University; Stephen A. 
Smith, University of Waterloo; Steven M. Stadnicki, Clarkson University; Andras 
Vasy, Stanford University; Eric H. Veach, University of Waterloo; Nikolai I. 
Weaver, Harvey Mudd College; Eric K. Wepsic, Harvard University; Karl M. 
Westerberg, Carnegie Mellon University; David B. Wilson, Massachusetts Institute 
of Technology; Michael P. Wolf, Harvard University; Ali F. Yegulalp, Columbia 
University; Andrew Yeh, University of Chicago; Zhaoliang Zhu, Yale University; 
and Michael E. Zieve, Harvard University. 

The other individuals who achieved ranks among the top 94, in alphabetical 
order of their schools, were: University of Alberta, Graham C. Denham, Brendan 
M. Mumey; University of British Columbia, Wayne J. Broughton, Gregory F. 
Wellman; Brown University, Martin M. Wattenberg; California Institute of Tech- 
nology, Tien-Yee Chiu, Michael G. Greenblatt, Gwoho H. Liu, Robert G. 
Southworth; University of California, Berkeley, Bryan F. Clair, Wee-Lang Heng; 
University of California, Santa Barbara, John C. Carey; University of California, 
San Diego, Franz J. Wrasidlo; University of Chicago, Adrian Tanner; Duke 
University, Will A. Schneeberger; Emory University, Charles D. McDonell; Geor- 
gia Institute of Technology, Jeffrey W. Herrmann; Harvard University, J. Stewart 
Burns, Roland B. Dreier, Daniel Eric Gottesman, F. Dean Hildebrandt, Michael 
C. Jablecki, Michael D. Mitzenmacher, Matthew C. Weiner; University of Mary- 
land, College Park, Eric M. Boesch, Jay P. Chawla; Massachusetts Institute of 
Technology, Todd W. Rowland; Michigan State University, Jacob R. Lorch; 
Princeton University, Timothy Y. Chow, Paul E. Ericksen, William P. Minicozzi, 
Matthew D. Mullin; Reed College, Nathaniel P. Thurston; Rice University, John 
Kenneth Burton, Jr., David S. Metzler; Simon Fraser University, Richard S. Kiss; 
Stanford University, Thomas H. Chung, John C. Loftin, Greg G. Martin; Univer- 
sity of Texas, Austin, Bryan W. Taylor; Trinity College, Hartford, Marshall A. 
Whittlesey; Washington University, St. Louis, Adam M. Costello, Jordan A. 
Samuels, Peter S. Shawhan; University of Waterloo, Graydon H. Hazenberg, 
Marc S. Ordower, Giuseppe Russo; Wellesley College, Yihao L. Zhang; and the 
University of Wisconsin, Madison, Cavan C. Fang. 

There were 2392 individual contestants from 373 colleges and universities in 
Canada and the United States in the competition of December 2, 1989. Teams 
were entered by 288 institutions. 
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The Questions Committee for the fiftieth competition consisted of Gerald A. 
Heuer (Chair), Paul R. Halmos, and Kenneth A. Stolarsky; they composed the 
problems listed below and were most prominent among those suggesting solutions. 


PROBLEMS 


Problem A-1. How many primes among the positive integers, written as usual in 
base 10, are such that their digits are alternating 1’s and 0’s, beginning and ending 
with 1? 


Problem A-2. Evaluate [¢/bem***" 7° dy dx, where a and b are positive. 
Problem A-3. Prove that if 


11z'° + 10iz? + 10iz — 11 = 0, 
then |z| = 1. (Here z is a complex number and i” = —1.) 


Problem A-4. If @ is an irrational number, 0 < a < 1, is there a finite game 
with an honest coin such that the probability of one player winning the game is a? 
(An honest coin is one for which the probability of heads and the probability of 
tails are both 1/2. A game is finite if with probability 1 it must end in a finite 
number of moves.) 


Problem A-5. Let m be a positive integer and let ¥ be a regular (2m + 1)-gon 
inscribed in the unit circle. Show that there is a positive constant A, independent 
of m, with the following property. For any point p inside Y there are two distinct 
vertices v, and v, of & such that 


A 


\lp —vy|-—|p —v,||< — - —. 
m m 


Here |s — t| denotes the distance between the points s and f. 


Problem A-6. Let a = 1+ a,x +a,x* +--+: be a formal power series with 
coefficients in the field of two elements. Let 
if every block of zeros in the binary 
1 expansion of nm has an even number 
” of zeros in the block, 


0) otherwise. 


(For example, a3, = 1 because 36 = 100100,, and a,, = 0 because 20 = 10100,.) 
Prove that a +xa+1=0. 


Problem B-1. A dart, thrown at random, hits a square target. Assuming that any 
two parts of the target of equal area are equally likely to be hit, find the probability 
that the point hit is nearer to the center than to any edge. Express your answer in 
the form (avb + c)/d, where a,b, c,d are positive integers. 


Problem B-2. Let S be non-empty set with an associative operation that is left 
and right cancellative (xy =xz implies y = z, and yx = zx implies y =z). As- 
sume that for every a in S the set {a": n = 1,2,3,...} is finite. Must S be a 
group? 
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Problem B-3. Let f be a function on [0, ©), differentiable and satisfying 
f'(x) = —3f(x) + 6f(2x) 


for x > 0. Assume that | f(x)| < e~ for x > 0 (so that f(x) tends rapidly to 0 as 
x increases). For n a non-negative integer, define 


by = fo x"f(x) ds 


(sometimes called the nth moment of f). 

a. Express yw, in terms of po. 

b. Prove that the sequence {u,,(3"/n!)} always converges, and that the limit is 0 
only if uw, = 0. 


Problem B-4. Can a countably infinite set have an uncountable collection of 
non-empty subsets such that the intersection of any two of them is finite? 


Problem B-5. Label the vertices of a trapezoid T (quadrilateral with two parallel 
sides) inscribed in the unit circle as A, B,C, D so that AB is parallel to CD and 
A, B,C, D are in counterclockwise order. Let s,, s, and d denote the lengths of 
the line segments AB, CD, and OE, where E is the point of intersection of the 
diagonals of 7, and O is the center of the circle. Determine the least upper bound 
of (s, — s,)/d over all such T for which d ¥ 0, and describe all cases, if any, in 
which it is attained. 


Problem B-6. Let (x,,%,5,...,xX,) be a point chosen at random from the 
n-dimensional region defined by 0 <x, <x, < ++: <x, < 1. Let f be acontinu- 
ous function on [0,1] with f() = 0. Set x, =0 and x,,, =1. Show that the 
expected value of the Riemann sum 


» (X41 —X) f(%i41) 


is {,f(t)P(t) dt, where P is a polynomial of degree n, independent of f, with 
0<P(t)<1for0<t<1l. 


SOLUTIONS 


In the 12-tuples (1,9, M9, ..., M9, N_1) following each problem number below, n,; 
for 10 > i > 0 is the number of students among the top 199 contestants achieving i 
points for the problem and n_, is the number of those not submitting solutions. 


A-1 (92, 2, 6, 7, 0, 0, 0, 9, 2, 8, 41, 32) 
Solution. One. Let N, be the number 
101010...101 


with exactly k digits equal to 0. If k = 1, so that N, = 101, then N, is prime. All 
other N,’s are composite, as the following reasoning shows. If k is odd, then 101 
divides N,. If k is even, then 


11N, =R°S, 


where R is the number consisting of k + 1 digits all equal to 1 and S is the 
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number with k + 2 digits beginning and ending with 1 and having only 0’s in 
between. One of the numbers R and S divides N,. 


A-2 (141, 6, 29, 0, 0, 0, 0, 0, 5, 7, 4, 7) 


Solution. Divide the region into two parts by the diagonal line ay = bx to get 


fo fremmenen dy dx = [freer dy dx + [per dx dy 


= [re ax + [—e"” ay 
0 a 0 b 


A-3 (13, 4, 4, 2, 0, 0, 0, 0, 6, 8, 56, 106) 


11 — 10iz 
Solution. We have z? = ——————. If z =a + bi, then 
1lz + 10i 
2 11 — 10iz 11? + 220b + 107(a* + b*)| _ f(a,b) 
° - liz+10i} YV [112(a2 +b?) + 22064107] g(a,b)' 


If a* + b? > 1, then g(a, b) > f(a, b), making |z?| < 1, a contradiction. If a? + b? 
< 1, then f(a, b) > g(a, b), making |z’| > 1, a contradiction. Thus, |z] = 1. 


Second Solution. Rouché’s Theorem states that if |g(z) — f(z)| < |f(z)| for z 
on |z|=a then g and f have the same number of zeros inside |z| = a. Let 
e(z) = 11z'° + 10iz? + 10iz — 11 and f(z) = 10iz — 11. Then |g(z) — f(z)| = 
11 z!° + 10iz°| = |z]? li] [11z + 10i] < |10iz — 11] if |z| < 1. Now f(z) has its only 
root at 11/10i so that g(z) has no zeros inside |z| < 1. If z is a zero of g and 
|z| > 1, then —1/z is also a zero of g (which we’ve seen doesn’t happen), and 
therefore, the only zeros of g are on the circle |z| = 1. 


A-4 (54, 22, 16, 4, 0, 0, 0, 0, 4, 2, 53, 44) 


Solution. Yes. Write the number 


in its binary representation, 
a, = Oorl, n=1,2,3,..., 


and play the game as follows. Keep tossing the coin, recording the result, say 6,, of 
the nth toss as 0 when it falls tails and 1 when it falls heads; you win the game if 
the first time that B, # a, your result, B,, is 0 (and, therefore, a, = 1). In other 
words, you win if at the first time when the B sequence differs from the a 
sequence, the B sequence is smaller. 

The motivation for this procedure is to think of @ as determining the interval 
(0,@), and think of the coin tosses as determining a number in [0, 1]; “win” means 
“land in the prescribed interval”. There are many other solutions; this might be 
one of the simplest. 
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How can you win? To see the answer, it helps to introduce some notation: let 
1, N>5,N3,... be the positions of the 1’s that occur in the sequence, a. One way to 
win is to toss so that the B sequence agrees with the a sequence for the first 
n, — 1 terms and then disagrees. The probability of that is 1/2 to the power n,, 
which is equal to the partial sum up to n, of the binary representation of a. The 
next way to win is to toss so that the B sequence agrees with the a sequence for 
the first n, — 1 terms and then disagrees. The probability of that is 1/2 to the n,, 
which is equal to the number obtained from the partial sum up to n, of the binary 
representation of a@ by replacing the first 1 with a 0. These two ways of winning are 
the beginning of an infinite sequence. The (infinite) sum of the corresponding 
probabilities is exactly the binary representation of a. 


A-5 (6, 1, 3, 0, 0, 0, 0, 1, 0, 2, 49, 137) 


Solution. The diameter of Y is 2cos(w/2(2m + 1)), so all the |p — v,| for 
1 <i<2m + 1 fall into the interval [0,2 cos(7/2(2m + 1))]. Hence some two of 
them differ by at most 


7 IT 
cos| 2(2m + 1) | 
2m 
Let 
1 qT 
co| 2(2m + 1) 


f(m) = 


This is a positive term sequence with limit 7*/32 and therefore this sequence has 
a positive minimum value. Therefore, 


7 


_ —_—_ 
co| 2(2m + 1) 


for some A > 0. Hence 


T A 
085m ray) So e 
and the result follows. 
A-6 (3, 1, 0, 0, 0, 0, 0, 1, 0, 4, 49, 141) 


Solution. We show that a+ + xa +a = 0. 


Note that 
17.2) 2 (0) (oe) (0) 
“ - | » ans 7 » a,x°", q* = » a,x", xar* _ » a,x°r, 
n=0 n=0 n=0 n=0 
Therefore, 
at+xa*+ta= ) a,x"+ (Ay 4 + G,)x" + Y (a, + Qn) x": 


n=0 n=Q(Q n=0 
2|n,4|n 4|n n odd 
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If 2|n,4|n, then a, = 0. If 4|/n then a, ,, =a,, 80 a, 4+ a, = 0. If n is odd, 
then 4(,-1)/2 = 4p, 80 Ay + Ay_1y/2 = O. 


B-1 (144, 10, 0, 0, 0, 0, 0, 0, 0, 0, 42, 3) 


Solution. Consider a 2 X 2 dartboard centered at the origin with vertices at 
(1,1), (—1, 1), (-—1, — 1), and G, — 1). A point (x, y) in the square is closer to the 
center of the board than the top edge of the board if and only if 


yx? ty? <1-y, 
or equivalently, 


1—x? 


2 


y< 


A similar parabolic region arises for each of the other three sides, and therefore 
the region of points closer to the center than any edge has the shape of the shaded 
region in the following figure. 


Let A denote the area of this region in the first quadrant bounded by y = 
(1 — x*)/2 and y = x. Then, the probability we desire is 


8 Area A 1 — x? 4V2 —5 
——— = 2 Area A = 27! - Ty | gy = 
mn 0 2 3 


B-2 (49, 44, 17, 10, 0, 0, 0, 0, 3, 4, 41, 31) 


Solution. Yes. Let a be an arbitrary element of S. The set {a”: n = 1,2,3,...} 
is finite, and therefore a” = a” for some m,n with m > n > 1. By cancellation we 
have a’? =a, where r(a)=m-—n+1> 1. If x is any element of S, then 
aa’ ~1y = a™y = ax, and this implies that a”®~!x =x. Similarly, we see that 
xa"! =x, and the element e = a7! is an identity. The identity element is 
unique, for if e’ is another identity, then e = ee' = e'. If r(a) > 2 then a” ~? is 
an inverse of a, and if r(a) = 2 then a* = a = e and a is its own inverse. Thus S 
is a group. 


326 KLOSINSKI, ALEXANDERSON, AND LARSON [April 


B-3 (36, 4, 18, 48, 4, 10, 3, 1, 4, 1, 9, 61) 


Solution. a. Clearly, {jx"g(x) dx exists for g(x) equal to f(x) or f’(x) because 
f(x) tends rapidly to 0. Hence, 


[xtf) dx = —3p, + 6 f x"f(2x) dx. 
0 0 


Using parts on the integral on the left and the substitution u = 2x on the integral 
on the right, we obtain 


00 6 
X"FC)]o — My = ~ 3B + Sat Hn 


Since x"f(x) > 0 as x — © for any n > 0 we have 


n 1 
My 3 1 My-1 
on 
Iteration now yields 
n! 1 
My 3n 7 (: - x). 
m=1 2” 


b. Since L* _,/2") < &, the infinite product II” _,. — (1/2”)) converges to 
a nonzero finite limit and the result follows. 


B-4 (52, 7, 0, 0, 0, 0, 0, 0, 0, 1, 62, 77) 


Solution. Yes. Consider the countably infinite set Q of rational numbers in 
[0, 1]. For each irrational number a in [0, 1], let Q, be a subset of Q that has a as 
its unique cluster point. There are uncountably many Q.’s, each of which is 
countably infinite (and, in particular, not empty). If Q, N Q, Is infinite, then it 
clusters at both @ and £B, and therefore a = B. 


Second Solution. Let L denote the set of lattice points in the plane: {(m, n)|m,n 
= Z}. Let S, be the set of points in L at a distance less than or equal to one from 
the line y = 6x, —1 < @ <1. Then |S, 1 S,,| is a finite set if and only if 6, # 95. 


B-5 (17, 17, 1, 1, 0, 0, 0, 1, 17, 29, 37, 79) 


Solution. Choose coordinates so that AB and CD are vertical, and so that E 
lies on the x-axis. For an appropriate slope m we see that B and D lie on the 
intersection of y = m(x + d) with x” + y* = 1. Thus, 
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Now (consider y coordinates) s, —s, is twice the sum of the roots of this 
equation, and therefore, 


2m 
s,s |a<2a. 


with equality if and only if m= 1. Thus for d #0 the maximum value of 
(s, — s,)/d is 2 and it is attained if and only if the diagonals intersect at right 
angles and T is not a square (since d = 0 for a square). 


B-6 (0, 1, 1, 1, 0, 0, 0, 2, 0, 11, 35, 148) 


Solution. The volume in R” of all points (x,, x5,...,x,) with 0 <x, <x, < 
- <x, <1is 
1 Xn x 1 
lf to [dey dey... de, = —. 
070 0 n! 
Thus, we need to divide 
” 1 ¢*n x2 
M= ve X; — xX; X ; dx,...dx,, 
Usd, Jo Cer — x) Fa) be 


by 1/n!. When 0 <i <n — 1 iterated integration yields for a typical summand 
xi 1 


[fo fC “Wao iy ren) dts de, 


it] 


[Po LO ay fie) bein de, 
itl 
inn f~ Sees rece AK 5490+ Ay A541 


n—-C+1) i 
[Spr (1 — x;41) ° XTi 
(n-(i+1))! (i+ 1)! 
Multiplication by n! reveals that the kernels are “almost” the terms of the 


binomial expansion of ((1 — x;,,) + x;,,)". In fact by adding in and subtracting 
out the i = —1 term we get 


nlM = [of(t)(— (1 2)") dt + nd, 


f( X41) Mies 


where ox . 
= fey aa (1 —x,)f(1) de,...dx, 


n-2 


=f [0-2 yl Bin te, 
1 
— Tran): 


so the expected value is 


[OP(O ae + 0) 


where P(t) =1-—(1 — 1)”. 
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Symmetry Classes: Functions of Three Variables 


N. METROPOLIS 
Los Alamos National Laboratory, Los Alamos, NM 87545 


GIAN-CARLO ROTA* 
Los Alamos National Laboratory and 
Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02139 


I. Introduction. It is well known that every function of two variables f(x, y) is 
uniquely expressible as the sum of a symmetric function f, and a skew-symmetric 
function f,, defined, respectively, as 


_ f(z y) +f, *) _ f(xy) ~ FO.) 


fs(45 ¥) = > ——— fal x, ¥) = ———>———. 


The attempt to generalize this fact to functions of n variables led Alfred Young 
to develop his theory of symmetrizers, now subsumed in the theory of representa- 
tions of the symmetric group. 

According to this theory, a function of n variables is uniquely expressible as the 
sum of p,, functions, each one belonging to a different symmetry class, where p, is 
the number of partitions of the integer n. Unfortunately, a simple intuitive 
description of such symmetry classes has never been given except for n = 2. 

It is known that for functions of three variables, there is only one other 
symmetry class besides the two obvious symmetry classes of symmetric functions 
and of skew-symmetric functions. We give this third symmetry class a very simple 
characterization, one that seems to have been overlooked. We show that it consists 
of all cyclic-symmetric functions. We prove that every function of three variables is 
uniquely expressible as the sum of a symmetric function, a skew-symmetric func- 
tion and a cyclic-symmetric function. 

To make this note self-contained, we have added a short derivation of some 
known formulas. 

We believe that the underlying idea of this note will extend to functions of n 
variables. We hope the present note will at least entice the reader to further study 
of the vast theory of symmetry classes. 


II. Symmetry Classes. We shall deal with everywhere defined real-valued func- 
tions of three variables taken from a set {x, y, z}. The set of all such functions 
forms a vector space V over the field of real numbers. This setup is deliberately 
limited in scope for purposes of clarity. 

If o is a permutation on the set {x, y,z} mapping x to xo and if f is a 
function, we write 


of(x,y,z) =f(xo, yo, zo). 


* Partially supported by NSF Grant No. MCS 8104855. 
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A subspace W of the vector space V will be said to be a symmetry class if 
whenever f € W, then of € W. 

Our objective is to give a simple description of all symmetry classes. 

There are three notable symmetry classes, two of which are well known: 

(1) The symmetry class S consisting of all symmetric functions, i.e., all functions 
such that f(x, y, z) = f(xo, yo, zo) for all permutations o of the set {x, y, z}; 

(2) The symmetry class A of all skew-symmetric functions, i.e., all functions 
such that f(x, y, z) = (signa) f(xo, yo, zo) for all permutations o of the set 
{x, y, z}. Here, (sign o) is the signature of the permutation o. Since we are dealing 
with three variables, (sign ao) will be —1 if o is a transposition of two variables, 
and (sign a) will be 1 if o is the identity or else if o is a cyclic permutation. 

(3) The third is the symmetry class C of cyclic-symmetric functions, defined as 
the subspace C of all functions f satisfying the two equations 


f(x,y,z) +f(z,x,y) +fly,z,x) =0 
f(x,z,y) +fly,x,z) + f(z, y,x) =0. (*) 


The preceding definition makes sense: 


Proposition. The subspace C of V consisting of all functions satisfying (*) is a 
symmetry class. 


Proof. It suffices to show that if f is cyclic-symmetric and o is a transposition 
of the set {x, y, z}, then of is cyclic-symmetric. Now, by definition, the function of 
is cyclic-symmetric if and only if it satisfies the equations 


of(x,y,z) tof(z,x,y) tof (y,z,x) =0 
of(x,z,y) +of(y,x,z) +of(z,y,x) = 90. Ce") 


For the sake of the argument, suppose o is a transposition of x and z. The 
above equations (**) then reduce to the pair of equations 


f(z,y,x) + f(x,z,y) + fly, x,z) =0 
f(z,x,y) +fly,z,x) +f(x,y,z) =0. 


But note that these equations are identical except for order to the two equations 
(*). 

The same argument works for any transposition. Since any permutation is the 
product of transpositions, we conclude that of is cyclic-symmetric for any permu- 
tation oa, as desired. OC) 


It is easy to give examples of cyclic-symmetric functions. If f(x, y, z) is any 
function, then the function g(x, y, z) = f(x, y, z) — f(z, x, y) is cyclic-symmetric; 
and so is the function h(x, y, z) = f(x, y, z) — fy, z, x). 

The two equations (*) defining a cyclic-symmetric function can be replaced by a 
single equation, if we use place permutations instead of variable permutations. Let 
Tf be the function obtained from f by replacing the variable in the first entry of f 
by the variable in the third entry, the variable in the second entry by the variable in 
the first entry, and the variable in the third entry by the variable in the second 
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entry. Thus 
Th(x,y,z) =f(z,x,y) and rf(x,z,y) =fly,x,z). 
Then we have the Gmmediately obvious) 


Proposition. A function f is cyclic-symmetric if and only if f + tf + 77f = 0. 


III. Projections. We define three operators E,, E,, and E, from the vector 
space V to itself, as follows. Set 


1 1 
Ef=— of; E,f= 6 d_ (signa )of, 


where the sum ranges over all permutations o of the set {x, y, z}. 
To define the third operator E,, let o f(x, y, z) = f(z, x, y). Set 


1 
E.f=f- (ft of + 07f). 


These operators were first defined by Alfred Young. It is easily verified that 
they are mutually orthogonal idempotents and that FE, + E, + E, = o>, where oo 
is the identity operator. Furthermore, the function E,f is symmetric and the 
function Ef is skew-symmetric. 

What Young did not see, however, is the following: 


PrRoposiTIon. For any function of three variables f the function Ef is cyclic-sym- 
metric. Furthermore, if f is a cyclic-symmetric function, then E.f = f. 


Proof. As to the first assertion, let g = Ef. In other words, let 
1 
g(x,y,z)=f(x, 9,2) — s[ f(a yz) + fx 9) + fC, 2,40], 
To show that the function g is cyclic-symmetric we must verify the identities: 
g(x,y,z) + 9(z,x,y) +8(y,z,x) =0 


g(x, Z, y) + gly, X , z) + g(Z, y, x) = 0. 
Let us verify the first of these identities. The left side expands to 


1 
f(x,y,z) - zl f(s y,2) + f(z,x,y) +f(y,z,x)] 
1 
+ f(z, x,y) - zl f(z. xy) + f(y,z,x) + f(x, y,z)] 
1 
+ f(y,z,x) - zl f(y, 2,%) + f(x,y,z)+f(z,x,y)]. 


Add each column to obtain 


f(x,y,z) +f(z,x,y) + fly, z, x) 
1 
[f(x,y,z) +f(z,x,¥) +fly,z,x)] 


[f(z,x,y) +f(y,z,x) + f(x, y,z)] 


Wlre wl w| 


[f(y,z,x) + f(x, y,z) +f(z,x,y)] = 0. 
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Irrationality Without Number Theory 


RICHARD BEIGEL* 
Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218 


1. Introduction. Let k and ¢ be positive integers. It is well known that k!/’ is 
either a positive integer or an irrational number. This can be proved easily from 
the unique factorization theorem. 

An interesting question is how much number theory is necessary in order to 
prove this theorem. Maier and Niven [1] and Floyd [5] have presented proofs of 
this theorem that use no facts about prime numbers. The former use the division 
algorithm and induction to simplify and generalize Steinhaus’s proof [4, pp. 38-39] 
of the case k = t = 2; the latter uses the Euclidean algorithm to simplify and 
generalize Sagher’s proof [3] of the case t = 2. When t = 2, Maier and Niven’s 
technique 1s especially interesting because it does not explicitly use any number 
theory, only very basic inequalities. 

In this paper, we also prove the theorem for all k and t. Although our proof is 
more complicated than Floyd’s, it has the advantage of not explicitly using any 
number theory, so it can be presented to a very general audience. Our proof is 
simpler than Maier and Niven’s. To accustom the reader to the technique involved, 
we begin by presenting a very simple proof for the case t = 2. The idea is due toa 
proof presented by Niven [2] for the case k = t = 2. 


Proposition 1. k!/? is either a positive integer or an irrational number. 


Proof. Let x = k'/* and assume that x is rational. Then choose the smallest 
positive integer n such that nx is an integer. Let 


n'=n(x —|x]). 
where |x] denotes the integer part (floor) of x. Since 0 <x — |x] < 1, it follows 
that 0 <n’ <n. Note that n’ = nx — n|x|, which is an integer. What’s more, 


n'x = nx* — (nx) x], 


*Supported in part by grants CCR-8808949 and CCR-8958528 from the National Science Founda- 
tion. 
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1. Introduction. Let k and ¢ be positive integers. It is well known that k!/‘ is 
either a positive integer or an irrational number. This can be proved easily from 
the unique factorization theorem. 

An interesting question is how much number theory is necessary in order to 
prove this theorem. Maier and Niven [1] and Floyd [5] have presented proofs of 
this theorem that use no facts about prime numbers. The former use the division 
algorithm and induction to simplify and generalize Steinhaus’s proof [4, pp. 38-39] 
of the case k = t = 2; the latter uses the Euclidean algorithm to simplify and 
generalize Sagher’s proof [3] of the case t = 2. When t = 2, Maier and Niven’s 
technique 1s especially interesting because it does not explicitly use any number 
theory, only very basic inequalities. 

In this paper, we also prove the theorem for all k and t. Although our proof is 
more complicated than Floyd’s, it has the advantage of not explicitly using any 
number theory, so it can be presented to a very general audience. Our proof is 
simpler than Maier and Niven’s. To accustom the reader to the technique involved, 
we begin by presenting a very simple proof for the case t = 2. The idea is due toa 
proof presented by Niven [2] for the case k = t = 2. 


Proposition 1. k!/”? is either a positive integer or an irrational number. 


Proof. Let x = k'/* and assume that x is rational. Then choose the smallest 
positive integer n such that nx is an integer. Let 


n'=n(x —|x]). 
where |x] denotes the integer part (floor) of x. Since 0 <x — [|x] < 1, it follows 
that 0 <n’ <n. Note that n’ = nx — n|x], which is an integer. What’s more, 


n'x = nx* — (nx)[ x], 


*Supported in part by grants CCR-8808949 and CCR-8958528 from the National Science Founda- 
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which is an integer. Because n is the smallest positive integer with that property, n’ 
must be 0, so x — |x] = 0. In other words x is an integer. @ 


2. Main Results. THEOREM 2. k!/‘ is either an integer or an irrational number. 


Proof. Let x = k'/‘, and assume that x is rational. Then choose the smallest 
positive integer n such that nx is an integer. For 0 <i <t-— 1, we make the 
following claim, which we prove by induction on 2: 


i-ly yt 


INpucTiveE Hypotuesis. n’-'~'x‘'~' is an integer. 


The base case (i = 0) is obvious. The theorem follows from the final case 
(i = t — 1). It remains to establish the inductive step. Assume that the inductive 
hypothesis is true for 7 — 1 so that 


n'—'x'—'*1 is an integer. 


Let 


—i-1l ,t-i 


z=n' x 
We complete the induction by showing that z is an integer. Let 
n'=n(z-|z]). 


Then 0 <n' <n. We note that n’ is the difference of two integers because 
nz = (nx)'~', which is an integer by the initial assumption, and n| z] is the product 
of two integers. Therefore n’ is a nonnegative integer. What’s more, 


' 


nx 


nz —n|z|x 
_— niwtxt etl _ (nx )| z |, 


which is the difference of two integers (by the inductive hypothesis and the initial 
assumption), hence an integer. Since n is the smallest positive integer with that 
property, n’ must be 0. Therefore z — | z] is 0, so z is an integer, completing the 
induction. @ 


Using a trick from [1], we can extend this proof directly to apply to all algebraic 
integers. 


CoROLLARY 3. If x is an algebraic integer, then x is either a rational integer or an 
irrational number. 


Proof. Recall that an algebraic integer is by definition a zero of a monic 
polynomial with integer coefficients, 1.e., 
x'+ Yo cx’ =0. 
O<i<t-1 
Assume that x is rational, and choose the smallest positive integer n such that nx 
is an integer. Then, as observed by Maier and Niven 


ni—1xé — _ > c,n'—'x! 


en *(nx)', 


O<i<t-1 


which is an integer because nx is an integer. This establishes the base case for the 
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induction in the preceding theorem. The remainder of the proof is exactly as 
before, so we conclude that x is an integer. @ 


3. Generalizations. Floyd also shows that k!”* cannot be a root of a quadratic 
equation with rational coefficients unless k'/? is an integer. Using a different 
technique (but no number theory) we extend this to k'”‘ for all odd ¢. (The result 
is false for even ¢ because (2'/7)'”' is a root of a quadratic equation with rational 
coefficients. ) 


CoROLLARY 4. Let t be odd. If k'/' is a root of a quadratic equation with 
rational coefficients then k'/' is an integer. 


Proof. Let x = k'/‘ and assume that x is a root of a quadratic equation with 
rational coefficients. By the quadratic formula, there exists a rational number p 
and a nonnegative rational number gq such that x = p + q/”*. If gq = 0, then x is 
rational, and hence an integer by the preceding theorem, so we need only consider 
the case g > 0. 


=(p+q'”) 


> [| oa'7( + 1)' by the binomial theorem 


I 
t —i ll t —i 1 
tb [{ |e" gQhelgie + [{ q’”” 
i odd i even 


= +sq'/* +71, 


where 
t —i |i 


and 


Therefore, g'/* is equal to the rational number +(k — r)/s, unless s = 0. Each 


term of s is nonnegative because p is raised to an even power and q is positive. 
Furthermore, the last term (i = t) of s is equal to q!'/*!, which is strictly positive. 
Therefore s is positive, so g'/? is rational, so x is rational. By the preceding 
theorem, x is an integer. 


It is natural to ask whether k'”‘ can be a non-integral zero of an mth degree 
polynomial with rational coefficients. df ¢ has a divisor d such that 2 <d<m, 
then the answer is yes, because (2'/“)'”’ is a zero of x™~“(x% — 2).) The 
preceding corollary resolves the special case m = 2. It is unclear whether the proof 
technique works when m=3 or m = 4; however it is clear that the proof 
technique cannot be applied when m > 5, because there is no closed-form solution 
of the quintic equation. Because the statement of the general case requires some 
number theory, it seems likely that a resolution of the general case will require 
some number theory. We present an approach that works when t = 5 and m = 3. 


Coro.iary 5. If k'/° is a root of a cubic equation with rational coefficients then 
k'/> is an integer. 
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Proof. Let x =k'/ and assume that x is a root of a cubic equation with 
rational coefficients. Then there exist rational numbers p, g, and r such that 
x? + px? + qx +r = 0. Identically, 


x> — k = (x? + px? + qe + r)(x? — px + (p? — @)) + (2pq - p? — 1) x? 
+(pr+q?— p’q)x + qr —-p’r—k. 
Since x° — k = O and x? + px* +qxt+r=0, 
(2pq — p>? —r)x* + (pr+q?—p’q)x+aqr-p*r-—k=0. 

Thus x satisfies a quadratic equation with rational coefficients, and we are done 
by the preceding corollary, unless each coefficient of the quadratic equation is 
zero. From the first coefficient, we find r = 2pq — p°. Substituting into the second 
coefficient, we find q* + p’q — p* = 0. By the quadratic formula, g = 3(—1 + 
5!/2)n*. Therefore, 5'”/? = +(2q/p’ + 1), which is rational (and hence an integer 


by the preceding proposition), unless p = 0. But if p = 0, then g = 0 and r = QO, 
so x° = 0. Thus x is an integer. m 


4. Open Problems. Prove with a minimum amount of number theory: 


e If k'/”* is a zero of an irreducible cubic polynomial with integer coefficients 
and ¢ is not divisible by 3 then k!”‘ is an integer. 

e If k'/' is a zero of an irreducible mth degree polynomial with integer 
coefficients and ¢ is relatively prime to m then k'”‘ is an integer. 


5. Discussion. In our proofs, we used induction and the floor operation |x]. 
Since x is assumed to be rational, that operation implicitly uses the division 
algorithm. Thus, although the presentation of our proofs is simple, the depth of 
mathematics implicitly used is the same as in [1]. 

In Maier and Niven’s proofs, they used induction, the division algorithm, and 
the floor operation. Although they appear not to use any number theory in the 
case t = 2, they use the floor operation in that case. In fact, it would be very 
surprising if any proof of irrationality really used no number theory. 

In Floyd’s proofs, he used the Euclidean algorithm, which can be derived by 
using induction and the division algorithm. Thus, Floyd’s proofs also use the same 
depth of mathematics as in our proofs and in [1]. 
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Another Elementary Approach to the Jordan Form 


J. I. HALL 
Department of Mathematics, Michigan State University, East Lansing, MI 48824 


1. Introduction. Several elementary treatments of the Jordan form are avail- 
able. The recent article by Valiaho [3] in this MoNTHLY contains a proof and a 
good discussion of many other elementary proofs. The approach in this note differs 
from most of these earlier proofs in two ways. No restriction is made upon the 
underlying field, and the proof avoids complicated bookkeeping and awkward 
calculational verifications of linear independence. That which is new in the proof 
given here resulted from an effort to add these two features to the excellent 
account (following Filippov [1]) to be found in the book of Ortega [2]. The present 
treatment makes reasonably clear the fact that having a Jordan form is a natural 
property of a linear transformation. A linear transformation can be diagonalized 
when its eigenvector structure is very strong; there must be a basis composed of 
eigenvectors. As we see here, a linear transformation has a Jordan form (a weaker 
property) when its eigenvector structure may not be as strong but is still sufficiently 
rich. 


2. Existence of the Jordan form for linear transformations. Let X be a vector 
space over the field K, and let T: X — X bea K-linear transformation of X. The 
null space of T, null(T), is the subspace of all vectors of X taken to 0 by T. The 
range of T on X, range(T), is the subspace of all vectors of the form T(x), for some 
x in X. Of fundamental importance is the fact that the dimension of X over K is 
equal to the sum of the dimensions of null(T) and range(T). (See [2, 2.3.3, p. 62].) 
We denote by dim(U) the dimension of the K-subspace U of X. 

A Jordan chain for T in X of length m associated with A is a set of non-zero 
vectors X,,...,X,, such that 

T(x;) =Ax;+x;_,, fori=1,...,m, 

where we set xX, = 0. Note that if m is at least 1, then A is an eigenvalue for T 
with associated eigenvector x,. In particular, an eigenvector by itself is a Jordan 
chain of length 1. If A = 0, then T(x,;) = x;_, for i = 1,...,m; and the chain is 
nothing other than a “‘nil-orbit” for T in X. A Jordan chain for T associated with A 
is also a chain associated with A + a for T + aI, where I is the identity linear 
transformation. In particular a chain for T associated with A is merely a nil-orbit 
for T — Al. 


THEOREM 1. (Existence of the Jordan form for linear transformations). Let A: 
V — V be a linear transformation of the finite-dimensional K-vector space V. Assume 
that every non-zero A-invariant subspace of V contains an eigenvector for A. Then V 
possesses a basis which is a disjoint union of Jordan chains for A. 


The proof is based upon a general proposition and a technical lemma about 
linear transformations which possess a Jordan form. 


Proposition 2. Let T: X — X be a linear transformation of the finite-dimen- 
sional K-vector space X. Let W be a T-invariant subspace of X, and let S: W — W be 
the restriction of T to W. Assume that: 

(i) Weontains the null space of T; 

(ii) The range of S on W equals the range of T on X. 

Then W equals X. 
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Proof. Indeed: 
dim( X) = dim(null(T)) + dim(range(T)) 
= dim(null(S)) + dim(range(S)) = dim(W). 
Lemma 3. Let T: X — X be a linear transformation of the finite-dimensional 


K-vector space X. Assume that X has a basis which is a disjoint union of Jordan 
chains for T: 


Xiioesso X1,m, 
Xo 19+++9X2,m, 
Xkito+e to Xk m, 
where the ith chain X; 1,..-, X;,m, has length m; and ts associated with the eigenvalue 
A;. Assume further that \, = +++ =Aq=0 and 4; # 0 fori > d. Then 
(1) null(T) has as a basis Xx, \,...,Xj,j5+--»Xq¢, forl<i<d; 


(2) range(T) has as basis the disjoint union of the Jordan chains: 


Xi 1> nr, X1,m,-1 
Xajiss++s Xd,my-1 
Xd+1,t9°°°9> Xd+1,mqs1 
Xeotreee> Xk, mg 


Proof. Clearly the given linearly independent sets are contained in these sub- 
spaces as claimed. Thus dim(null(T)) > d and dim(range(T)) > dimCX) — d. On 
the other hand 


dim(null(T)) + dim(range(T)) = dim(X); 
so in each case we must have equality. The sets are bases. 


Proof of Theorem 1. The proof is by induction on the dimension of the 
underlying space V. If A is a multiple of I Gn particular when dim(V) = 1), any 
basis is a basis of Jordan chains of length 1. Assume that this is not the case and 
that the theorem is true in all dimensions less than n = dim(V). 

By assumption A has an eigenvector v # 0 associated with some eigenvalue A. 
Let B be the non-zero linear transformation A — AI. Also let C: U — U be the 
linear transformation of U = range(B) which is the restriction of B to U. Note that 
v € null(B), so U has a dimension m strictly smaller than n. A C-invariant 
subspace of U is invariant under both B and A. Therefore by induction we may 
conclude that U has a basis @Y which ts a disjoint union of Jordan chains for C 
(and so for B and A as well). Let the chains of Y be Y, for 1 <i <k. Here the 
ith chain @;, consisting of v; \,...,U;,m, 1S associated with the eigenvalue 4A,. 
Assume that A, = -:: =A, =Oand A, #0 fori >d. 

For each i with 1 <i <d, uv, ,, is in U = range(B); so we may choose a vector 
U; m,+1 Such that BO, ,+1) = Ui, m, Next we choose vectors z,; (for 1 <j < q) such 


that 
Zioe09 Z qo Ui 9+ + +0 Ua 


is a basis for null(B) which has dimension gq + d =n — m. 
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Consider now the set Y which is the union of %, all the uv; ,, 4, for 1 <i <d, 
and all the z; for 1 <j < q. This set contains 


mt+d+q=m+d+(n-—-m-—d)=n 


vectors. We first note that Y is a union of k + q disjoint Jordan chains for B, 
being the original chains %, (extended by vu, ,,,, when 1 <i < d) and the new 
null vectors z, (chains of length 1). 

Our construction of the candidate basis % is essentially that of [1,2]. To 
complete the theorem it remains only to prove that V is indeed a basis for V. Let 
W be the subspace of V spanned by %. As W contains all the z; as well as 
Ui 1+++>Va1, W contains null(B). As W contains the basis Y of U, the range of B, 
it is certainly B-invariant. Indeed because, in addition to WY, W also contains each 
U;,m,+1 tor 1 <i < d, Lemma 3 shows that the restriction of B to W still has U as 
its range. By Proposition 2, W is in fact equal to V. Thus the set Y of size n spans 
the space V of dimension n and so is a basis of V, as required. This completes the 
proof of Theorem 1. 


3. Existence of the Jordan form for matrices. If A is a member of the field K 
and m is a positive integer, then the matrix J,(A) is the m X m matrix over 
K with A’s on the diagonal, 1’s on the superdiagonal, and 0’s elsewhere. A matrix 
with entries from K is said to be in Jordan form if it is a block diagonal matrix, 
with all diagonal blocks equal to J,,(A) for various m and A. 


THEOREM 4. (Existence of the Jordan form for matrices). Let A be a matrix with 
entries from the field K. Assume that the characteristic polynomial of A factors into 
linear factors over K. Then there is an invertible matrix P with entries from K such 
that P~'AP is in Jordan form. 


Let V = kK”, the n-dimensional K-space of column vectors. Then naturally 
associated with the matrix A we have the linear transformation A: V > V given by 
A(v) = Av. The present theorem for the matrix A will be proven by applying the 
previous theorem to the linear transformation A. 


Lemma 5. Let A be ann Xn matrix with entries from the field K. Assume that 
the characteristic polynomial of A factors into linear factors over K. Then any 
non-zero A-invariant subspace W of V = K” contains an eigenvector. 


Proof. Let W1,.--;WimsXm+ p++) Xp, be a basis for V = K” containing the basis 
W,,---,W,, for W. Define Q to be the matrix whose ith column is w, when 
1<i<m and is x, when m+1<i<n. Then B=Q7'AQ is block upper 
triangular with an n — m Xm block of 0’s in the lower left. The characteristic 
polynomial of B is the product of that of the two diagonal blocks and is equal to 
that of A. In particular, the upper left m < m block has a characteristic polyno- 
mial which splits into linear factors over K. Therefore B has an eigenvector in V 


of the form v = (v,,...,v,,,0,..., 0)’. Now 


9>/%m “> 
W = OU = VW, T+ FU Wm 
is an eigenvector of A belonging to W. 
Proof of Theorem 4. By Theorem 1 and Lemma 5, V has a basis ¥ which is a 
disjoint union of Jordan chains ; ,,...,U; m, for 1 <i<k. Let P be the matrix 
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form over the complex numbers is all that is needed, the use of Proposition 2 may 
simplify calculations. As we have seen here, that is the case with the proofs of 
[1,2]. The existence proof of [3] can also be shortened by use of Proposition 2. 
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Borromean Circles Are Impossible 


BERNT LINDSTROM 
Department of Mathematics, Royal Institute of Technology, S-10044 Stockholm, Sweden 


Hans-OLov ZETTERSTROM 
National Defence Research Establishment, S-171290 Sundbyberg, Sweden 


The arrangement of three topological circles in R°* depicted in Fic. 1 is called 
the Borromean links in knot theory [cf. 1]. Such an arrangement of ‘circles’ is 
possible with curves, which are homeomorphic to geometric circles. To our 
knowledge it has not been observed before that the arrangement is impossible in 
R°? if the circles are geometric even when the radii are arbitrary. 


C; C, 


C3 


Fic. 1. 


Our proof is indirect. We assume that the arrangement is possible using 
ordinary circles. Lift C, (cf. Fic. 1) towards the spectator such that it will meet C, 
in two points in its new position (C3). Observe that C4 will not meet C, since we 
lift C, away from C3. 

There are now two cases: either C, and C} belong to a plane 7 or they do not 
belong to the same plane. In the first case C, will be partly above the plane, partly 
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The arrangement of three topological circles in R° depicted in Fic. 1 is called 
the Borromean links in knot theory [cf. 1]. Such an arrangement of ‘circles’ is 
possible with curves, which are homeomorphic to geometric circles. To our 
knowledge it has not been observed before that the arrangement is impossible in 
R° if the circles are geometric even when the radii are arbitrary. 


C; C, 


C3 


Fic. 1. 


Our proof is indirect. We assume that the arrangement is possible using 
ordinary circles. Lift C, (cf. Fic. 1) towards the spectator such that it will meet C, 
in two points in its new position (C4). Observe that C3 will not meet C3 since we 
lift C, away from C3. 

There are now two cases: either C, and C3 belong to a plane 7 or they do not 
belong to the same plane. In the first case C, will be partly above the plane, partly 
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below. If we follow the circle C; in clockwise direction we are first above the 
plane, then below, then above, then below, then back again and above the plane. 
This means that the circle C, meets the plane at least 4 times, which is impossible 
since the circle does not belong to the plane. 

In the other case, when C, and C3} do not lie in the same plane, we will prove 
that they belong to a sphere. The circle C, will meet this sphere at least 4 times, 
again a contradiction. We will now prove that C, and C3 belong to a sphere when 
they are not coplanar. Let P and Q be the points in which C, and C} meet (see 
Fic. 2). Let R be the midpoint of the line segment PQ. Let S, and S, be the 
centres of the circles C, and C3, respectively. The plane 77’ containing R, S, and 
S, is orthogonal to the line PQ. Let /; be the normal to the plane of C; through S, 
(i = 1,2). The line /; is orthogonal to the line PQ and contains the point S, in the 
plane 7’. It follows that J; lies in a’ (i = 1,2). Since J, and /, are not parallel lines 
they will meet in a point T. The distance between T and any point in C, U C3 isa 
constant r. Therefore C, and C; belong to the sphere with centre T and radius r. 
It is not hard to see that the circle C, goes in and out of this sphere 4 times, which 
is impossible because a circle and a sphere have at most two points in common 
when the circle does not belong to the sphere. Therefore, Borromean circles are 
impossible! 


It is interesting to make a model of the Borromean rings in some elastic 
material like thin iron wire. Try to make the ‘circles’ as perfect as possible! The 
arrangement of rings will then attain infinitely many stable positions in space. 
REFERENCE 
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The Inclusion L?(p) ¢ L4(v) 


A. G. MIAMEE 
Department of Mathematics, Hampton University, Hampton, VA 23668 


Let 1 be a nonempty set and > be a o-algebra of subsets of (. It is well known 
that /?(Q.) <14(Q.) whenever 0 < p < gq < ~. Subramanian [3] characterized all 
positive measures pw on (Q, ) for which L?(w) C L*(w) whenever 0 <p <q <~™. 
On the other hand it is well known that if w(Q) < © one has L%(u) C L?(p) 
whenever 0 < p < q < ~. Romero [1] considered this second inclusion property 
and characterized all positive measures w for which L?(u) C L?(uw) whenever 
O<psqs~. 

In this note we consider the more general inclusion L?(w) C L*(v), where now 
we have two measures involved and give a characterization for this general 
inclusion to hold (see the Main Theorem). Our proof for the Main Theorem is a 
nice application of the closed graph theorem. 

Then we apply the Main Theorem to some interesting special cases and get 
more concrete results including some other versions of the results of Subramanian 
and Romero mentioned above. Our proofs for the results considered in [1] and [3] 
seem more to the point. 

Concerning the definition of L? spaces and the above inclusion the following 
remark is in order. 


Remark. When L?(w) is regarded as a metric space; and that is what we would 
be doing throughout this work; the space which is really under consideration is not 
a space whose elements are functions, but a space whose elements are equivalence 
classes of functions. This is specially important here, and thus whenever we write 
L?(w) C L4(v) we mean that every equivalence class of functions in L(y) belongs 
to L4(v) as an equivalence class. There is, however, another possible interpretation 
for L°(u) C L%(v), namely, any individual function f with /{|f(w)|? du(w) < ~ 
has the property {| f(w)|* dv(w) < . 

The following lemma reveals the relation between these two different possible 
meanings of the inclusion L?(w) C L(y). Having this lemma we will interchange 
these two different meanings of L?’(w) C L4(v) in some appropriate settings, 
without mentioning the Lemma explicitly. 


Lemma. Let p and q be two positive real numbers. Then the inclusion L?(w) C 
L*(v) holds in the sense of equivalence classes if and only if 4 and v are absolutely 
continuous with respect to each other and L?(w) C L4(v) in the sense of individual 
functions. 


Proof. We just prove the “only if” part because the other part is easy. Suppose 
L?(uw) € L4(y) in its natural meaning as equivalence classes. Let E be a set in 
with w(E) = 0 then y, is in the equivalence class 0 of L?() but this equivalence 
class 0 must belong to L%(v), by our hypothesis. Then 0 is necessarily the 0 
equivalence class of L7(v). Now y,; € 0 implies that x, = 0 a.e. v, which means 
VCE) = 0. Thus v < w. Similarly one can see wp < v. 


This work was supported by NASA Grant NAG-1-768 and by ONR Grant N00014-89-J-1824. 
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MAIN THEOREM. Let p and q be any two positive real numbers. Then L?(w) C 
Lv) if and only if w is absolutely continuous with respect to v and there exists a 
constant C( p,q) such that 


If lla» $ CCd, @)IFillp,u, for all fe L?(u). (1) 


Proof. Since the “if” part is not hard to see we only show the “only if” part. 
Assume L?(w) C L*(v). By the Lemma the two measures p and v are absolutely 
continuous with respect to each other. To show (1), first take the case that 
p,q = 1, and define the linear operator J from the Banach space L?(w) into the 
Banach space L?(v) by 

I(f) =f, foreach f in L?(p). 
We claim that the operator J is closed (for the definition, see p. 120 [2]). To see 
this let f, be a sequence in L(y) such that 
f, ~f in L?(u)-norm, and I(f,,) > g in L4(v)-norm, 

for some f in L?(w) and some g in L*(v). One can then find a subsequence f,, of 
f,, such that (see for example Theorem 3.12 in [2]) 

fn, Of ae.p,andf, > g ae.v. 
Now since v is absolutely continuous with respect to w we can write 

fn, ?f ae.v,andf, ~g ae.v. 
Thus f = g a.e. v, which means that f and g, regarded as elements of L‘4(v), are 
the same. This shows that the linear operator J is closed and hence bounded by 


the closed graph theorem (see for example p. 120 [2]). Thus there exists some 
constant C( p,q) such that 


If lla = MCA Ilan < COv, aDIlF ily, 2, for all fe L?(p). 


This completes the proof of (1) for the case p, g > 1. To prove (1) for the other 
case let r= min(p,q), p'’ =p/r and q'=4q/r. One can easily show that the 
assumption L?(w) C L%(v) implies L?(w) ¢ L7(v). But since clearly p’,qg' > 1 we 
can use the result for the first case to write (note that f € L?(u) = |f|" € L?(w)) 


Allg S$ COD. a IFN ly... for all fe L?(n) 


for some constant C(p’, q’). This can be written as 
If lla» < CC’, a’)ilfllp,., for all fe L?(p), 
or as IIfllqv < C(p, Milf llp,., for all f € L?(u), where C(p, q) = (C(p’, q’))'””. 


Now we apply our Main Theorem to a couple of special cases to get more 
concrete results in these interesting cases. 


PropositIon 1. The following statements are equivalent 


Gi) L?(u) C L?(v) for some positive real number p. 
(ii) p is absolutely continuous with respect to v and there exists a constant D such 
that 


v(E) < Du(E), for every set E in &. (2) 
Gii) L\(u) Cc L'@) 
(iv) L?(u) C L?(v) for every positive real number p. 
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Proof. (i) = (ii): The absolute continuity of w with respect to v follows from 
the Lemma. Since L?(u) C L?(v), by the Main Theorem there exists a constant 
C(p) such that 


flo. =< CCp)Ifilp,., for all f in L?(p). (3) 


Take any set FE in > with w(E) < ~, then y, belongs to L?(y). Hence we can 
apply (3) to f = x, to get v(E) < Du(E) with D = C(p)”. For the other E’s the 
inequality (2) is obvious. (ii) = (iii). Start with a simple function f = Lf_14;X¢, in 
L'(w), with E, and E, disjoint if k # j. Using (2) we can write 


fll, = ¥ lajo(E,) <D ¥ laglu(E,) = Dilf lly, (4) 
k=1 k=] 


which means that any simple function in L'() belongs to L'(v). Now take any 
function f in L'(w), and take a sequence f,, of simple functions in L'(w) such that 
f, 2f in L'(w)-norm. Then f,, is clearly a Cauchy sequence in L'(w). Now (4) 
implies that f, is also a Cauchy sequence in L'(v) and hence it converges to some 
function g in L'(v); ie., 


f, 2 g in L'(v)-norm. 


Using a subsequence argument similar to that in the proof of the Main Theorem 
one can show that f =g ae. v. This together with the fact that g was in L'(v) 
implies that f € L'(v). (iii) = Civ): Let f © L?(u). Then |f|? € L(y). Thus as- 
sumption (iii) implies that |f|? € L(Y), which means that f € L?(v). Finally, 
(iv) = (i) is obvious. 


Note. The proof shows that the constant D in part (ii) can be chosen to be 
independent of p, since we can take D = C(1). 


ProposiTION 2. The following statements are equivalent 
Gi) L?(u) C L4(u) for some pair p and q of positive real numbers with p < q. 
(ii) There exists a constant m > 0 such that 
u(E) =m, for every p-non-null set E in &. (5) 
Gii) L?(w) C L4(y) for every pair p and q of positive real numbers with p < q. 


Proof. (i) = (ii): According to the Main Theorem there exists some positive 
constant C such that 


If lla.u < Cllfllp,,, for every f in L?(p). (6) 


Let E be a w-non-null set in } whose w-measure is finite. Then f = vy, belongs to 
L?(w). So by applying (6) to f = vy, one gets w(E)'/4 < Cu(E)'/”. Since 1/p > 
1/q (because p <q) and 0 < w(E) < © one gets w(E)'”?2~'“4>1/C which 
implies (5) with m = C?4/-®. (ji) = (iii): Let f € L(x). For any positive 
integer n let E, = {w € 2: |f(w)| > n}. Since 


n’u(E,,) < [ fl? du < irl’ du <, 
E, ) 
we have uw(E,,) < « for every n. Thus by the hypothesis in (ii) either w(E,,) = 0 or 


u(E,) =m. On the other hand E,,, CE,, w(E,) < », and N°_,E, = © imply 
that u(E,,) — 0. Thus there must exist some positive integer n, such that | f(w)| < 
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nN, for w-almost all w © 0. Hence 
[IPI du = fifPLfle? du x ng? fifi? du < @. 
0) 0) 
(iii) = (i): This is obvious. 
PRoposiTION 3. The following statements are equivalent. 


(i) L4(u) C L?(w) for some pair of positive real numbers p and q with p < q. 
(ii) There exists some constant M > 0 such that 


u(E) <M, for every set E in & with finite w-measure. (7) 
Gii) L4(u) C L?(u) for every pair p and q of positive numbers with p < q. 


Proof. (i) = (ii): Using the Main Theorem, one can see (as in the proof of 
(i) = i) in Proposition 2) that there exists a constant C > 0 such that 


u(E)'’” < Cu( LE)’, for every set E in > with finite u-measure. 


Since 1/p > 1/gq, if E is some p-non-null set in } of finite w~-measure we can 
write wCE)'/?~ 1/4 < C which means (7) holds with M = C?4/47”, (ii) = (iii): Let 
M be the smallest number for which (7) holds. There exists a sequence F,, of sets 
in >} with finite measure such that w(F,) ~ M asn > ~. Let S = U?_,F, then 
u(S) = w(U* _,F,) = lim,, ..4(U ”"_,F,). But by the choice of M, u(U"_, F,) < 
M for every m. So w(S) < M. On the other hand u(S) = w(F,) for every n and 
pi(F) > M. This gives w(S) => M. Thus w(S) = M. Letting T = O — S one con- 
cludes that either u(T) = 0 or T is an atom of infinite measure. Now let f be in 
L?(w), then f must vanish on T almost everywhere jp. So 


[\fl? du = fifi? du + fifi? du = fifl? dw, 


Since wCS) = M < © we can apply Hélder’s inequality to get 


D/4q 
[fl du < wcsyer{ fiflidu|  <e. 
co) S 
That is to say f © L?(y). Again (iii) = (i) is clear. 
We conclude this note by saying that most of these results and especially the 
Main Theorem is correct for the case of matrix-valued functions and the proof 
given here goes through in a similar fashion. 
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1. Introduction. The formulas C = 27r and A = mr’ are part of the vocabu- 
lary of our students and of many ordinary citizens, but very few know the ideas 
underlying them—not even my recent honors calculus class, who told me they 
were handed the formulas in high school without explanation. As for the presence 
of wz in both formulas, some thoughtful students may consider it a miracle; but 
probably most people regard 77, with its mysterious string of decimals, as just one 
more example of the depressing magic that math teachers dish out for them to 
memorize. 

Devoting even one calculus or precalculus period to the fundamental ideas can 
accomplish several things: come to grips in a nonthreatening way with the com- 
pleteness of the real numbers, afford a preview of the integral in a familiar context, 
clarify the basis for the trigonometric functions, show the theorem on the limit of 
(sin a)/a in its natural setting as essentially just the definition of the circumfer- 
ence of a circle, and of course elucidate the famous formulas themselves (which 
the students are going to remember long after they have forgotten all their 
calculus). 

What follows is an expanded account of what I presented to the class (and later 
to an MAA section meeting). It was a pleasure to see their eyes light up at the 
punch line. 


2. 7 and (sina) /a. The perimeter of any circumscribed polygon about a circle 
is greater than that of any inscribed polygon. (This is a good exercise for the 
students, though they may need hints.) Obviously, the area inside any circum- 
scribed polygon exceeds that inside an inscribed polygon. It is convenient to 
restrict our attention to the inscribed regular polygons P,,, where m, the number 
of sides, is a power of 2. (I had my students draw the picture for m = 4, 8, 16, and 
32.) We define C, the circumference of the circle, to be the least upper bound of 
the perimeters of the P,,, and A, the area inside the circle, to be the least upper 
bound of the areas they enclose. 

Polygonal lengths on a circle of radius r are equal to r times the corresponding 
lengths for the unit circle. So, then, is their least upper bound. (Here and later we 
need the theorem that lub cx = c lub x (c > 0); I would show students this proof.) 
The circumference of a circle is therefore proportional to the radius. The constant 
of proportionality—i.e., the circumference of the unit circle—is called 277; this 
defines zr. Thus, C = 27r. 

On the unit circle, a pair of consecutive vertices of P,, marks off an arc of 
length 277/m—at the present stage, this is an axiom rather than a theorem—and 
this number is defined to be the radian measure of the corresponding central 
angle. On a circle of radius r, the angle cuts off an arc of length 27rr/m. 

FiGurE 1 shows the sector associated with a side AB of a polygon P,,, and its 
two half-sectors corresponding to sides AD and DB of P,,,. The central half-angle 
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rtana 


<p 


Fic. 1. 


a (or a@,,,) has arc ra, and the half-side of P,, is r sin a. Since the ratio of side to 
arc is the same as perimeter to circumference, 


sna P, 1 (1) 
= el > . 
C 


Qa 


The area inside triangle OAD is (1/2)r* sina; and the area inside P,,, is 
C/(ra) times that area. Therefore, 


1 sina 
area P,,, = =Cr 
2 a 
Taking least upper bounds yields 
1 
A= ao. (2) 


(This is what so pleased the students.) With C = 27r, A = rr’. 


3. Remarks on the proof. A variant derivation bypasses (sin a)/a, using the 
more elementary fact that cosa — 1 as a — 0. To verify this latter, assume for 
simplicity that 0 < @ < 7/2; then 


0<1-cosa < 1—-cos*a@ =sin’a < sina <a (3) 


—this last from FiGurE 1. By the squeeze theorem, 1 — cosa — 0, whence 
cosa — 1, as a > 0. (Note too for later reference that sin a — 0.) 

In Ficure 1, the area enclosed by triangle OAB is (1/2)r? sin2a, and the 
length of AB, a side of P, is 2r sina. The area and perimeter of P,, are in the 
same ratio, which is (r/2)cos a. Taking limits, we get A/C = r/2. O 


A heuristic form of these arguments is the well-known “cut-and-unroll” proof 
shown in FIGuRE 2. 

At the other extreme, [1] uses integration by parts to transform the (improper) 
integral for the length of arc of a quarter-circle to (2/r) times the integral for the 
area. 
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upper bound of polygonal paths; the fact that congruent arcs then have the same 
length; the fact that arc length is additive (needed in deriving the formula for 
cos(a + 8)); and the existence of arcs of arbitrary length, so that sin a and cos a 
really are defined for all real a [2, pp. 198-199]. Everyone will have one’s own idea 
about whether, when, and how to bring them in. Most texts say nothing at all. 

In our discussion, the length of arc on the unit circle determined by a side of P,, 
is defined—as C/m; therefore sin a and cos @ are defined for a = 27/4, 27/8, 
27/16,.... Trivially, the sequence of perimeters P,, P,, Pi,...1S increasing 
(which is why we stick to powers of 2); the limit notation in (1) is therefore 
intuitively clear. But is it reasonable to evaluate the limit of (sin a)/a@ (and the 
other limits) for a ranging merely over a sequence? Well, at least that’s a more 
substantial domain than the empty set; moreover, we get the result free of charge 
in the course of deriving the formula A = arr’. In any case, we are still permitted 
to show students the standard argument based on the areas of the right triangles 
and the sector. 

By the way, if we now define the length along the unit circle determined by k 
consecutive sides of P, to be kC/m—or, alternatively, if we decree 
additivity—then all dyadic rational multiples of C appear as arc lengths. This 
dense set of values, obtained effortlessly, gives the student something to hang onto. 
(One can then get the lengths of all other arcs from this set by means of least 
upper bounds.) 


Acknowledgements. This article is an elaboration of material that was at one time part of a joint 
venture with R. H. McDowell, whom I wish to thank for permission to publish the material separately. I 
also wish to thank the referee for several helpful suggestions. 
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1. Introduction. What does an orthogonal transformation ‘look like” in Eu- 
clidean space of dimension n? 

For n < 2, most of my linear algebra students can literally see the answer to this 
question once I tell them what an orthogonal transformation is—any linear 
transformation which leaves all lengths unchanged—and point out that they 
already are familiar with some geometrically defined examples: reflections and 
rotations. 

In fact, an orthogonal transformation of a line (n = 1) is either the identity or 
the reflection across the origin, and an orthogonal transformation of a plane 
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1. Introduction. What does an orthogonal transformation ‘look like” in Eu- 
clidean space of dimension n? 

For n < 2, most of my linear algebra students can literally see the answer to this 
question once I tell them what an orthogonal transformation is—any linear 
transformation which leaves all lengths unchanged—and point out that they 
already are familiar with some geometrically defined examples: reflections and 
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In fact, an orthogonal transformation of a line (n = 1) is either the identity or 
the reflection across the origin, and an orthogonal transformation of a plane 
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(n = 2) is either the reflection across some line or some rotation around the 
origin. Not only is it easy to prove these facts of low-dimensional Euclidean 
geometry, they are visually obvious to almost everyone. 

The main point of this paper is to show how an orthogonal transformation in 
any dimension can (in a sense) be visualized using the known examples in the line 
and the plane. This is explained in §5, with the help of a theorem proved in §4: for 
n > Q, every orthogonal transformation has an invariant line or plane. 

Of course these are familiar results; see, for instance, [1, §81]. However, the 
particular geometric proof given in §4 seems less well known (J would welcome a 
reference). Complex eigenvalues make no appearance. Instead, Rayleigh quotients 
play a central role; they are defined and differentiated in §3, after some basic 
material on adjoints is recalled in §2. 

I have successfully presented this proof several times, in a linear algebra course 
peopled by math majors who have studied ‘‘vector calculus” but not yet ‘‘abstract 
algebra.” My bias is always towards geometric methods; they seem especially 
appropriate to a study of the structure of orthogonal transformations. The visual 
intuition and physical experiments which are available in dimensions 1, 2, and 3 
can be parlayed into a good understanding of the general case. 


2. Adjoints. Let E,, be a Euclidean vectorspace of dimension n, with the inner 
product of vectors v and w denoted by (v,w>. A linear transformation A of E, is 
orthogonal if ( A(v), A(w)) = (v,w) for all v and w, and symmetric if ( A(v), w) 
= (v, A(w)) for all v and w. 

Both definitions can be rephrased in terms of the adjoint of A, that is, the 
linear transformation of E, determined uniquely by the requirement that 
(A*(v),w) = (v, A(w)) for all v and w: the transformation A is orthogonal if 
and only if A* = A~', and symmetric if and only if A* = A. (Symmetric transfor- 
mations are, in fact, often called self-adjoint.) 


Remark. For the convenience of the reader who is more at home with matrices, 
the definitions can be rephrased yet again. (Neither matrices nor bases will be used 
in the rest of the paper.) A basis u,(@i = 1,...,) of E,, is orthonormal if (v,, v;) 
equals 1 or 0 according as i equals j or not. Let the (square) matrix of A with 
respect to some orthonormal basis be [a,;]. Then the matrix of A* with respect to 
the same basis is the transposed matrix [a,;]. It follows that A is orthogonal if and 
only if the rows (or columns) of its matrix, with respect to any orthonormal basis of 
E,. are an orthonormal basis of the Euclidian vector space of row (or column) 
vectors with the usual “dot product” as inner product. Similarly, A is symmetric if 
and only if its matrix (with respect to any orthonormal basis) is its own transpose, 
that is, symmetric across its main diagonal. 


3. Rayleigh quotients. The Rayleigh quotient of a non-zero vector vu (with 
respect to A) is R(v) = ¢( A(v),v)/<v,v), that is, the scalar component of A(v) 
along v. 


LEMMA 1. R attains maximum and minimum values. 


Proof. R is continuous, and attains all its values on the closed, bounded set of 
unit vectors (since R(tv) = R(v) for t # 0). O 
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LEMMA 2. R is differentiable, with gradient 
A(v) + A*(v) — 2R(v)v 


VR(v) = Top) 


Proof. It is sufficient (and necessary) to prove that 
( A(v) + A*(v) — 2R(v)v,w) 
(U,U) 
for every vector w. The proof is straightforward calculation. It can be accom- 
plished by using the quotient rule and twice applying (once with B = A, once with 
B = 1) the identity 
(Vu, B(v)),w) = (w, B(v)) + (v, B(w)) 


which holds for any linear transformation B of E,. O 


(VR(v),w) = 


Remark. The Rayleigh quotient seems to be a very useful geometric tool. It is 
applied in the next section to the special cases of orthogonal and symmetric 
transformations. Another application is given in [2]: if A is an arbitrary linear 
transformation of E,, then the trace of A is n times the average value of its 
Rayleigh quotient on the hypersphere §”~! of unit vectors. 


4. Invariant subspaces of orthogonal transformations. A subspace F CE, 1s 
invariant (for the linear transformation A) if A(v) € F for every v € F. Of course 
E,, and {0} are trivially invariant; any other invariant subspace is non-trivial. There 
are many linear transformations without non-trivial invariant subspaces. 

Even an orthogonal transformation may not have any non-trivial invariant 
subspaces. For instance, E, has no non-trivial subspaces to be invariant; more 
seriously, E, has many non-trivial subspaces (all the lines through the origin), but 
none of them is invariant if A is rotation by an angle other than an integer 
multiple of 77. However, those turn out to be the only problematic cases, as we see 
from the following theorem, promised in the introduction. 


THEOREM 1. If A is an orthogonal transformation of E,, (n > 0), then A has at 
least one invariant subspace of dimension 1 or 2 (which is, of course, non-trivial if 
n > 2). 


Proof. Let v be a critical point of the Rayleigh quotient function of A, that is, 
VR(v) = 0. (By Lemmas 1 and 2, we know that such points exist—for instance, the 
maxima and minima of R.) Then A(v) + A*(v) — 2R(v)v = 0. This is a linear 
dependence among the three vectors A(v), A*(v) =A7~'(v), and v, so the 
subspace they span has dimension 1 or 2, and is invariant. D 


Remarks. (1) When A is orthogonal, |R| < 1. If |R(v)| = 1, then A(v) = +0, 
SO UV spans an invariant line. If the maximum value of |R| is less than 1, then A has 
an invariant plane but no invariant line. (2) Suppose that, rather than orthogonal, 
A is symmetric. Then A(v) + A*(v) — 2R(v)v = 0 is a linear dependence be- 
tween the two vectors A(v) = A*(v) and v. We see that, if v is a critical point of 
R, then A(v) = R(v)v, so the effect of A is simply to multiply v by a scalar; 
conversely, if A(v) = tv for some v # 0, then R(v) = ¢. In other words, when 4 is 
symmetric, the critical points of R are exactly the eigenvectors of A (and the 
critical values of R are its eigenvalues). This is the original, classic application of 
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Rayleigh quotients (see [3]) and might be a good geometric motivation for the 
study of eigenvectors fairly early in a course. 


5. The structure of an orthogonal transformation. In this section we apply 
Theorem 1 to develop some geometric understanding of orthogonal transforma- 
tions. 

Let A be an orthogonal transformation of E,, n > 0. As a first step, we state 
the following obvious property of invariant subspaces for A. 


LEMMA 3. If F is an invariant subspace, then its orthogonal complement F * is 
also invariant, and A|F, the restriction of A to F, is an orthogonal transformation 
of F. 


The following terminology is convenient. Call the plane P a rotation plane (for 
A) if P is invariant and A|P is some rotation about the origin, possibly the 
identity; call P a reflection plane if P is invariant and A|P is reflection across 
some line. Every invariant plane is either a rotation plane or a reflection plane. 


THEOREM 2. If n > 3, then A has a rotation plane. 


Proof. If n = 3, then A has an invariant plane P (by Lemma 3 and Theorem 1). 
If P is not a rotation plane, then let MCP be the line such that A|P is a 
reflection across M. Either A|P~- is the identity or it is a reflection across the 
origin. In the first case, the span QO of M and P~ isa rotation plane (4|Q is the 
identity); in the second case, M~ is a rotation plane (A|M~ is rotation by 7). 

If n = 4, then (by Theorem 1) either A has an invariant line L or A has an 
invariant plane P. In the first case, the Euclidean 3-space L~ is invariant (by 
Lemma 3), and then, by the preceding paragraph, A|L~* has a rotation plane. In 
the second case, either P is a rotation plane, or P~ is a rotation plane, or both are 
reflection planes—but then the span of the reflection lines in P and P* isa 
rotation plane. 

Finally, if n > 5, then A has an invariant subspace F of dimension at least 3 (by 
Theorem 1 and Lemma 3). By induction, A|F has a rotation plane. 0 


CoROLLaRY. If n = 2k + 1 (resp., 2k + 2) is odd (resp., even), then E,, is the 
orthogonal direct sum of k rotation planes P,,..., P,, and an invariant line L (resp., 
an invariant plane P,). 


Thus, for example, in E, every orthogonal transformation with positive determi- 
nant is a rotation around some line. 


CoROLLARY. Any orthogonal transformation A of E,, can be expressed as a 
composition of n or fewer reflections in hyperplanes. 


Both corollaries are easily proved by induction. 
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Using the property of comparability of conforming sets that we have just proved, 
we now observe that if A is a conforming subset of X and x € A, then whenever 
y <x, either y € A or y does not belong to any conforming set. It now follows 
easily that the union U of all the conforming subsets of X is conforming, and we 
deduce from this fact that if x = f(U), then the set U U {x} is conforming. 
Therefore, x € U, contradicting the fact that x is a strict upper bound of U. 
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After seeing Liouville’s theorem on the approximation of real algebraic num- 
bers by rationals, it is natural to ask whether the converse is true. That is, if there 
are only finitely many rational numbers p/q such that |a — p/q| < c/q* for some 
positive integer k and constant c, can we conclude that a is algebraic? There is a 
simple counterexample that isn’t mentioned in the standard texts [1, 2, 3, 4]. 

If A = {[ay, a,,a5,...]; a; = 1 or 2} is the set of numbers whose continued 
fraction representation contains only 1s and 2s then A is uncountable and so must 
contain transcendental numbers. However if a = [ao, a, a5,...] © A then @ has 
no close rational approximation. To see this let p,,/q,, be the mth convergent. For 
any simple continued fraction we have 


1 


1 
24m Im+1- 


But G41 = 4m + Qm-1 OT Am+1 = 24m + Um—1 Since a, = 1 or 2 so, for the 
numbers under consideration, we have q,,,, < 3q,, and so 


1 
6q;, 
Moreover, this inequality holds for any rational number p/q, for if ja — p/q| < 


1/6q* then |a — p/q| < 1/2q’, and this implies that p/q is a convergent; see 
[4, p. 219]. 


re eee 


Am+1 Am 
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Algebra and analysis in the p-adic integers provide us with examples of 
mathematical phenomena that are hard to find in one package elsewhere. Z, is a 
regular local ring; it is uncountable, and the non-archimedean valuation induced 
by its maximal ideal, pZ,, makes it into a compact, totally disconnected topologi- 
cal ring that contains the ordinary integers as a dense subring. How can one 
visualize such a creature? 

Pictures that inspire images of p-adic objects are rare (see the frontispiece of [3] 
for an example). In this note, we show how to represent, for any prime p, the 
elements of Z, as the points on a fractal in R*. The resulting correspondence 
captures many of the important algebraic and topological facets of Z,. 


The pictures. Computer graphics enthusiasts have discovered a wonderful pas- 
time. The idea is to plot an inductively defined (nondeterministic) sequence of 
points {Q,,} in the plane. The sequence depends on an initial set of fixed vertices 
{A,, A;,...,A;,_4}, the initial point Q, in the sequence and a real number e 
between 0 and 1. Suppose that Q, is one of the A; and that Q; has been defined. 
To find Q;,,, pick one of the fixed vertices, say A,, at random (that is, so that the 
probability of choosing any one vertex is 1/k), and let 6 be the distance between 
Q, and A,. Then Q;,, is the point on the segment between Q; and A, at a 
distance of e6 from A,. If k = 3, e = 5, and the A, are vertices of an equilateral 
triangle, then the Q, lie on a Sierpifski triangle (see [1] for the definition of 
Sierpinski triangles and Figure 1 for a picture of one). 

We'll be concerned with the figures that occur when k = p (a prime > 3), 
e = 1/p, and the A, are vertices of a regular p-gon of side length 1. Call such a set 
of points S,; a Macintosh computer running for a few minutes produced the 
approximations to §, in Figures 2a and 2b. 

Figure 2a shows the process after 50 and 100 iterations; Figure 2b shows the 
results of 500 and 1000 iterations. Figure 2 shows the general shape of $3; our 
definition of S$, implies that it is the attractor for a system of three mappings on R? 
(each mapping is a contraction with magnitude 1/3 toward one of the A,).' 

A deterministic method to obtain S, is to approximate it with sets S,, ,. 
Suppose, for example, that p = 3. Then S35, °3 ,, 83, $3. and $3 , are depicted 
in Figure 3. Here, $3 ,, contains 3” triangular regions. It is constructed from S$; ,,_, 
by replacing each triangular region T in S;,,_, with three small triangles, one 
from each corner of 7, each with side-length 1/3 that of 7. S, is then the 
intersection of all the S, ,,. Similarly, S,, is the intersection of all the S,, ,,. A few of 
the S, ,, are illustrated in Figure 4. 

Using this method for constructing S, shows that S,, as a subset of R?, is 
compact and totally disconnected. 


*Work supported by NSF Grant MDR-895-4647 
‘To generate the images in Figure 2, the initial point Q, is chosen to be one of the vertices 


{A,, A;,A>}; if Qo is not a vertex, the resulting sequence {Q,,} will converge to $3 in the sense that 
lim, 5 (infp < 5,dist(P, Q,)) = 0. 
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Ss, 0 Ss, 1 Ss, 2 


Fic. 4. 


p-adic integers. For precise definitions of Z ,, see [3], [4] or [2]. We'll think of Z > 
as the completion of Z with respect to the absolute value | |,. Here, 


In|, = pro) 


where ord ,(n) is the highest exponent to which p divides n. The idea is that two 
integers are close if their difference is 0 modulo a high power of p. The 
completion contains the subring of @ known as the “p-integral’’ numbers (rational 
numbers whose denominators aren’t divisible by p), as well as some irrational 
numbers. 

If m is a positive integer, m has a finite base p expansion 


m=m,+m,ptms,p?+m,p?t+:::+m,p’, 


where the m, are integers between 0 and p — 1. This expansion (the p-adic 
expansion for m) will be denoted by its digits, and we’ll write 
m=my,m,m,m,;°°: mM 


- 
It’s easy to see that ord pm) is the smallest integer k such that m, > 0. It follows 
that two integers are close if their p-adic expansions agree for many places. In 
particular, the sequence 


1,11,111,1111,11111,... 


is Cauchy, and its limit in Z, can be calculated from the usual formula for the limit 
of a convergent geometric series 


1111111--- =1l+pt+p*+p?+pt=-:-- —— 
1—p 
(Notice that the common ratio in this series is p, and |p|, = 1/p.) Since every 
p-adic integer is the limit of some Cauchy sequence of integers, and since the 
p-adic expansions for these integers agree for arbitrarily long initial strings, we can 
think of a p-adic integer as an infinite p-adic expansion, denoted by an infinite 
string of digits 


Aj4,4,0,0, °° =ayta;pta,p*+a;p +apit+:::, 


where each a, is an integer between 0 and p — 1. As with all completions, | |, 
extends to a valuation on Z,, and its value can be calculated by the same formula 
that defines it on Z (ord ,(a) is the smallest integer k such that a, > 0). So, just as 
in Z, two p-adic integers are close if their representations agree for many digits 
(that is, if their difference is 0 modulo a high power of p). An element of Z, is a 
non-negative integer if and only if its digits are eventually 0; an element of Z,, is in 
@ precisely when its digits eventually repeat. 


358 ALBERT A. CUOCO [April 


The arithmetic of Z, is especially simple. Z, has unique factorization, and the 
only prime is p. Every p-adic integer a can be written uniquely as p‘u. Here, 
e = ord,(a), and u is a unit (a p-adic integer whose first digit is not 0). Z,/pZ, is 
isomorphic with Z/pZ. If U is the unit group in Z, and U, is the subgroup of U 
consisting of units whose first digit is 1 (the principal units), then U/U, is 
isomorphic with (Z/pZ)*, the multiplicative group of non-zero elements in Z/pZ 
(a cyclic group of order p — 1). 

Given a p-adic integer a, a fundamental system of neighborhoods for a is the 
sequence 


at+p"Z_.:n=0,1,2,3,...}. 
Pp 


Indeed, given an integer n,Z, splits up into p” disjoint disks of diameter 1/p”, 
namely the cosets in Z,/p”Z,. Two p-adic integers x and y are within 1/p” of 
each other if and only if they belong to the same disk. 

The metric d defined by | |, satisfies a stronger condition than the triangle 
inequality; if a, b and c are in Z,, then 


d(a,b) < max{d(a,c), d(b,c)} 


(equality holds if d(a,c) and d(b, c) are unequal). This non-archimedean property 
of d implies that every triangle is isosceles and that every point interior to a circle 
is its center. 


The correspondence. Naively, a p-adic integer is an object that can be approxi- 
mated modulo p” (that is, within a p-adic distance of 1/p”), for any n, by a 
length n p-adic expansion of an ordinary integer. And, a point on S,, is a point in 
R? which, for any n, is in a connected component of Syn (so that it can be 
approximated within a (Euclidean) distance of 1/p” by a vertex on S,, ,,). So, we 
can build a natural bijection between Z, and S, by setting up, for each n, a 
bijection between Z,/p”Z, and the vertices of S, , that makes the following 
diagram commute 


n 


Z,/p"*'Z, ——_? Z,/v"L, 


| | 


Here, the top row is the usual homomorphism (it amounts to forgetting the last 
digit in the p-adic expansion), and the bottom row collapses each polygon of side 
length 1/p”*' onto the unique vertex of the polygon of side length 1/p” that it 
contains. Taking the inverse limit, we have a correspondence between Z 
and S,.. 

Suppose, for the sake of example, that p = 3. Figure 5 shows how the corre- 
spondence develops. 


Pp 
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1 111 


Ai12 
110 
12 101 121 


10 A102 A122 
100 +120 
011 211 
jo A212 
210 
LN. A. 001 021 201 +»=«-221 
A002 Ao A202 L222 
000 200 220 
Fic. 5. 


Figure 6 shows the skeleton of $3 ,, but the labeled points are the actual 3-adic 
integers (expressed as elements of @) that remain after passing to the limit. 


-3/2 1/2 
12 14 


15 13 
~21/2 15/2 ~17/2 19/2 


—24 -6 —22 -4 


~9/2 3/2 -5/2 7/2 


A LA. LD oh 
~1 
~27/2, 9/2 - 15/2 OL —23/2 13/2 -11/2 05/2 


Aw Ax’ Al Al 


Fic. 6. 


Similarly, Figure 7 shows some elements of Z, as they appear on S, ). 

Figure 8 shows how to chase down the (irrational) 5-adic integer 
0313113111311113111113... in S, by tracking the approximation 0, 03, 031, and 
0313. 
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The most important aspect of our construction is: Two p-adic integers are in a 
disk of diameter 1/p” if and only if their images in S,, are in the same connected 
component of S,, ». 


0313 
031 


03 


Fic. 8. 
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Using the correspondence. Looking at Figure 6, for example, the following facts 
about Z, stand out: 

1. Z; iS compact. 

2. Every point in Z, has a neighborhood that is both open and closed; Z, is 
totally disconnected. 

3. The image of the filtration {3”Z,:n = 0,1,2,3,...} in S$, is contained in the 
set of triangles that contain 0 and have sides of length (1/3”) (n = 0,1,2,...). 

4. For any n, there are precisely 3” disks of diameter 1/3”. These are the 
translates of 3”Z, by 0,1,2,...,3”7'. 

Figure 9 suggests that Z,/pZ,, identifies with Z/pZ, and that U/U, identifies 
with (Z/pZ)*. 


4+ 5Z,; 4U, 
Fic. 9. 


One can even use S,, to give examples of the non-archimedean geometry of Z,,. 
For example, S; can be used to illustrate the isosceles triangle property in Z.. 
Suppose that we have three points A, B, and C in Z, that are not the vertices of 
an equilateral triangle. Suppose further that 


max{d(A, B),d(B,C),d(A,C)} = 5 
and that this maximum distance is between A and C. Then one pair of points, say, 
A and B, are separated by a distance of at most 1/5”*1. Using our correspon- 
dence, the situation is as in Figure 10. 

In Figure 10, we have a disk of diameter 1/5” in Z, that contains A, B, and C. 
This disk is the union of 5 disjoint disks, each of diameter 1/5"*'!. Two elements 
of the large disk are in the same small disk if and only if the distance between 
them is at most 1/5”*', while two 5-adic integers from the large disk are in 
different small disks if and only if the distance between them is exactly 1/5”. It 
follows that A and B are in the same small disk, and that C is in a different small 
disk. Therefore, 


d(A,C) =d(B,C) = 5 


and our triangle is isosceles. 
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Conversely, the arithmetic of Z, might be a useful tool in the study of the 
geometry of S,. For example, suppose that c is a fixed integer between 1 and 
p — 1. The fact that the neighborhoods of c are just the translates by c of the 
neighborhoods of 0 can be used to picture the effect of the mapping x ~ c+ x on 
S,- Multiplication by c is a one-to-one mapping of Z, onto itself, and there is an 
algorithm (much like the multiplication algorithm that is taught in elementary 
school) for obtaining the digits of ca from those of a. Given a p-adic integer a and 
its location in S,, can this algorithm be transported to a simple geometric 
construction that allows one to locate ca in S,? 


Remark. In our definition of the p-adic absolute value 


1 ord ,(a) 
lalp=|{— ; 
*  \ p 


our choice of the “uniformizing parameter” 1/p is somewhat arbitrary. Any real 
number less that 1 will give an equivalent metric, although our correspondence 
requires a parameter less than 0.5. In the case p = 2, it is sometimes convenient to 


take 1/4, so that 
1 ord4(a) 
lal. = (= . 


4 


Then it is possible to set up our correspondence between Z, and the Cantor-like 
subset of the unit interval obtained by repeatedly removing the middle half. 


p-adic numbers. Because Z,, is an integral domain, it has a quotient field Q,,, 
the field of p-adic numbers. Q, is locally compact, and it contains Z, as an open 
subring. If | |, is viewed as a function on Q (by defining the absolute value of a 
quotient as the quotient of the absolute values), Q, is the completion of Q with 
respect to | |,. We end by sketching a method for extending our embedding of Z D 
into R* to an embedding of Q, into R’. 
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One way to think about a p-adic number is as a p-adic expansion that contains 
a finite number of terms with negative exponents. If a is the p-adic number 


oO 


a= » a, D*, 
=—N 


we'll denote a by its digits 
a= A_nA_nay ne A_1,.A,j@,a, eoeee 


We can write @,, as the increasing union of the sets Q, ,, where 


Q,,={@ € Q,:lal, <p". 
Then 
Q,0 = Z, = {(O.ayaja, °**:0 <a; <p — J}, 
Q,1 = (4_1-4 aa, -°°:0 <a; <p — I}, 
and, more generally, 
Onn = {An °° ° A_4.094,a, °°°:0 <a; <p — I}. 


Geometrically, we can replicate S,, as in Figure 11 (for p = 3) to obtain a frac- 
tal S. .. 
Dp,@ 


Fic. 11. 
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11.2 
11.0 


10.1 12.1 


10.2 12.2 
10.0 12.0 


1.1 


1.2 
1.0 


0.1 2.1 


0.2 2.2 
0.0 2.0 


Fic. 12. 


20.0 


20.1 


21.1 


21.0 


The bijection between @, and S,, . is shown in Figure 12 for the case p = 3. 


Special thanks to E. Paul Goldenberg. 
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Editor’s note: Several readers have pointed out that the argument presented in 
Deng Bo’s article “The derivation of the Maclaurin series for sine and cosine” in 
November 1990 issue of the MONTHLY is well known. It appears in some calculus 
books, including those by Ivan Niven and George Simmons. Niven points out that 
the argument appears in the 1941 book What is Mathematics?, by R. Courant and 
H. Robbins, but was probably well known several decades earlier. Finally, Niven’s 
book (Calculus: An Introductory Approach, 2d edition, Von Nostrand, New York, 
1966, pp. 140, 160, 167, 171) shows how the logarithmic, exponential, and arctan- 


gent series can all be given similar derivations. 
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us call an integer good if its odd part is less than or equal to n. There are 
\(n + 1)/2] odd integers in [1, n] and, if k is any one of them, then it is easily seen 
that n < 2"k < 2”n for exactly m integers r. Therefore the number of good 
integers in the interval (n,2’"n] is m|(n + 1)/2]. It follows that the set S contains 
at least 


n+1 n+1 


2 


good integers. Now, by the pigeonhole principle there must exist an odd k 
(1 <k <n) such that S contains at least m + 1 integers with odd part k. If these 
integers are arranged in increasing order it is obvious that each of them divides the 
next one. (Moreover, all the quotients are powers of 2.) 

The example {n + 1,n + 2,...,2’"n} shows that the assertion of the problem 
need not hold for sets of cardinality (2” — 1)n; for if ay, a,,...,a,, are distinct 
positive integers with a;_,la,; for i = 1,2,...,m, then clearly a,, > 2’"ao. 


("= 1)n +1 = {2% —n—m 


Editorial comment. Most other solvers also used the pigeonhole principle. The 
proposer proved the result by induction on n, P. G. Walsh and D. Vuignier proved 
the result by induction on m, while John Steinig gave two inductive proofs, one by 
induction on m and one by induction on n. 

Several solvers mentioned the apparently more general result: If N > 2’"n and 
if T is a subset of {1,2,..., N} of cardinality N — n + 1, then T contains m + 1 
distinct integers a),a@,,...,a,, such that a;_,|a; for i = 1,2,...,m. This follows 
by applying the result of the problem to 


S=TN {1,2,3,...,2n}. 


Solved also by J. C. Binz (Switzerland), D. Callan, R. J. Chapman (England), L. E. Clarke 
(England), M. E. Kuczma (Poland), P. S. Livingston, O. P. Lossers (The Netherlands), D. E. Manes, A. 
Nijenhuis, E. R. Scheinerman, A. J. Schwenk, J. H. Steelman, J. Steinig (Switzerland), P. G. Walsh & 
D. Vuignier (Canada), A. Zulauf (New Zealand), and the proposer. 


Sparse Subsets of Z mod 4n 


E 3328 [1989, 446]. Proposed by Bruce K. Smith, San Rafael, CA and James G. 
Propp, Berkeley, CA. 


Suppose 7 is a positive integer greater than one. Call a subset S§ of Zmod4n 
sparse if for every k in Zmod n the following two conditions are satisfied: 

(a) SN {4k — 2,4k — 1,4k,4k + 1,4k + 2} has at most two elements, 

(b) SA {4k + 1,4k + 2,4k + 3} has at most one element. 

Prove that Z mod 4n has 7” sparse subsets. 


Solution by Raphael M. Robinson, University of California, Berkeley. Call a 
subset S’ of Z mod 4n spare if for every k in Z mod n the following two conditions 
are Satisfied: 

(a') S' 9 {4k — 1,4k,4k + 1} has at most two elements, 

(b’) S' NM {4k + 2} is empty. 

Since for each k in Zmodn there are precisely 7 choices for S’ N {4k — 1, 
4k,4k + 1}, and the choices are independent for different values of k, there are 
precisely 7” spare subsets of Z mod 4n. 
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We establish a one-to-one correspondence between sparse subsets § and spare 
subsets S’ as follows. Given a sparse subset S, we construct a spare subset S$’ by 
replacing each element in S of the form 4k + 2 by both 4k + 1 and 4k + 3, which 
were absent in S. Then the resulting S’ will be spare, since the count is the same in 
(a') as in (a). (We need n > 1 to make 4k — 2 and 4k + 2 distinct.) Given a spare 
subset S’, we construct a sparse subset S by replacing each occurrence of both 
4k +1 and 4k + 3 in S’ by 4k + 2. The resulting set will be sparse, since the 
count in (a) is the same as the count in (a’), and the count in (b) is reduced to one. 
If at most one of the numbers 4k + 1 and 4k + 3 is in S’, then (b) is satisfied 
without any change being made. It is easy to check that for n > 1 these two 
constructions give the desired one-to-one correspondence. 


Solved also by J. C. Binz (Switzerland), R. J. Hendel, A. A. Jagers (The Netherlands), I. Kozma 
(Israel), M. E. Kuczma (Poland), O. P. Lossers (The Netherlands), L. F. Martins (student), A. 
Nijenhuis, J. H. Steelman, and the proposers. 


Schroder Numbers Divisible by 3- 


E 3343 [1989, 734]. Proposed by Lou Shapiro, Howard University, Washington, 
DC and Douglas Rogers, University of East Anglia, U.K. 


Let S, be the number of polygonal paths in the plane that start at (0,0), end at 
(n,n), contain no points above the line y = x, and are composed of steps taken 
from the set {(0, 1), (1, 0), (1, D}. For example, S, = 1,5, = 2,8, = 6,8, = 22,8, 
= 90. (The numbers in the sequence {S,} are often called the Schrdder numbers.) 

Prove that S, is divisible by 3 for every positive even integer n. 


Solution by Carl Schoen and Paolo Ranaldi, University of Wisconsin, Eau Claire. 
Let A, be the number of such paths whose penultimate vertex on the diagonal 
y =x is(k,k). Then S, = £%_).A,. Then A, is the number of paths from (1, 0) to 
(n,n — 1) containing no points above the diagonal connecting (1,0) to (n,n — 1). 
Hence A, = S,_,. For k = 1,2,...,n — 2, A, is the number of paths from (0, 0) 
to (k, k) multiplied by the number of paths from (k + 1, k) to (n,n — 1) contain- 
ing no points above the diagonal connecting (kK + 1,k) to (n,n — 1). Hence, 
A, = S,S,_,—,- Finally, A,_, is the number of paths from (0,0) to (mn — 1,” — 1) 
multiplied by the number of paths from (m — 1,n — 1) to (n,n), or A,_, = 2S,_}. 
It follows that S, = 3S,_, + L%27S,S,_1-;- 

Assume S, is divisible by 3 for all even k <n. If n is even, then each term in 
the above sum is divisible by 3, since 3|$, when k is even and 3|S, _,_, when k is 
odd. It follows by induction that 3|S, for all even n. 


Solved also by the proposers and 37 other readers. 


Sum-free Sets Modulo n 
E 3346 [1989, 735]. Proposed by Dean S. Clark, University of Rhode Island, 
Kingston. 


Call a subset T of Z mod n sum-free if the sum of two distinct elements of T is 
not in T. Let s(n) be the maximum cardinality of a sum-free subset of Z mod n. 
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(a) Prove that s(n) = n/2 if n is even. 
(b) If n is odd, prove that s(n) > |n/3] + 1. 
(c)* If n is odd, is s(n) = |n /3] + 1? 


Solution by Torleiv Klgve, Bergen, Norway. (a) If n is even, the set of odd 
integers is a sum-free set of size n/2. On the other hand, if T is sum-free and 
aéT, let S={a+b: b€ T,b # a}. Then S and T are disjoint, which implies 
n > 2|T| — 1, or |T| < (+ 1)/2] = 1/2. 

(b) The set {a: |n/3] < a < 2|n/3]} is sum-free and has |n /3] + 1 elements. 

(c) The answer is “sometimes, but not always.” Let p be a prime such that 
p = —1(mod6), and let n be a multiple of p. Let T be the subset of Zmodn 
consisting of those residue classes a mod n such that a is congruent modulo p to 
one of the (p + 1)/3 integers in the interval [(p + 1)/3,(2p — 1)/3]. If c= 
(a + b\(mod n) for a,b € T, then c = (a + b)\(mod p), which implies c ¢ T; thus 
T is sum-free. Further |T| = n(p + 1)/Gp). If n=p or n= 3p, then 
n(p + 1)/Gp) = |n/3] + 1; in these two cases presumably no larger sum-free set 
exists. However, if n > 5p, then n(p + 1)/(@3p) > |[n/3] + 1. 


Editorial comment. Concerning the correct answer for odd n, five solvers (in- 
cluding Klgve), conjectured that s(n) =|[n/3]+ 1 if n has no prime divisors 
congruent to 5mod 6, but that s(n) = n(p + 1)/@Gp) if p is the smallest prime 
divisor of n congruent to 5 mod 6. KIgve commented that exhaustive search verifies 
this for n < 55. Thus n = 25 is the smallest value of n for which s(n) > |[n/3] + 1. 

This problem requires that the summands be distinct. The variation arising by 
dropping that requirement is considered in H. L. Abbott and E. T. H. Wang, 
“Sum-free sets of integers,” Proc. Am. Math. Soc. 67(1977), 11-16. 


Added in proof. In a manuscript received by the editors after the above material 
was sent to the printer, Umberto Zannier (Venice, Italy) proved the conjecture 
enunciated in the first paragraph of the Editorial comment. He used a theorem of 
Martin Kneser given as Theorem 16 of chapter 1 of H. Halberstam and K. F. Roth, 
Sequences, second edition, Springer-Verlag, 1983. Professor Zannier will publish 
his proof elsewhere. 


Solved also by J. Duemmel, R. B. Eggleton (Brunei), F. Galvin, D. R. Guichard, O. P. Lossers 
(Netherlands), J. H. Nieto (Venezuela), R. B. Richter (Canada), J. H. Steelman, R. Stong, G. 
Torchinelli, B. M. de Weger (Netherlands), J. Zhang (China). Solutions to (a) and (b) only were 
submitted by F. Balogh (Hungary), G. E. Bilodeau, R. J. Hendel, W. Janous (Austria), R. Martin 
(student), and the proposer. 


A Series for e 


E 3352 [1989, 838]. Proposed by G. I. Senum, Brookhaven National Laboratory, 
Upton, Long Island, NY. 


Show that 
e° 1 e 


Lon 


ag Wnt +n? + 1) 2. 
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(a) Prove that s(n) = n/2 if n is even. 
(b) If n is odd, prove that s(n) > |n/3] + 1. 
(c)* If n is odd, is s(n) = |[n/3] + 1? 


Solution by Torleiv Klove, Bergen, Norway. (a) If n is even, the set of odd 
integers is a sum-free set of size n/2. On the other hand, if T is sum-free and 
a €&T, let S={a+b: b€ T,b # a}. Then S and T are disjoint, which implies 
n > 2|T| — 1, or |T| < | + 1)/2| = 7/2. 

(b) The set {a: |[n/3] < a < 2[n/3]} is sum-free and has |n /3] + 1 elements. 

(c) The answer is “sometimes, but not always.” Let p be a prime such that 
p = —1(mod 6), and let n be a multiple of p. Let T be the subset of Zmodn 
consisting of those residue classes a mod n such that a is congruent modulo p to 
one of the (p + 1)/3 integers in the interval [(p + 1)/3,(2p — 1)/3]. If c= 
(a + b\(mod n) for a,b € T, then c = (a + b\(mod p), which implies c ¢ T; thus 
T is sum-free. Further |T| = n(p + 1)/Gp). If n =p or n= 3p, then 
n(p + 1)/@B3p) = |n/3] + 1; in these two cases presumably no larger sum-free set 
exists. However, if n > 5p, then n(p + 1)/@Gp) > [n/3] + 1. 


Editorial comment. Concerning the correct answer for odd n, five solvers (in- 
cluding Klgve), conjectured that s(n) =|n/3]+ 1 if nm has no prime divisors 
congruent to 5mod 6, but that s(n) = n(p + 1)/@Gp) if p is the smallest prime 
divisor of n congruent to 5 mod 6. Kléve commented that exhaustive search verifies 
this for n < 55. Thus n = 25 is the smallest value of n for which s(n) > [n/3] + 1. 

This problem requires that the summands be distinct. The variation arising by 
dropping that requirement is considered in H. L. Abbott and E. T. H. Wang, 
““Sum-free sets of integers,” Proc. Am. Math. Soc. 67(1977), 11-16. 


Added in proof. In a manuscript received by the editors after the above material 
was sent to the printer, Umberto Zannier (Venice, Italy) proved the conjecture 
enunciated in the first paragraph of the Editorial comment. He used a theorem of 
Martin Kneser given as Theorem 16 of chapter 1 of H. Halberstam and K. F. Roth, 
Sequences, second edition, Springer-Verlag, 1983. Professor Zannier will publish 
his proof elsewhere. 


Solved also by J. Duemmel, R. B. Eggleton (Brunei), F. Galvin, D. R. Guichard, O. P. Lossers 
(Netherlands), J. H. Nieto (Venezuela), R. B. Richter (Canada), J. H. Steelman, R. Stong, G. 
Torchinelli, B. M. de Weger (Netherlands), J. Zhang (China). Solutions to (a) and (b) only were 
submitted by F. Balogh (Hungary), G. E. Bilodeau, R. J. Hendel, W. Janous (Austria), R. Martin 
(student), and the proposer. 


A Series for e 


E 3352 [1989, 838]. Proposed by G. I. Senum, Brookhaven National Laboratory, 
Upton, Long Island, NY. 


Show that 


°° 1 e 
L 


ag Aint +n? + 1) 2. 
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Prime Factors of 1 + 9° 


E 3355 [1989, 927]. Proposed by Leo J. Alex, State University of New York, 
Oneonta. 


Characterize those positive integers a such that the prime factors of 1 + 9% are 

all less than 40. Some useful factorizations: 
1+ 9*=2-17- 193, 1+9°=2-57- 1181, 
1+9°=2- 41 - 6481, 1+ 97=2-5-29- 16493. 

Solution (independently) by Hugh Noland, Chico, CA, by John Henry Steelman, 
Indiana University of Pennsylvania, Indiana, PA, and by Richard Stong, UCLA. The 
only such positive integer is 1. If p is an odd prime dividing 1 + 9%, then —1 isa 
quadratic residue of p (since 9 is a square), and hence p = 1 (mod 4). The only 
such primes less than 40 are {5, 13, 17, 29, 37}. Modulo 13, the powers of 9 are 
{1, 3,9}, and modulo 37 they are {1,7, 9, 10, 12, 16, 26, 33, 34} so neither of these 
primes ever divides 1 + 9%. Hence the only possible prime factors less than 40 are 
{2, 5,17, 29}. 

Consider the powers of 9 modulo 17, 25,29. If 17 divides 1 + 9%, then a = 4 
(mod 8) and hence 9% is an odd power of a number 9% that is congruent to —1 
(mod 193), as given above, so 193 divides 1 + 9%. Similarly, if 25 divides 1 + 9%, 
then a = 5 (mod 10) and 1181 also divides 1 + 9%, and if 29 divides 1 + 9%, then 
a = 7 (mod 14) and 16493 also divides 1 + 9%. Finally, since 9 = 1 (mod 4), we 
have 1 + 9% = 2 (mod 4), so 4 never divides 1 + 9°. Hence if all prime factors of 
1 + 9% are less than 40, then we have at most one power of 2, at most one power 
of 5, and no powers of 17 or 29, so 1 + 9% = 10. 


Editorial comment. With a little extra work, B. M. M. de Weger showed that 
only for a = 1,2, 3,4 are all the prime divisors of 1 + 9% less than 200. 


Twenty-five correct solutions and one incorrect solution were received. 


Limit of a Reciprocal Sequence 


E 3356 [1989, 927]. Proposed by Wu Wei Chao, Jinhua City, Zhejiang, China. 


Suppose a, = 1 and a,,, =n/a, for n = 1,2,.... Evaluate 


Solution dedicated to the late Michael J. Dixon by his colleagues in the Chico 
Problem Group, California State University, Chico. We show that the limit is 
y2/7 + y¥7/2. Using the recurrence, we have a,,,°a, =n and a,:a,_,=n 
— 1. Subtracting these yields a,,, — a,_, = a,', so the sum of reciprocals tele- 
scopes to yield ay' + --- +a>'=a,,, +4, — a, — a). Hence 


aj} +--+ +az! (Ae a | 


= lim | —— + = 
vn van 


1 
n-o Vn n—-oo ( ) 
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Since a,,, =n/a, =a,_,n/(n — 1), we can iterate to obtain 


(2n + 1)! 2°"(n!1)* 


Qon+2 — 22"(n!)? ) Qo,41 = ~(2n)! —. 


By Stirling’s formula, we have a,,,, ~~ ¥2/m7 -v2n and a,,,, ~ Vw/2°:v2n. 
Hence the two contributions to the right-hand side of (1) approach y¥2/m and 


yr/2. 


Fifty-seven correct solutions and one incorrect solution were received. 


ADVANCED PROBLEMS 


6655. Proposed by Paul Erdos, Hungarian Academy of Sciences, Budapest. 


Given a positive integer n, we wish to find distinct integers greater than n such 
that their product with n is a square and we wish to do this in such a way that the 
largest number used is as small as possible. Let f(n) be this minimal value of the 
largest number used. For example f(10) = 18, since 10 - 12 - 15 - 18 = (180)? and 
the product of 10 with any subset of {11, 12, 13, 14, 15, 16, 17} is not a square. 


Show that if ¢,, €5, €3,... 1S any sequence of positive numbers tending to zero, 
then the set 


{n: f(n) —n>n'~*} 


has density zero. 


6656. Proposed by Dragomir Z. Dokovic, University of Waterloo, Ontario. 


(i) If P(z) is a non-zero polynomial over C with at least two distinct zeros, prove 
that 1/P(z) has at least one non-zero residue (or equivalently 1/P(z) is not the 
derivative of a rational function). 

(ii) Prove or disprove the following more general assertion: If f and g are 
rational functions over C such that f'(z)g(z) = 1, then f(z) = c(z — a)* for 
some complex constants c,a and some integer k # +1. 


6657. Proposed by J. J. Rotman and P. M. Weichsel, University of Illinois at 
Urbana-Champaign. 


Suppose we make a finite group G into a graph I by defining two elements a, b 
of G to be adjacent if (ab~')* # e. If the elements of G are v,,V,,...,U,, let A 
be the 1 by n matrix in which the element in the 7th row and jth column is 1 if v, 
and vu, are adjacent and is 0 if v; and v, are not adjacent. (A is an adjacency matrix 
for T.) 

Show that det A is an even integer. 
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Since a,,, =n/a, =a, —,n/(n — 1), we can iterate to obtain 


(2n + 1)! 27"(n!)? 


Q2n+2 — 2"(nly> , Qon+1 = “(n)!_ ; 


By Stirling’s formula, we have a,,,,~ 2/7 - V2n and A>,.,~ Vr/2-V2n. 
Hence the two contributions to the right-hand side of (1) approach y2/m and 


yr/2. 


Fifty-seven correct solutions and one incorrect solution were received. 


ADVANCED PROBLEMS 


6655. Proposed by Paul Erdos, Hungarian Academy of Sciences, Budapest. 


Given a positive integer n, we wish to find distinct integers greater than n such 
that their product with n is a square and we wish to do this in such a way that the 
largest number used is as small as possible. Let f(n) be this minimal value of the 
largest number used. For example f(10) = 18, since 10- 12: 15 - 18 = (180)? and 
the product of 10 with any subset of {11, 12, 13, 14, 15, 16, 17} is not a square. 


Show that if ¢,, 65, €3,... iS any sequence of positive numbers tending to zero, 
then the set 


{n: f(n) —n > ni~*r} 


has density zero. 


6656. Proposed by Dragomir Z. Dokovié, University of Waterloo, Ontario. 


(i) If P(z) is a non-zero polynomial over C with at least two distinct zeros, prove 
that 1/P(z) has at least one non-zero residue (or equivalently 1/P(z) is not the 
derivative of a rational function). 

(ii) Prove or disprove the following more general assertion: If f and g are 
rational functions over C such that f'(z)g(z) = 1, then f(z) = c(z — a)* for 
some complex constants c,a and some integer k # +1. 


6657. Proposed by J. J. Rotman and P. M. Weichsel, University of Illinois at 
Urbana-Champaign. 


Suppose we make a finite group G into a graph I’ by defining two elements a, b 
of G to be adjacent if (ab~')* # e. If the elements of G are v,, UV ,...,U,, let A 
be the n by n matrix in which the element in the ith row and jth column is 1 if vu, 
and v, are adjacent and is 0 if v; and vu, are not adjacent. CA is an adjacency matrix 
for T.) 

Show that det A is an even integer. 
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SOLUTIONS OF ADVANCED PROBLEMS 
Poles of Green’s Function 


6513 [1986, 218]. Proposed by Maurice Machover, St. John’s University, 
Jamaica, NY. 


(a) Show that Green’s function for the operator Lu = u", with any boundary 
conditions, has no pole of order higher than three. 

(b) Devise boundary conditions (the simpler the better) for which there is a 
third order pole. 


Solution by the proposer. We consider the Green’s function for u” + A*u on 
[0, +] with fixed boundary conditions, and examine its properties as a meromorphic 
function of A* at A” = 0. That is, for ¢ € [0, 7] and fixed A € C we consider the 
function G(x, t, A), if indeed it exists, with the following properties: It satisfies the 
differential equation 


G,,(x,t,A) + WG(x,t,A) = 6(x — ft), 
and the linear boundary conditions 
BG): = a,,G(0,t,A) + a,G,(0,t,A) + a3G(7,t,A) + 4,,G,(7,t,rA) = 0, 
B,(G): = a,,G(0,t, A) + a,G,(0,t, A) + ay,G(m;t, A) + a,,G,(7,t,rA) = 0, 
where a,,,...,@, are constants and B,, B, the corresponding operators. We 
make the standard assumption that the matrix (a,,) has rank two. Here d(x — £) 
denotes the usual delta function. 
Now u, =u,(x,t, A) = H(x — t)A~' sin A(x — t), where A(x) is the usual 


Heaviside function, is a particular solution of the differential equation. Hence 
Green’s function has the form 


G(x,t,A) =a(t,A)cosAx + b(t, A)sinAx + u,(x,¢t,A). 
To determine a = a(t, A) and b = b(t, A) note that 
aB,(cos Ax) + bB,(sinAx) + Bu, = 0 
aB,(cos Ax) + bB,(sin Ax) + Bu, = 0, 
so by Cramer’s rule a = A,(t, A)/A(QA) and b = A(t, A)/A(A), where 


B,cosAx) B,(sin Ax) 


ACA) = B,(cosAx) B,(sin Ax) }’ 


and similar formulas give A, and A,. Note that uw, is analytic in A at A = 0, so any 


pole at A = 0 must arise from a zero of A(A) at A = 0. 
For part (a) observe that G cannot exist if ACA) vanishes identically. Write 


A(A) = Ved. 
i=0 


The existence of a pole of order at least four in A* implies that c,; = 0 for i < 7. 
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and 


Ar 


A,(t,A) = |1 4+ cosAm — 15 sin ae |[ 230" sin A( 7 — t) | 


TT 
+ [At sin A(a —t) + 75 008 AC — 1) [22 + 23 cos Ar]. 


Since A,(O) = 0 we have N(O) = 0. Furthermore, as A — 0 the first term of N(A) 
iS 


K,A + O(A’) 
where K, # 0 is independent of x, while the second term of N(A) is 
K,A + O(A’), 


where 


K, = 2x[23(t — 7)| + 45x] —t+ =| = (t+27)x. 


Hence the zero here is of first order, and the result follows. 


Editorial comment. The proposer evidently intended the underlying space to be 
an interval and the boundary conditions to be linear in the function and its first 
derivative. Two useful references here are B. Friedman, Principles and Techniques 
of Applied Mathematics, Wiley, New York, 1956, pp. 172-174, and 
E. A. Coddington and N. Levinson, Theory of Ordinary Differential Equations, 
McGraw Hill, New York, 1955, pp. 188-193. The proposer remarks that for higher 
order ordinary differential equations, nothing seems to be known about the 
maximum order of the poles of the Green’s function. In fact, aside from this 
problem, the literature seems to stop at second-order poles. The proposer thanks 
Joseph B. Keller and Samuel N. Karp for valuable discussions, and a referee for 
suggesting improvements in the problem’s formulation. 


Solved also by Clark Givens. 


Square Clusters of Primitive Lattice Points 


6568 [1988, 264]. Proposed by Itshak Borosh and Douglas Hensley, Texas A & M 
University. 


A lattice point (s,t) € N X N is called primitive if the greatest common divisor 
of s and ¢ is 1. It is known that the density of primitive lattice points is 
6/7” = 0.6079... (Hardy and Wright, Theory of Numbers, Theorem 459). Show 
that the density of lattice points (a,b) © N X N for which each of the eight 
neighbors (a + 1, b),(a, b + 1),(a + 1, b + 1) is primitive is 


1 
veut — 8/p’) = 0.012..., 
where the product is taken over all odd primes p. 


Composite solution by the University of South Alabama Problem Group, Mobile, 
and the proposers. Let us call a lattice point (a,b) € N X N having the property 
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and 


Ar 


A,(t,A) = |1+ cosAr — 15 sin ae |[ 230" sin A( ar — t) | 


7 
15 


Since A,(O) = 0 we have N(O) = 0. Furthermore, as A — 0 the first term of N(A) 
iS 


+ [At sin AC = 0) + cos A( ~ 1) [22 + 230s Arr}. 


K,A + O(A’) 
where K, # 0 is independent of x, while the second term of N(A) is 
K,A + O(A’), 


where 


K, = 2x[23(t — 7)] + 45x] —t+ | = (t+ 27)x. 


Hence the zero here is of first order, and the result follows. 


Editorial comment. The proposer evidently intended the underlying space to be 
an interval and the boundary conditions to be linear in the function and its first 
derivative. Two useful references here are B. Friedman, Principles and Techniques 
of Applied Mathematics, Wiley, New York, 1956, pp. 172-174, and 
E. A. Coddington and N. Levinson, Theory of Ordinary Differential Equations, 
McGraw Hill, New York, 1955, pp. 188-193. The proposer remarks that for higher 
order ordinary differential equations, nothing seems to be known about the 
maximum order of the poles of the Green’s function. In fact, aside from this 
problem, the literature seems to stop at second-order poles. The proposer thanks 
Joseph B. Keller and Samuel N. Karp for valuable discussions, and a referee for 
suggesting improvements in the problem’s formulation. 


Solved also by Clark Givens. 


Square Clusters of Primitive Lattice Points 


6568 [1988, 264]. Proposed by Itshak Borosh and Douglas Hensley, Texas A & M 
University. 


A lattice point (s,t) € N X N is called primitive if the greatest common divisor 
of s and ¢ is 1. It is known that the density of primitive lattice points is 
6/77? = 0.6079... (Hardy and Wright, Theory of Numbers, Theorem 459). Show 
that the density of lattice points (a,b) € N X N for which each of the eight 
neighbors (a + 1, b),(a,b + 1),(a + 1, b + 1) is primitive is 


1 
1 [1a — 8/p") = 0.012..., 
where the product is taken over all odd primes p. 


Composite solution by the University of South Alabama Problem Group, Mobile, 
and the proposers. Let us call a lattice point (a,b) € N X N having the property 
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Combining (2), (3), (4), (5), and (6), we obtain (for x > Pp) 


1 
Rog) (7) 


If we now take R = > log x, then x = e?* > Pp aS required, since by a familiar 
elementary estimate 


x? 8 
ats) =" T(t 5 


p>2 


+ O(xPp) + O 


7 
log Pr < Rlog4 < 5k. 


Since xPp < xe’*®/? = x!’/!. our result (1) follows from (7). 
In a similar way we may obtain an analogous result (with x7/4 replaced by 
ax*/16) for the number of nearprim points (a,b) € N X N such that a? + b? < 


x’. 


Solved also by Benjamin Max and by Matthias Winter (Germany). 


Rings and Functors 


6595 [1989, 264]. Proposed by P. A. Griffith and J. J. Rotman, University of 
Illinois at Urbana-Champaign. 


An R-module E is injective if it is a direct summand of any module containing 
it. Every module can be imbedded in an injective module. The injective envelope 
E(M) is an injective module containing M such that there is no injective module 
E' with MC E' C E(M) and E' # ECM). The injective envelope E(M) exists and 
is unique to isomorphism. 

(i) A functor T is called additive if, for every pair of R-modules A and B, we 
have T(f + g) = T(f) + T(g) whenever f, g © Hom,(A, B). Show that there is 
a ring R for which there is no additive functor T: R-mod — R-mod with T(M) = 
E(M). 

(ii) Show that there is a ring R for which there is no functor T: R-mod — R-mod 
with T(M) = E(M). 

Gii)* Find all rings R for which there exists a functor with the property 
mentioned in (ii). 


Solution by Fred Richman, Florida Atlantic University, Boca Raton. We show 
that the ring Z of integers satisfies (i) and (ii); modules over Z are just abelian 
groups. Let Z, denote the cyclic group of order p. Let Z,» be the p-primary 
component of Q/Z, and observe that E(Z » = Z,». It suffices to show that there ts 
no functor T: Z-mod > Z-mod with T(Z,) = Z,». 

Clearly such a functor cannot be additive on endomorphisms of Z,, because the 
identity map on Z,, is of additive order p, while the identity map on Z,» is of 
infinite additive order. We will show that JT must be additive on endomorphisms of 
Z,,, thereby reaching a contradiction. 

It suffices to show that T preserves the direct sum diagram for Z, @ Z, (1; 
Prop. 9.5] or [2; Prop. 4, page 193]); that is, if 4,, 65, 7, and 7r, are the injection 
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and projection maps for that diagram, then 
(1) T(7,)T(e,) = T(7,)T(e2) = 1, 
(2) T(72)T(4,) = T(7,)T(2) = 9, 
(3) T(u,)T(7,) + T(e,)T(72) = 1. 


Condition (1) is immediate from the definition of a functor. As E(O) = 0, it follows 
that T7(f)=0 if f=0, so condition (2) holds. Now let a = TW,)T(7,) + 
T(c,)T(ar,) — 1. Clearly aT(t;) = 0, so, to prove (3), we need only show that the 
images of the maps T(u;) generate T(Z, © Z,). But this is true because those 
images are disjoint copies of Z,» in Z,0 ® Z,». 

The same proof works for the ring of integers modulo 4, with Z, replaced by 
Z,, and Z,» by Z,; this is the smallest ring that satisfies (i) and (ii). 

As a partial response to (iii), we remark that for the polynomial ring R = k[X], 
where k is a perfect field, an additive functor T with T(M) = E(M) can be 
constructed. 

If we rephrase the original question slightly, we can get a positive result that 
eliminates pathological examples. A true functorial injective envelope would 
include a natural embedding of A in T(A). We want a superfunctor T of the 
identity so that if f: A—- B, then T(f): T(A) > T(B) extends f. But the 
following theorem rules out nontrivial cases of that. 


THEOREM. If T is a superfunctor of the identity on the category of R-modules, 
and T(A) is an injective envelope of A for each R-module A, then every R-module is 
injective, and T is the identity. 


Proof. If « is the inclusion of A in ECA), and f: ECA) > E(B) is such that 
fe =0, then T(fe) = T(f)T(@) = 0. But T(r) is an isomorphism because it is 
one-to-one on A, and maps one injective envelope of A into another. Thus 
T(f) = 0, so f = 0, and we have shown that zero is the only map from E(A) to 
E(B) that extends the zero map from A to B. With B = E(A)/A, the natural map 
from E(A) onto B extends the zero map from A to B, hence is zero, so B = 0 
and A is injective. Thus every R-module is injective. 


REFERENCES 


1. P. J. Hilton, and U. Stammbach, A Course in Homological Algebra, Springer-Verlag, 1970. 
2. S. Mac Lane, Categories for the Working Mathematician, Springer-Verlag, 1971. 


Solved also (to the same extent) by R. J. Chapman and the proposers. 


The Additive Group Generated by the Rational Unit Vectors 


6603 [1989, 455]. Proposed by Carl Pomerance, University of Georgia, Athens. 


Let G, denote the set of points in Euclidean n-space which can be reached 
from the origin in a finite number of steps of unit length, where each step lands on 
a point with rational coordinates. In other words G,, is the additive subgroup of Q” 
generated by those points in Q” at distance one from the origin. 

(a) Show that G, = Q” if and only if n > 5S. 

(b) Obtain an arithmetic characterization of G,,G,, G3, G,. 
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and projection maps for that diagram, then 
(1) T(7,)T(4,) = T(72)T(e2) = 1, 
(2) T(72)T(41) = T(7,)T(e2) = 9, 
(3) T(e,)T(7,) + T(e,)T(72) = 1. 


Condition (1) is immediate from the definition of a functor. As E(O) = 0, it follows 
that T(f)=0 if f= 0, so condition (2) holds. Now let a = T(t,)T(7,) + 
T(c,)T(ar,) — 1. Clearly aT(t;) = 0, so, to prove (3), we need only show that the 
images of the maps T(u;) generate T(Z, © Z,). But this is true because those 
images are disjoint copies of Z,» in Z,0 ® Z,». 

The same proof works for the ring of integers modulo 4, with Z, replaced by 
Z,, and Z,. by Z,; this is the smallest ring that satisfies (i) and (ii). 

As a partial response to (iii), we remark that for the polynomial ring R = k[| X], 
where k is a perfect field, an additive functor T with T(M) = E(M) can be 
constructed. 

If we rephrase the original question slightly, we can get a positive result that 
eliminates pathological examples. A true functorial injective envelope would 
include a natural embedding of A in T(A). We want a superfunctor T of the 
identity so that if f: A — B, then T(f): T(A) > T(B) extends f. But the 
following theorem rules out nontrivial cases of that. 


THEOREM. If T is a superfunctor of the identity on the category of R-modules, 
and T(A) is an injective envelope of A for each R-module A, then every R-module is 
injective, and T is the identity. 


Proof. If u is the inclusion of A in E(A), and f: ECA) > E(B) is such that 
fe =0, then T( fe) = T(f)T(@) = 0. But T(t) is an isomorphism because it is 
one-to-one on A, and maps one injective envelope of A into another. Thus 
T(f) = 0, so f = 0, and we have shown that zero is the only map from E(A) to 
E(B) that extends the zero map from A to B. With B = E(A)/A, the natural map 
from E(A) onto B extends the zero map from A to B, hence is zero, so B = 0 
and A is injective. Thus every R-module is injective. 


REFERENCES 


1. P. J. Hilton, and U. Stammbach, A Course in Homological Algebra, Springer-Verlag, 1970. 
2. §. Mac Lane, Categories for the Working Mathematician, Springer-Verlag, 1971. _! 


Solved also (to the same extent) by R. J. Chapman and the proposers. 


The Additive Group Generated by the Rational Unit Vectors 


6603 [1989, 455]. Proposed by Carl Pomerance, University of Georgia, Athens. 


Let G, denote the set of points in Euclidean n-space which can be reached 
from the origin in a finite number of steps of unit length, where each step lands on 
a point with rational coordinates. In other words G,, is the additive subgroup of Q” 
generated by those points in Q” at distance one from the origin. 

(a) Show that G, = Q” if and only if n > 5S. 

(b) Obtain an arithmetic characterization of G,,G,,G,, G. 
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Solution by Yoshio Mimura, Kobe University, Kobe, Japan. Let U,, be the set of 
points in Q” at distance one from the origin. We must determine the additive 
group G,, generated by U,,. If x = (x,,..., x,,) € Q” we define the positive integer 
dnm(x) to be the least common denominator of x,,..., x 

(1) G, = Z. This is clear. 

(2) G, = {x € Q’|dnm(x) is a product of primes = 1 mod 4}. To see this we 
take a point u = (a/t,b/t) € U, with t = dnm(u). Then we have t* = a’ + b? 
with gcd(a, b) = 1. Hence any prime divisor of ¢ is congruent to 1 mod 4, which 
implies that G, is contained in R,, the group on the right-hand side. Let t be a 
product of primes = 1 mod 4. Then we have t* = a” + b? for some integers a and 
b with gcd(a, b) = 1. Since (a/t, + b/t) € U, we see by addition that (2a/t,0) € 
G,. Similarly (2b/t,0) € G,. Since gcd(a, b) = 1, there exist integers u,v such 
that au + bu = 1. Hence 


(2/t,0) = u(2a/t,0) + v(2b/t,0) € G,. 


Similarly (0,2/t) € G,. We may assume without loss of generality that a is odd 
and b is even. Since (a/t, b/t), (2/t,0), and (0, 2/t) all lie in G, and 


(1/t,0) = (a/t,b/t) — |a/2|(2/t,0) — |b/2\(0,2/2), 
it follows that (1/t,0) € G,. Similarly (0,1/t) € G,. Since R, is generated by the 
points (1/1, 0) and (0,1/t), we have R, C G5. 

(3) G; = {x € Q?|dnm(x) is odd}. Take any point u = (a/t, b/t,c/t) € U; 
with ¢ = dnm(u). Then t? = a” + b* + c* with gcd(a, b,c) = 1. Hence ¢ must be 
odd, which implies that G, is contained in R,, the group on the right-hand side. 
For the reverse inclusion it is sufficient to show that (1/t,0,0) € G, for any odd t¢. 
Since t? = 1(mod 8), it is known that t* = a” + b* + c” for some integers a, b and 
c with gcd(a,b,c) = 1. (See, for example, B. A. Venkov, Elementary Number 
Theory, Wolters-Noordhoff, Groningen, 1970, p. 165 or E. Grosswald, Representa- 
tions of Integers as Sums of Squares, Springer, New York, 1985, p. 52. In fact the 
number of integers (a,b,c) such that a” + b* +c’ =t? and gcd(a,b,c) = 1 is 
equal to 


n° 


6rTT {1 — (-1)? Pp}, 
p\t 
where the product is taken over the primes dividing the odd number ¢.) Since 
(a/t,+b/t,+c/t) © U;, we see by addition that (2a/t,0,0) € G,. Similarly 
(2b/t, 0,0) € G, and (2c /t, 0,0) € G,. Since gcd(a, b,c) = 1, there exist integers 
u,v,w with au + bv + cw = 1. Hence 


(2/t,0,0) = u(2a/t,0,0) + v(2b/t,0,0) + w(2c/t,0,0) € G;. 


Similarly (0,2/t,0) € G, and (0,0,2/t) € G;. We may assume without loss of 
generality that a is odd while b and c are even. Since (a/t, b/t, c/t), (2/t, 0, 0), 
(0,2/t,0) and (0,0,2/t) are all in G;, it follows as in (2) that (1/t,0,0) € G;. 
Since similarly (0,1/t,0) € G, and (0,0,1/t) € G,, we have R, C G3. 

(4) G, ={g,¢+ elg © G3}, where G? = {g € O*ldnm(g) is odd} and e = 
(1/2,1/2,1/2,1/2). Take any u = (a/t,b/t,c/t,d/t) € U, with t = dnm(u). 
Then we have t* = a” + b* + c* +d’. If t is odd, then u € G). If t is even, then 
t is divisible by 2 but not by 4, and a, b, c, and d are odd. Hence we have 
u =v +e with v € G2. Therefore G, C {g, g + elg € G{}. For the reverse inclu- 
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sion it is enough to show that G? C G,. If t is odd then 47 — 1 = 3 mod 8. Hence, 
we can write 4¢7 — 1 = a* + b* +c’ for some integers a, b,c. Thus (1/t, 0, 0, 0) 
& G,, because (1/(2t), + a/(2t), + b/(2t), + c/(2t)) © U,. Hence, G? C G,. 

(5) G, = Q” if n>5. It is enough to show that (1/t,0,...,0) © G, for all 
positive integers ¢t. We can write 4¢7-—1=a’+b*+c* +d’ with integers 
a,b,c, d. Then we have (1/(2t), + a/(2t), + b/(2t), + c/(2t), + d/(2t),0,...0) 
e U,. Hence, (1/t,0,...,0) € G,,. 


Editorial comment. The proposer remarked that the assertion of part (a) is 
proved in a paper by K. B. Chilakamarri, Unit-distance graphs in rational n-space, 
Discrete Math., 69 (1988) 213-218 


Solved also by R. J. Chapman (England), F. J. Flanigan, Stephen M. Gagola, Jr., and the proposer. 


REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


Japanese Temple Geometry Problems. By Hidetosi Fukagawa and Dan Pedoe. The 
Charles Babbage Research Centre, P.O. Box 272, St. Norbert Postal Station, 
Winnipeg, Canada, R3V 1L6, 1989. xvi + 206 pp. 


DAN SOKOLOWSKY 
Professor Emeritus, Mathematics, Antioch College, Yellow Springs, OH 45387 


The first time I ever saw a Japanese temple geometry problem was some years 
ago in Roger A. Johnson’s Advanced Euclidean Geometry. In reference to a 
certain theorem the author mentions that it is of Oriental origin, and that 


It was the custom of Japanese mathematicians to inscribe their discoveries 
on tablets which were hung in the temples, to the glory of the gods and the 
honor of the authors. 


The custom struck me as a lovely way to display respect for a beautiful idea and 
share it publicly. Moreover, the theorem itself, I felt, truly justified such regard. 
Readers can judge for themselves. It stated: 


Let a convex polygon, inscribed in a circle, be divided into triangles by 
diagonals from one vertex. Then the sum of the radii of the circles inscribed 
in these triangles is the same, whichever vertex is chosen. 


For a long time thereafter I found no other mention of such temple theorems, 
and assumed few existed with little known about them. So my interest naturally 
perked up when, in 1984, the Canadian journal Crux Mathematicorum began to 
publish such theorems from time to time as problems for solution. Readers like 
myself would learn that a particular problem was a temple problem when its 
solution appeared, since it would be accompanied by pertinent historical com- 
ments about the problem, such as the location and date of the tablet on which it 
had been found, or perhaps a book of the period in which its solution had been 
discussed. These comments were supplied by the proposer of these problems, 
Hidetosi Fukagawa, a Japanese scholar who has been devoting himself for the past 
fifteen years to collecting them and studying their history. 

The deeper Fukagawa got into this study the more he felt that this body of work 
deserved wider attention, and eventually he wrote about it to a number of 
mathematicians in the West. Fortunately one of them was Dan Pedoe, who, after 
studying some of the material, had the acuity to appreciate its significance, while 
also recognizing that much of it was unknown in the West. Knowing that its then 
editor, the late Léo Sauvé, as well as many of its readers were keen lovers of 
geometry, he suggested to Fukagawa that he submit some of these theorems as 
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marvelling at the sheer diversity of these problems, and the imagination and 
ingenuity of their creators. I recall becoming intrigued years ago by, for example, 
the well-known Ancient Theorem of Pappus, yet here one finds page after page of 
such problems, the variations seem endless. 

From the pedagogical point of view, while not a textbook, the book provides 
very desirable collateral reading for an appropriate course, say in geometry or the 
history of mathematics, besides being a sourcebook for innumerable interesting 
problems. 

It is certainly a most valuable addition to our body of classical geometry and its 
history. 


Symmetries of Culture: Theory and Practice of Plane Pattern Analysis. By Dorothy 
K. Washburn and Donald W. Crowe, University of Washington Press, Seattle, 
1988, x + 301 pp. 


Doris SCHATTSCHNEIDER 
Department of Mathematics, Moravian College, Bethlehem, PA 18018 


Classification is a tricky business. Yet it is difficult to think of any area of 
human enterprise in which this is not a fundamental activity. Classification is the 
way we sort and order, a means to identify and understand. How we classify 
reflects our biases and priorities—even the most “objectively scientific’? modes of 
classification are rooted in history and tradition, and not immune from cultural 
and professional bias. 

Over ten years ago, a colleague and I curated an art exhibit at the Allentown 
Art Museum in which we selected a variety of objects from their permanent 
collection. Fabrics, lace, quilts, decorated jars, silver serving pieces, tiles, plates, 
trivets, and lithographs were not classified by historical period, media, type of 
object, region, artist, or any other standard method of display used in art exhibits. 
Instead, the objects were displayed in groups according to the symmetry displayed 
in their design. Although it seemed perfectly natural to a mathematician to choose 
such a classification, it was highly unnatural to those on the museum staff. The 
mission of that art exhibit was similar to the mission of the book under review. 

The collaborating authors Dorothy Washburn, anthropologist, and Donald 
Crowe, mathematician, undertake a double task: first, convince those in the 
discipline of anthropology of the usefulness and viability of a relatively new 
method of classification of decorated objects, and then, teach them how to use the 
method. The method is one familiar to mathematicians: use symmetry groups to 
classify the decorative patterns that adorn the objects of study. The analysis of 
decorative designs in the plane according to their symmetries is a fairly standard 
exercise in mathematics courses that address the topics of geometric transforma- 
tions or groups. Although recently a few anthropologists have advanced this 
method of analysis, its use is by no means standard in the profession. 

It is not surprising, then, that the authors spend the largest part of the first 
chapter (‘‘History and Theory of Plane Pattern Analysis”) in making a convincing 
case for the use of symmetry analysis as an anthropological tool. The points that 
are made are especially interesting for mathematicians to read and keep in mind as 
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they teach symmetry and groups in an abstract setting. “‘Practically every culture in 
the world is known to decorate at least some portion of its material culture with 
repeated patterns.” [p. 29] Symmetry is certainly one of the most obvious charac- 
teristics of decorative art; less obvious perhaps is that it plays a strong role in 
perception. It is a salient feature of style: the arrangement of design units is at 
least as important as the individual elements. “Only a limited number of symme- 
tries are appropriate in any given culture, and adherence to these structures is 
necessary for approval and use.” [p. 28] Symmetry classification of motif arrange- 
ment provides the anthropologist a systematic measure of “‘design structure” and 
provides a tool to organize the information in a way that is replicable; this 
description can be used to present cultural preferences. “Without explicitly de- 
fined classes of phenomena, the formulation of general theories about design 
relevant to all cultures cannot be achieved.” [pp. 40-41] 

Decorative art often shows a motif that repeats in two colors; the arrangement 
of the colors as well as the arrangement of the repeated motif can be important in 
design analysis. Although crystallographers and mathematicians in the late nine- 
teenth and early twentieth centuries established the technique of categorizing 
repeating patterns in two (and three) dimensions according to symmetry groups 
and enumerated all the possibilities, the coloring of such patterns did not become 
part of their theory until the mid-1950s. In light of the audience to which 
Symmetries of Culture is directed, there is a bit of irony in the fact that the first 
systematic symmetry analysis and complete enumeration of two-color periodic 
types of plane-filling mosaics (complete with decorative illustrations) was published 
in 1935-1936 by the textile physicist H. J. Woods, in The Journal of the Textile 
Institute (Manchester). These “‘counterchange” patterns are characterized by the 
property that each symmetry of the underlying uncolored pattern either leaves the 
colored pattern unchanged (i.e., the symmetry makes black motifs fall on black and 
white fall on white) or uniformly interchanges the colors (i.e., the symmetry makes 
all white motifs fall on black and vice versa). The careful presentation of the 
classification theory of such two-color designs sets this book apart from other 
symmetry books produced in recent years. 

The book is the coffee-table variety—large size, superbly designed and lavishly 
produced, with printed endpapers and the two-color mosaics of H. J. Woods 
embossed in gold on the cover. Over 500 illustrations (most of them photographs) 
give visual instruction in analysis of plane patterns and provide convincing evi- 
dence of the wealth of decorative design to be found in cultures around the world, 
from ancient times to modern. The authors note that the book was ten years in the 
making—no doubt a large part of that time was spent in gathering the impressive 
collection of examples of decorative art to illustrate the complete list of one and 
two-color symmetry groups of repetitive plane patterns. In only a few instances did 
they fail to find real examples—photos or drawings that show designs painted, 
printed, woven, incised, or inlaid on actual objects; in those few cases, an 
“invented design” is provided to illustrate a symmetry group. 

Chapters 2 through 6 are devoted to a careful presentation of mathematical 
terminology (e.g., the four isometries, as well as the restricted meaning (within the 
book) of such words as “‘symmetry of a figure,” “design,” ‘‘pattern,” “dimension,”’ 
and “symmetry consistent with color’), the notation for symmetry groups, and 
detailed instructions on how to carry out the symmetry analysis of a design. 
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Chapter 4 treats one-dimensional patterns (friezes), of which there are 7 one-color 
and 17 two-color types; chapter 5 treats two-dimensional patterns (wallpaper 
patterns), of which there are 17 one-color and 46 two-color types, and chapter 6 
treats finite designs (rosettes), of which there are 2 infinite classes of one-color and 
3 infinite classes of two-color designs. The authors’ highly visual approach is to 
permeate the text with illustrative step-by-step examples of symmetry analysis of 
designs and to provide schematic diagrams, tables, yes-no flowcharts to guide 
identification, and (best of all) photos and drawings of real examples to illustrate 
each pattern type. This is a book in which the reader can learn general principles 
just by studying the pictures and diagrams and turn to the neighboring text for 
details. 

Borrowing a classification scheme from one discipline to apply to another can 
be both illuminating and risky. Illuminating, because it enables one to see with new 
eyes what is obscured by the tunnel vision of tradition formed within a single 
discipline. Risky, because the borrowed scheme may impose its own limitations 
inherent in its assumptions as to. what is important or “correct.” Washburn and 
Crowe give convincing reasons for the illumination to be gained by the addition of 
symmetry analysis to the standard repertoire of anthropological tools. Certain 
conventions must be made in order to use that tool, however. For example, one 
must decide when to treat a pattern as finite, or one-dimensional, or two-dimen- 
sional; decide when to ignore details or irregularities; decide when a colored motif 
is to be treated as part of the background and not part of the design. The authors 
give their reasonable conventions for each of these decisions, and in a final 
chapter, provide a few caveats. The most important point (in the reviewer’s 
Opinion) is saved for the last page—the information in a design that is disregarded 
for the sake of symmetry analysis may be the most telling for anthropological 
study. Purposeful irregularities in repetition or in the coloring of motifs may be 
mandated by the culture or contain encoded information. Indeed, in earlier 
chapters, the reader must become frustrated at the obvious inadequacy of classifi- 
cation by color symmetry groups to recognize highly orderly colorings of patterns 
that fail to have the coloring compatible with most or even any symmetries of the 
pattern. As such, they are classified as having little or no color symmetry. The need 
for new classification schemes that recognize orderly arrangements and orderly 
coloring of motifs in a repeated pattern has been noted by Grunbaum and 
Shephard [3], and an attempt to define such a scheme especially suited to fabric 
design has recently appeared [2]. 

It has often been noted that one of the most formidable aspects of mathematics 
to ‘‘outsiders” is the mysterious notation that is used for identification of objects. 
Even mathematical insiders deplore poor notation, or unnecessary notation. Un- 
fortunately, the notation situation in the literature on color groups is, in a word, a 
mess. It reflects the newness of the subject as well as the difficulty in finding a 
good characterizing notation. Crowe and Washburn do a real service by providing 
a cross-reference table of five notations in use for color symmetry groups in the 
plane. The notation they choose to use throughout the book (that of crystallogra- 
phers Belov and Tarkhova), however, is unfortunate from the point of view of 
teaching and understanding the algebraic structure of the symmetry groups. (From 
the point of view of the target audience, the choice of notation may seem 
inconsequential—by following the yes-no flowcharts, one can assign the correct 
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classification symbol to a pattern. It is merely a label; it need not be deciphered or 
understood.) A much more informed notation (with respect to algebraic structure) 
for color symmetry groups was implicit in a 1961 paper of B. L. van der Waerden 
and J. J. Burckhardt and later developed by M. Senechal [4]. Given a periodic 
pattern colored with k colors so that all elements of the symmetry group G of the 
uncolored pattern are compatible with the coloring, one identifies a subgroup H of 
G that leaves invariant the set of all motifs of a single chosen color. The index of 
H in G is k; the Senechal notation for the color symmetry group of the pattern is 
[G: H] =k. When k = 2, H is necessarily normal in G; for this case, H. S. M. 
Coxeter introduced the quotient group notation G/H. For all but one of the 46 
two-color groups, the pair G, H completely determines the color symmetry group. 
Crowe and Washburn discuss Coxeter’s notation, but do not use it; yet it is a 
notation that gives far more information on the arrangement of colors and is easy 
to use. (First, view a pattern as uncolored and determine its symmetry group G, 
then squint and focus only on the motifs of one color and determine the symmetry 
group H of that one-color pattern.) 

Why is this book being reviewed here? It was written for archaeologists, 
anthropologists, and art historians, but the authors have taken care in their 
presentation of the geometry of symmetry and color symmetry analysis. Aside from 
the fact that mathematicians and their students need to be more aware of 
unconventional applications of mathematics, it is a book that is a wonderful 
resource. It is also a natural choice for a text to teach an introductory course on 
one- and two-color plane symmetry groups. The level and purpose of such a course 
will determine how much mathematical detail will need to be supplemented. The 
book includes proofs of only two theorems: there are only four isometries of the 
Euclidean plane and there are exactly 7 classes of one-dimensional periodic 
patterns. The book concludes with an extensive bibliography. 
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General, P*, L**. Visualization in Teaching and 
Learning Mathematics. Eds: Walter Zimmermann, 
Steve Cunningham. MAA Notes Ser., No. 19. MAA, 
1991, viii + 224 pp, (P). [ISBN: 0-88385-071-0] A 
superb collection of profusely illustrated papers on 
various aspects of visualization in mathematics—its 
role in intuition and insights; its risks and benefits 
in proofs; its potential in the curriculum. Exam- 
ples from calculus, linear algebra, differential equa- 
tions, analysis, differential geometry, complex analy- 
sis, numerical analysis, and stochastic processes. A 
visual and intellectual delight that will surely stimu- 
late imaginative pedagogical innovation. LAS 


General, P. Las Matematicas En Costa Rica. Ed: 
Angel Ruiz Zufiiga. Departamento de Publicaciones 
Universidad Nacional (Escuela de Matematica, Uni- 
versidad de Costa Rica, San Jose, CR), 1990, x + 
533 pp, (P). Proceedings of the October 1990 third 
Costa Rican national mathematics conference, in- 
cluding sections on pure mathematics, applied math- 
ematics, mathematics of computation, mathematics 
education, history of mathematics, and philosophy 
of mathematics. A supplementary volume contains 
various late papers. LAS 


General, P*. Actions for Renewing U.S. Mathe- 
matical Sciences Departments. National Research 
Council, 1990, vii + 29 pp, (P). Advice to chairs 
of university mathematics departments about steps 
they can take to “become active participants” in 
the effort to renew U.S. mathematics. Focuses on 
setting goals, educational initiatives (at all levels), 
research support (both disciplinary and interdisci- 
plinary), and strategies to alter the reward sys- 
tem. This report, sponsored by the (research) Board 
on Mathematical Sciences at the National Research 
Council, is part of the wide-ranging effort of the Na- 
tional Academy of Sciences to improve mathematics 
education at all levels. LAS 

Mathematics Appreciation, S*(13-14), L*. 
More Mathematical Morsels. Ross Honsberger. Dol- 
ciani Math. Expos., No. 10. MAA, 1991, xii + 322 pp, 
$16 (P). [ISBN: 0-88385-313-2] The sixth of Hons- 
berger’s delightful Dolciani volumes: expansive so- 


lutions for intriguing elementary problems selected 
from Cruz Mathematicorum (a journal of under- 
graduate problem solving published by the Cana- 
dian Mathematical Society) together with “glean- 
ings” from Murray Klamkin’s “Olympiad Corner” 
which appeared regularly in that journal. A pure 
delight for problem solvers; a rich resource for math 
clubs. LAS 


Education, P. Algebridge: Concept Based Instruc- 
tional Assessment. College Board. Janson, 1990, 
xvii + 414 pp, $44.90 (P). [ISBN: 0-939765-34-9] 
An innovative set of diagnostic assessment units in- 
tended to help prepare students for the transition 
from arithmetic to algebra. Intended to be used as 
a supplement to any standard course or text; con- 
sists of a rich teacher resource guide plus sets of stu- 
dent assessment booklets dealing with such topics 
as fractions, negatives, proportional reasoning, vari- 
ables, inequality, and operations. Prepared as part 
of the College Board’s Educational EQuality Project 
intended to widen the access of secondary school stu- 
dents to college. LAS 


History, S$(16-17), P, L. History of Continued 
Fractions and Padé Approzimanits. Claude Brezin- 
ski. Springer-Verlag, 1991, 551 pp, $79. (ISBN: 0- 
387-15286-5] A history up to 1939. The exposition 
consists of a more-or-less chronological reporting of 
developments interspersed with passages of original 
texts and papers. Of value to any teacher of ele- 
mentary number theory. The 150-page bibliography 
must surely contain every relevant reference. SG 


History, S*, P, L*. Huygens and Barrow, Newton 
and Hooke. V.I. Arnol’d. Transl: Eric J.F. Prim- 
rose. Birkhauser Boston, 1990, 118 pp, (P). [ISBN: 
0-8176-2383-3] A penetrating historical probe that 
reveals contemporary perspectives on deep mathe- 
matical problems explored by Newton and his con- 
temporaries: evolutes and quasicrystals; Principia 
and Kirkwood gaps; Kepler’s second law and Abelian 
integrals. Written in a rich expository style; full of 
historical detail enriched by contemporary mathe- 
matical wisdom. LAS 
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Linear Algebra, T(16-17: 1). Fundamentals of 
Matriz Computations. David S. Watkins. Wiley, 
1991, xiii + 449 pp, $51.95. [ISBN: 0-471-61414- 
9] Designed to be used as a textbook for courses in 
numerical linear algebra, it can also serve as a refer- 
ence for those desiring an introduction to a particular 
topic or technique. It covers error analysis, elimi- 
nation schemes, least-squares problems, eigenvalues 
and eigenvectors, and the singular value decomposi- 
tion. AO 

Linear Algebra, T(14-15: 1). Algébre Linéaire. 
Joseph Grifone. Cepadues-Editions, 1990, viii + 439 
pp, (P). [ISBN: 285-428-239-6] A rather inclusive 
introduction to the theory of vector spaces, usually 
finite-dimensional, over an arbitrary field. Exercises, 
hints on solutions. JD-B 

Group Theory, T(18: 1), S, P. Quasigroups and 
Loops: Theory and Applications. Eds: O. Chein, 
H.O. Pflugfelder, J.D.H. Smith. Sigma Ser. in Pure 
Math., V. 8. Heldermann Verlag, 1990, xii + 568 
pp, DM 148. [ISBN: 3-88538-008-0] A collabora- 
tive effort with fourteen chapters, all by different au- 
thors, sharing a common belief that non-associativity 
has “now become susceptible” to systematic analy- 
sis. Wide range of topics, both algebraic and geo- 
metric. Quite contemporary. Index, extensive bibli- 
ography. JS 

Group Theory, T(16-17: 1), S, L. Quasigroups 
and Loops: Introduction. Hala O. Pflugfelder. Sigma 
Ser. in Pure Math., V. 7. Heldermann Verlag, 1990, 
147 pp, DM 68. [ISBN: 3-88538-007-2] Assuming 
only an elementary course in abstract algebra, this 
text provides a basic introduction to quasigroups and 
loops. Chapters on relations to geometry, isotopy, 
Moufang loops. Exercises, bibliography, index. JS 


Algebra, P. Lie Algebras and Related Topics. Eds: 
Georgia Benkart, J. Marshall Osborn. Contemp. 
Math., V. 110. AMS, 1990, xxxvi + 313 pp, $44 
(P). (ISBN: 0-8218-5119-5] Proceedings of a re- 
search conference held May 22-June 1, 1988 at the 
University of Wisconsin at Madison. LC 


Algebra, T(17-18: 1), S, L. Elements of Math- 
ematics: Algebra II, Chapters 4-7. N. Bourbaki. 
Trans]: P.M. Cohn, J. Howie. Springer-Verlag, 1990, 
vii + 461 pp, $98. [ISBN: 0-387-19375-8] Transla- 
tion from the French of the chapters on polynomials 
and rational functions, commutative fields, ordered 
groups and fields, and modules over principal ideal 
domains. Manages to preserve the spirit, style, no- 
tation, and mannerisms of the original. Exercises, 
historical notes, bibliography, index. JS 


Calculus, T(13: 1). Calculus, Volume 1 (Instruc- 
tor’s Preliminary Edition). Thomas P. Dick, Charles 
M. Patton. PWS-Kent, 1991, xvi + 366 pp, (P). 
[ISBN: 0-534-92779-3] The first semester’s part of 
the new calculus text being prepared by the NSF- 
funded Oregon State University Calculus Curriculum 
Project. Stresses intelligent use of technology (to 
avoid “becoming a victim of its pitfalls”); multiple 
representations of functions (graphical, numerical, 
symbolic); visualization (a machine-generated graph 
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“as the first step instead of the last one”); and math- 
ematical modelling. Typical syllabus, elaborated in 
an inviting, “lean” style; makes only metaphorical 
(e.g., “zoom”) reference to calculators or computers. 
A standard supplement (excerpted in this edition) 
on using technology in calculus discusses important 
issues (e.g., roundoff, viewing windows) indepen- 
dent of hardware and provides additional technology- 
dependent examples and exercises. LAS 


Calculus, S(13-14: 1-3). Uses of Technology in 
the Mathematics Curriculum. Benny Evans, Jerry 
Johnson. Oklahoma State University, 1990, 188 pp, 
(P). Outlines of over fifty laboratory projects for use 
with computer algebra systems to supplement calcu- 
lus, linear algebra, and differential equations. Writ- 
ten without reference to specific software; opens with 
a section providing mini-reviews of available soft- 
ware. Based on two NSF-sponsored workshops at 
Oklahoma State University. LAS 


Calculus, T(13: 2). Single Variable Calculus, Sec- 
ond Edition. James Stewart. Brooks/Cole, 1991, 
xvii + 736 pp, $47.25. (ISBN: 0-534-14532-9] A 
concise presentation of traditional topics with early 
coverage of trigonometric functions; introduces top- 
ics with intuitive descriptions when appropriate; 
incorporates Pdlya-like problem solving strategies. 
This edition (a revision of Calculus, TR, Novem- 
ber 1987) reorganizes treatment of limits, has ear- 
lier applications of integration, introduces exponen- 
tial functions before logarithmic, and includes more 
numerical integration. JNC 


Partial Differential Equations, P, L. Lecture 
Notes tn Mathematics-1450: Functional-Analytic 
Methods for Partial Differential Equations. Eds: 
H. Fujita, T. Ikebe, S.T. Kuroda. Springer-Verlag, 
1990, vii + 251 pp, $29 (P). [ISBN: 0-387-53393-1] 
Proceedings of a conference honoring Tosio Kato held 
at the University of Tokyo, July 1989. Contains 17 
papers and a list of recent publications of Kato. MLR 


Operator Theory, P. Nonlinear Superposition Op- 
erators. Jurgen Appell, Petr P. Zabrejko. Tracts in 
Math., V. 95. Cambridge University Pr, 1990, 311 
pp, $59.50. (ISBN: 0-521-36102-8] Gives definition 
of superposition operators. Analyzes the bounded- 
ness, compactness, and differentiability of a function 
and the related properties of the operator determined 
by the function. Involves nonlinear analyses on ideal 
spaces, Lebesgue spaces, Sobolev spaces, and other 
functional spaces. LS 


Operator Theory, P. Proceedings Seminar 1986- 
1987: Mathematical Structures in Field Theories. 
G.G.A. Bauerle, et al. CWI Syllabus, V. 26. Math- 
ematisch Centrum, 1990, iii + 169 pp, Dfl. 48 
(P). [ISBN: 90-6196-387-7] Six papers from lectures 
given at the Centre for Mathematics and Computer 
Science in Amsterdam ranging from Dirac’s formal- 
ism to supersymmetric field theory. SP 


Operator Theory, P. Operator Theory: Proceed- 
ings of the 1988 GPOTS-Wabash Conference. Eds: 
J.B. Conway, B.B. Morrel. Pitman Res. Notes in 
Math. Ser., V. 225. Longman Sci. & Technical (US 
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Distr: Wiley), 1990, 183 pp, $34 (P). [ISBN: 0-582- 
06190-3] Proceedings of a May 1988 conference in 
Indianapolis focusing on operator theory. Includes 
papers by Arveson, Ghatage, Ji and S. Pedersen, 
G.K. Pedersen, Peligrad, Price, and Teleman. JS 
Analysis, T(14-17: 1, 2), L. An Introduction to 
Abstract Analysis. W.A. Light. Chapman & Hall, 
1990, xiii + 194 pp, $29.95 (P); $65. [ISBN: 0-412- 
31090-2; 0-412-31080-5] Assumes only an advanced 
calculus background. Gently introduces the reader to 
abstract analysis with an eye toward setting the stage 
for further readings in functional analysis. Contains 
many examples and exercises and, at first viewing, 
appears quite readable. KS 


Algebraic Geometry, P. Helices and Vector Bun- 
dles: Seminaire Rudakov. A.N. Rudakov, e¢ al. 
Transl: A.D. King, P. Kobak, A. Maciocia. Lon- 
don Math. Soc. Lect. Note Ser., V. 148. Cambridge 
University Pr, 1990, 143 pp, $24.95 (P). (ISBN: 0- 
521-38811-2] A collection of twelve seminar talks 
delivered in Moscow on the use of helices to study 
exceptional vector bundles on an n-dimensional va- 
riety. SG 

Algebraic Geometry, P. Spectral Theory and 
Analytic Geometry Over Non-Archimedean Fields. 
Vladimir G. Berkovich. Math. Surveys & Mono- 
graphs, No. 33. AMS, 1990, ix + 169 pp, $53. 
[ISBN: 0-8218-1534-2] Exposition of the author’s 
work on non-Archimedean analytic spaces (extend- 
ing the concept of a rigid analytic space), and the 
basics of the spectral theory of bounded linear oper- 
ators on such spaces. SG 


Algebraic Geometry, P. G-Functions and Ge- 
ometry. Yves André. Aspects of Math., V. E13. 
Friedr. Vieweg & Sohn (US Distr: GLP International 
Vieweg), 1989, xii + 229 pp, DM 48 (P). [ISBN: 
3-528-06317-3] An exposition of Siegel’s theory of 
G-functions and its applications to diophantine ap- 
proximation and arithmetic algebraic geometry. SG 


Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1439: Automorphism Groups of Compact 
Bordered Klein Surfaces, A Combinatorial Approach. 
Emilio Bujalance, et al. Springer-Verlag, 1990, xiii + 
201 pp, $22 (P). [ISBN: 0-387-52941-1] The authors 
consider the question of realizing a finite group of a 
certain type (abelian, nilpotent, solvable) as the au- 
tomorphism group of a compact Klein surface. The 
book is well-written, contains an extensive bibliogra- 
phy, and is relatively self-contained. SG 


Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1447: Cohomology of Arithmetic Groups and 
Automorphic Forms. Eds: J.-P. Labesse, J. Schwer- 
mer. Springer-Verlag, 1990, 356 pp, $39 (P). (ISBN: 
0-387-53422-9] A collection of fourteen articles, all 
presented at a 1989 conference. Includes a survey 
article by Schwermer, as well as papers by Wallach, 
Ash and Borel, Harder, Oda, and others. SG 


Algebraic Geometry, P. Algebraic Geometry. 
Eds: H. Kurke, J.H.M. Steenbrink. Kluwer Aca- 
demic, 1990, viii + 305 pp, $99. [ISBN: 0-7923-0934- 
0] A special issue of Composito containing fifteen 
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papers covering a wide range of topics in algebraic 
geometry. SG 


Differential Geometry, P. Geometric Analysis 
and Computer Graphics. Eds: P. Concus, R. Finn, 
D.A. Hoffman. Math. Sci. Res. Inst. Public., V. 17. 
Springer-Verlag, 1991, ix + 203 pp, $39.80. [ISBN: 0- 
387-97402-4] Twenty-two papers from a workshop 
on differential geometry, calculus of variations, and 
computer graphics at the Mathematical Sciences Re- 
search Institute in Berkeley, May 23-25, 1988. In- 
cludes many computer-generated figures, including 
eight pages of color plates. DFA 


Operations Research, T(15-17), L. Operations 
Research: Applications and Algorithms, Second Edi- 
tion. Wayne L. Winston. PWS-Kent, 1991, xvi + 
1262 pp. [ISBN: 0-534-92495-6] Second Edition of 
an excellent text for a first course in operations re- 
search. Full of examples that motivate mathemat- 
ical solutions and theory. Changes include: 3D 
geometry of LPs feasible region, matrix generators 
of LPs, graphical approach to sensitivity analysis, 
rewritten transshipment and shortest path problems, 
new section on Karmarkar’s method, network sim- 
plex method, cutting plane method for integer pro- 
gramming, method of feasible directions for nonlin- 
ear programming problems, Bayesian inference with 
the normal distribution, extended coverage of inven- 
tory theory, analytical hierarchy principle, first pas- 
sage times in Markov chains, “open” queueing net- 
works, and a chapter on forecasting. MK 
Optimization, T(17-18). Ezistence and Opti- 
mality of Competitive Equilibria. Charalambos D. 
Aliprantis, Donald J. Brown, Owen Burkinshaw. 
Springer-Verlag, 1990, xii + 284 pp, $35 (P). (ISBN: 
0-387-52866-0] A “systematic exposition ...on gen- 
eral equilibrium models with an infinite number of 
commodities.” Discusses existence and optimality of 
competitive equilibria in Walrasian general equilib- 
rium model with finite number of households, firms, 
and commodities, and extends to the infinite com- 
modities case. Uses Riesz spaces to model the infinite 
case; includes chapter on basic Riesz theory. Con- 
tains a number of exercises after each section. MK 


Optimization, T(17: 1, 2), L. Foundations of 
Integer Programming. Harvey M. Salkin, Kamlesh 
Mathur. North-Holland (US Distr: Elsevier Sci- 
ence), 1989, xix + 755 pp, $55. [ISBN: 0-444-01231- 
1] An introductory survey of integer programming 
techniques and applications. Covers cutting plane 
techniques, enumerative methods, group theoretic 
techniques, and specialized algorithms for the knap- 
sack, set covering, fixed charge, and traveling sales- 
man problems. The bibliography contains more than 
600 entries. AO 


Optimization, P. Large-Scale Numerical Optimiza- 
tion. Eds: Thomas F. Coleman, Yuying Li. SIAM, 
1990, 255 pp, $38.50 (P). [ISBN: 0-89871-268-8] 
Fifteen papers from a workshop held at Cornell Uni- 
versity on October 19-20, 1989. The papers are or- 
ganized into four sections: applications, linear and 
quadratic problems, sparse and nonlinear problems, 


390 


and parallel approaches. AO 


Probability, P. Mazimum Entropy and Bayesian 
Methods. Ed: Paul F. Fougére. Fund. Theories of 
Physics, V. 39. Kluwer Academic, 1990, xiii + 480 
pp, $129. (ISBN: 0-7923-0928-6] Text from pro- 
ceedings of Ninth Annual Maximum Entropy Work- 
shop held at Dartmouth College, August 14-18, 1989. 
Thirty-two essays include nice discussion of ontolog- 
ical vs. epistemological probability (Edwin Jaynes), 
Bayesian methods applied to astrophysical problems, 
discussions of model selection, parameter estimation, 
construction of priors, supernova analysis, drug ab- 
sorption, etc. MK 


Stochas/ic Processes, P. Counting Process Sys- 
tems: Identification and Stochastic Realization. 
P.J.C. Spreij. CWI Tract, V. 71. Mathematisch Cen- 
trum, 1990, 135 pp, Dfl. 38.50 (P). (ISBN: 90-6196- 
388-5] A slightly revised version of the author’s the- 
sis, this monograph presents results on recursive pa- 
rameter estimation algorithms and realization prob- 
lems for counting process systems. AO 


Stochastic Processes, P. Adaptive Algorithms 
and Stochastic Approzimations. Albert Benveniste, 
Michel Métivier, Pierre Priouret. Transl: Stephen 
S. Wilson. Appl. of Math., V. 22. Springer-Verlag, 
1990, xi + 365 pp, $59. [ISBN: 0-387-52894-6] An 
introduction to the field of adaptive algorithms— 
mathematical techniques which repeatedly apply a 
recurrence formula to converge on the correct nu- 
merical result. The text introduces the basic ideas— 
convergence rates, parameter tracking, validation— 
and demonstrates these ideas with a number of exam- 
ples drawn from different areas of mathematics. GMS 


Elementary Statistics, T(15-17: 2), L. Biostatis- 
tics: A Foundation for Analysis in the Health Sci- 
ences, Fifth Edition. Wayne W. Daniel. Wiley, 
1991, xii + 740 pp, $49.95. [ISBN: 0-471-52514- 
6] Fifth Edition of a first-principles statistics text 
(First Edition, TR, February 1975). Aimed at ad- 
vanced undergraduates, first-year sraduate students, 
and health-care professionals. Covers standard, fre- 
quentist, statistical methodology; probability distri- 
butions, point and interval estimation, testing, one 
and two-way ANOVA, simple and linear regression, 
and nonparametric statistics. Additions to this edi- 
tion include exploratory data analysis; new exer- 
cises; free diskettes with large data sets; incorpo- 
ration of Minitab, complete with appendix of com- 
mands and syntax. Includes examples with detailed, 
well-exposed problems. MK 


Elementary Statistics, T*(13-14: 1, 2), L. In- 
troduction to Statistics: The Nonparametric Way. 
Gottfried E. Noether. Texts in Stat. Springer- 
Verlag, 1991, xii + 414 pp, $49.50. [ISBN: 0-387- 
97284-6] Successor to the author’s 1976 text In- 
troduction to Statistics: A Nonparametric Approach 
(TR, August-September 1976). Provides an alter- 
native to the standard pre-calculus statistics course. 
Includes the classical normal-theory procedures af- 
ter each topic has been more thoroughly discussed 
using a nonparametric approach. Includes a Minitab 
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section at the end of all but two of the twenty chap- 
ters. RSK 


Computational Statistics, S, C. Learning Data 
Analysis with Data Desk. Paul F. Velleman. WH 
Freeman, 1989, xii + 353 pp, $26.95 (P). [ISBN: 
0-7167-2072-8] A simple graphics-oriented Macin- 
tesh statistics package that provides multiple win- 
dows displaying various summary statistics, graphs 
(e.g., histograms, scatterplots), random samples, in- 
ference, ANOVA, and regression. Tools permit entry 
and editing of data (the student version does not 
allow importing of data except from the full pro- 
fessional version), extensive editing of the graphical 
displays, and saving of both text and graphical out- 
put for use in other programs. “HyperViews” (sub- 
menus) make options visible; “hot” data makes the 
consequences of changes instantly visible in various 
windows. A valuable exploratory tool for standard 
elementary topics. LAS 


Computational Statistics, T(17-18), S, P, L. 
Generalized Additive Models. T.J. Hastie, R.J. Tib- 
shirani. Mono. on Stat. & Appl. Prob., V. 43. Chap- 
man & Hall, 1990, xv + 335 pp, $55. (ISBN: 0-412- 
34390-8] Discusses intuition, application, and the- 
ory for various nonparametric regression-like models. 
Covers a multiplicity of smoothing techniques with 
exposition of trade-offs, choice of smoothing parame- 
ters, cross-validation, and special topics such as non- 
linear smoothers, Kriging, and resistant smoothing. 
Additional chapters cover generalized additive mod- 
els, transformations (in particular, ACE), additive 
models for matched case-control data, proportional- 
hazards, seasonal time-series, model selection, and 
adaptive regression splines. MK 


Statistics, T(16: 1), L. A First Course in the The- 
ory of Linear Statistical Models. Raymond H. My- 
ers, Janet S. Milton. Ser. in Stat. & Decision Sci. 
PWS-Kent, 1991, x + 342 pp. [ISBN: 0-534-91645- 
7] Fine introduction to the theory of estimation 
and testing in full rank and less-than-full rank linear 
models. Preliminary chapters review matrix algebra 
and present quadratic forms and their distributions. 
Includes illustrative SAS output at the end of the 
chapters. RWJ 


Statistics, T(13: 1, 2), L. Statistics for the Life 
Sciences. Myra L. Samuels. Dellen, 1989, xi + 
597 pp. (ISBN: 0-02-405501-8] Standard introduc- 
tion to descriptive and inferential statistics assuming 
an elementary algebra background. Covers correla- 
tion, regression, one-way ANOVA, and basic non- 
parametric tests. Also considers questions of experi- 
mental design and simultaneous inference. Contains 
a wealth of real data sets. RWJ 


Statistics, T(16-17), L. Design and Analysis of 
Ezperiments, Third Edition. Douglas C. Mont- 
gomery. Wiley, 1991, xvii + 649 pp, $56.95. [ISBN: 
0-471-52000-4] Standard topics for first course in 
experimental design: completely randomized, ran- 
domized block, Latin square, incomplete, factorial, 
fractional factorial, nested and split-plot designs. 
Also, regression analysis, response surface methods, 
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and analysis of covariance are discussed. This edi- 
tion reorganizes material on 2* and fractional facto- 
rial designs. Discusses Taguchi approach to parame- 
ter design. Numerous exercises and terse examples. 
(Second Edition, TR, August-September 1984.) MK 


Statistics, P. Statistical Analysis of Measurement 
Error Models and Applications. Eds: Philip J. Brown, 
Wayne A. Fuller. Contemp. Math., V. 112. AMS, 
1990, xii + 248 pp, $53 (P). (ISBN: 0-8218-5117- 
9] Collection of papers presented at a conference 
held June 10-16, 1989 at Humboldt State Univer- 
sity. Considers linear and nonlinear measurement 
error models; a few papers deal with computational 
aspects and robustness questions. Includes a short 
survey paper of work in this area. RWJ 


Statistics, P. Lecture Notes in Statistics-65: Non- 
Standard Rank Tests. Arnold Janssen, David M. 
Mason. Springer-Verlag, 1990, 252 pp, $34 (P). 
[ISBN: 0-387-97484-9] Theoretical monograph pre- 
senting “a methodology to construct rank tests for 
models where the standard regularity assumptions 
do not hold and to study their asymptotic proper- 
ties.” RSK 


Statistics, P. Lecture Notes in Statistics-64: Mini- 
maz Solutions in Sampling from Finite Populations. 
Siegfried Gabler. Springer-Verlag, 1990, v + 132 pp, 
$19 (P). (ISBN: 0-387-97358-3] Minimax and modi- 
fied minimax solutions for survey sampling problems 
with various types of parameter spaces. RWJ 


Statistics, P. Aspects of Risk Theory. Jan Grandell. 
Ser. in Stat. Springer-Verlag, 1991, x + 175 pp, 
$39. (ISBN: 0-387-97368-0] Generalizes the clas- 
sical stochastic model of an insurance business by 
allowing the occurrence of claims to be described by 
point processes which are not necessarily Poisson. In- 
tended for actuaries with a good knowledge of classi- 
cal risk theory, and for probabilists with an interest 
in modern ruin theory. RWJ 


Statistics, P. Statistics in Science: The Founda- 
tions of Statistical Methods in Biology, Physics and 
Economics. Eds: Roger Cooke, Domenico Costan- 
tini. Boston Stud. in Philo. of Sci., V. 122. Kluwer 
Academic, 1990, ix + 184 pp, $80. [ISBN: 0-7923- 
0797-6] Proceedings of an international conference 
held in Luino, Italy. Based on “the conviction that 
all non-deductive inferences in science are ultimately 
statistical inferences.” Contains ten papers illustrat- 
ing probabilistic reasoning in a variety of fields. Note 
price! RSK 

Statistics, S(18), P. State Space Modeling of 
Time Series, Second, Revised and Enlarged Edition. 
Masanao Aoki. Springer-Verlag, 1990, xvii + 323 
pp, $29.50 (P). [ISBN: 0-387-52870-9] ARMA mod- 
els are converted into state space forms, properties 
of state space models are discussed, and two alter- 
nate decompositions of Hankel matrices are used in 
constructing estimators. The new material in this 
edition include an expanded description of the sta- 
tistical properties, especially asymptotic properties, 
of the proposed estimators. RWJ 

Statistics, T(14: 1), L. Applied Regression Analy- 
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sis for Business and Economics. Terry E. Dielman. 
Ser. in Stat. & Decision Sci. PWS-Kent, 1991, ix + 
573 pp. [ISBN: 0-534-92238-4] Introduces simple 
and multiple linear regression models (including one 
and two-factor ANOVA) assuming a prior course in 
statistics and college algebra. Discusses the check- 
ing of model assumptions, variable selection, the use 
of indicator variables, and applications to time series 
data. Data sets, some of which are revisited through- 
out the text, are analyzed with the aid of SAS and 
Minitab. RWJ 


Statistics, T(18: 2), P. Introduction to Statisti- 
cal Pattern Recognition, Second Edition. Keinosuke 
Fukunaga. Computer Sci. & Scient. Computing. 
Academic Pr, 1990, xiii + 591 pp, $59.95. [ISBN: 
0-12-269851-7] Presents statistical decision making 
and estimation, both parametric and nonparamet- 
ric, from the viewpoint of pattern recognition. Dis- 
cusses procedures for classifier design, parameter es- 
timation, and density estimation in detail. Con- 
cludes with a chapter on parametric and nonpara- 
metric clustering. RSK 


Statistics, T(18: 1), P*. Generalized Multivariate 
Analysis. Fang Kai-Tai, Zhang Yao-Ting. Springer- 
Verlag, 1990, xi + 220 pp, $59. [ISBN: 0-387-17651- 
9} Summarizes the theory developed during the 
past fifteen years of multivariate analysis based on 
elliptically contoured distributions, which includes 
classical (normal) multivariate analysis as a special 
case. Good set of references. RSK 


Statistics, P, L. Linear Models for Multivariate, 
Time Series, and Spatial Data. Ronald Christensen. 
Texts in Stat. Springer-Verlag, 1991, xii + 317 
pp, $49.50. (ISBN: 0-387-97413-X] Theory of mul- 
tivariate linear models with a variety of applica- 
tions. Examples from published, real-world, refer- 
enced sources. Due to breadth of topics covered, 
discussion of each is necessarily brief. Topics range 
from discrimination, principal components and fac- 
tor analysis to time series, both frequency and time 
domain, and Kriging for spatial data. MK 


Statistics, S(17-18), P, L. Analysis of Repeated 
Measures. M.J. Crowder, D.J. Hand. Mono. on 
Stat. & Appl. Prob., V. 41. Chapman & Hall, 1990, 
x + 257 pp, $55. [ISBN: 0-412-31830-X] Details 
wide variety of methods for analyzing data involv- 
ing repeated measures. Includes analysis of longi- 
tudinal and spatial studies. Discusses multivariate 
techniques, regression models, hierarchical models, 
crossover experiments, categorical data models, and 
more. Contains examples complete with data and 
graphs, but there are no exercises. Makes use of 
BMDP and SPSS, including source code, output, 
and analysis. MK 


Programming, P, L. Wicked Problems, Righteous 
Solutions: A Catalogue of Modern Software Engi- 
neering Paradigms. Peter DeGrace, Leslie Hulet 
Stahl. Computing Ser. Prentice-Hall, 1990, ix + 
244 pp, (P). [ISBN: 0-13-590126-X] A monograph 
recounting and commenting on the various mod- 
els of software development, includes the sequen- 
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tially phased “waterfall” model, which is most com- 
monly used; variations on the waterfall model; pro- 
totyping; “all-at-once” problem solving. The au- 
thors’ stimulating remarks question contemporary 
approaches. RB 


Programming, T(15), S. Object-Oriented Model- 
ing and Design. James Rumbaugh, et al. Prentice- 
Hall, 1991, xii + 500 pp. [ISBN: 0-13-629841-9] 
An introductory text on the topic of software de- 
velopment using the concept of object-oriented soft- 
ware design. In this technique real world elements 
are modelled using simulated software objects, which 
are then manipulated using the techniques of mes- 
Sage passing, class structure, and inheritance. It is 
claimed that this is a superior technique for con- 
structing efficient, correct, and maintainable soft- 
ware. GMS 


Languages, P. Encapsulated PostScript: Appli- 
cation Guide for the Macintosh and PCs. Peter 
Vollenweider. Prentice-Hall, 1990, xvii + 226 pp, 
(P). [ISBN: 0-13-275843-1] An eclectic collection of 
vendor information, user hints, and programming 
examples intended to introduce the new standard 
PostScript page description language EPS. Useful de- 
tails obscured by poor organization; the lengthy in- 
dex may be of some help. LAS 


Languages, P, L*. Inside Smalltalk, Volume II. 
WilfR. LaLonde, John R. Pugh. Prentice-Hall, 1991, 
xvii + 553 pp. [ISBN: 0-13-465964-3] The second 
of two volumes providing an in-depth introduction 
to Smalltalk-80, an object-oriented programming en- 
vironment. This volume focuses on the construc- 
tion of interactive graphical interfaces. It examines 
the model-view-controller paradigm, window man- 
agement, and mouse interaction in the Smalltalk-80 
system. AO 


Languages, P, L. Object Oriented Programming 
with Smalltalk/V. Dusko Savi¢. Ser. in Comput. & 
Their Applic. Ellis Horwood (US Distr: Simon & 
Schuster), 1990, 340 pp. [ISBN: 0-13-040692-9] An 
introduction to the principles of object-oriented pro- 
gramming using the Smalltalk/V language and envi- 
ronment. Language features and programming tech- 
niques are introduced in the context of examples from 
a variety of applications areas. AO 


Algorithms, P. Parallel Algorithms for Matriz 
Computations. K.A. Gallivan, et al. SIAM, 1990, 
ix + 197 pp, $14.50 (P). [ISBN: 0-89871-260-2] 
“Three papers that collect, describe, or reference an 
extensive selection of important parallel algorithms 
for matrix computations ... . The first paper con- 
tains a general perspective on modern parallel] and 
vector architectures and the way in which they influ- 
ence algorithm design. The second paper is primarily 
concerned with parallel algorithms for solving sym- 
metric positive definite sparse linear systems. The 
final paper consists of an extensive bibliography on 
parallel and vector numerical algorithms.” LC 

Computer Systems, P, L. MH & zmh: E-mail for 
Users and Programmers. Jerry D. Peek. O’Reilly & 
Associates, 1990, xxviii + 553 pp, $27.95 (P). (ISBN: 
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0-937175-63-3] The MH Message Handling System 
is a collection of public domain electronic mail pro- 
grams for computers running UNIX. zmh is an X 
Window System interface to MH. This book is acom- 
prehensive guide to the features of MH. AO 


Computer Systems, P. TUGboat: The Commu- 
nications of the TRX Users Group. Ed: Barbara 
Beeton, Christina Thiele. Conf. Proc., V. 10, No. 
4. TpX Users Group (PO Box 9506, Providence, RI 
02940), 1989, 315 pp, $75 (P). Proceedings of the 
tenth annual meeting of the TpX Users Group held at 
Stanford University, August 20-23, 1989. includes 30 
papers describing projects using METAFONT, Trx, 
IATpX, and related software. AO 


Computer Graphics, S(16-18), P. 3D Interac- 
tive Computer Graphics: The Hierarchical Modelling 
System HIRASP. Wim J. Teunissen, Jan van den 
Bos. Ser. in Comput. & Their Applic. Ellis Hor- 
wood, 1990, ix + 229 pp. (ISBN: 0-13-921867-X] 
Complete presentation of the HIRASP software sys- 
tem, which is entirely oriented to raster graphics 
hardware and device-independent in that realm. HI- 
RASP defines 3D objects for rendering instead of 
drawing primitives, treats domain and color at the 
same level, contains a complete 3D viewing pipeline, 
provides a set of graphics input facilities, and allows 
interactive drawing manipulation. DFA 


Artificial Intelligence, P. Advances in Artificial 
Intelligence: Natural Language and Knowledge-based 
Systems. Ed: Martin Charles Golumbic. Springer- 
Verlag, 1990, xiv + 303 pp, $39. [ISBN: 0-387- 
97355-9] Fourteen papers selected from recent ar- 
tificial intelligence conferences in Israel, viz., the Is- 
raeli National Conferences on Artificial Intelligence 
(1987, 1988), and the applications track of the Bar- 
Ilan Symposium on the’Foundations of Artificial In- 
telligence. Topics include design and architecture, 
morphology and linguistic analysis, intelligent tutor- 
ing and training systems, semantics for constraint- 
based systems. DFA 


Artificial Intelligence, P, L. Verifying and Vali- 
dating Personal Computer-Based Expert Systems. A. 
Terry Bahill. Prentice-Hall, 1991, xi + 205 pp, (P). 
[ISBN: 0-13-957457-3] Topics: extracting knowl- 
edge from the system, verifying the knowledge base, 
validating the system, testing the system. Includes 
a study of the “pregnant man” problem, sensitiv- 
ity analyses, use of neural networks. References, 
glossary, collection of thought problems and pos- 
sible solutions, animal classification expert system. 
Good for self-study; chapters are independent of each 
other. DFA 


Artificial Intelligence, $(18), P. Intelligent Sys- 
tems: State of the Art and Future Directions. Zbig- 
niew W. Ras, Maria Zemankova. Ser. in AI. Ellis 
Horwood, 1990, vii + 529 pp. [ISBN: 0-13-465931-7] 
Twenty papers on knowledge representation, infer- 
ence mechanisms, reasoning under uncertainty, ma- 
chine learning, expert systems, intelligent informa- 
tion systems, the role of artificial intelligence in com- 
puter applications, and the impact and future of in- 


1991] 


telligent systems. Some are surveys, some are new 
developments. DFA 


Computer Science, T(14-15: 1, 2). Data Struc- 
tures and Program Design in C. Robert L. Kruse, 
Bruce P. Leung, Clovis L. Tondo. Prentice-Hall, 
1991, xv + 525 pp. (ISBN: 0-13-725649-3] C version 
of the first author’s text using Pascal. Suitable for 
both the ACM CS2 and CS7 (Data Structures and 
Algorithm Analysis) courses. Assumes some expe- 
rience with C, but provides a succinct introduction. 
Includes many examples of substantial length, exer- 
cises, and programming projects. Emphasizes prob- 
lem specification; program design, analysis, testing, 
verification, and correctness; major principles of soft- 
ware engineering; data abstraction. Good discussion 
of recursion and its uses. DFA 


Computer Science, T(15-16). Parallel Programs 
for the Transputer. Ronald S. Cok. Prentice-Hall, 
1991, xii + 242 pp, (P). [ISBN: 0-13-651480-4] In- 
troduces the Transputer Architecture—a massively 
parallel, MIMD-based, private memory computer ar- 
chitecture. Also teaches how to program in Occam-2, 
the parallel language of the transputer. Both of these 
systems are widely used in academia and industry, 
and are excellent vehicles for learning the concepts 
of parallel programming. GMS 


Applications, P. Continuation and Bifurcations: 
Numerical: Techniques and Applications. Eds: Dirk 
Roose, Bart De Dier, Alastair Spence. NATO ASI 
Ser. C, V. 313. Kluwer Academic, 1990, xili + 426 
pp, $132. [ISBN: 0-7923-0855-7] Contains papers 
from ten countries in Europe and North America. 
The central goal is to describe bifurcation problems 
and numerical techniques. The papers extend over 
spectrum theories, numerical analysis, software, and 
applications. LS 


Applications, P, L. Mathematica for the Sciences. 
Richard E. Crandall. Addison-Wesley, 1991, xiii + 
300 pp. [ISBN: 0-201-51001-4] A collection of ex- 
amples and exploratory exercises from mathematics, 
physics, biology, chemistry, and engineering that il- 
lustrate the use of Mathematica as a problem-solving 
tool in the sciences. The examples explore the sym- 
bolic, graphical, and numerical capabilities of Math- 
ematica. Programming techniques are developed on 
an “as needed” basis. AO 


Applications, T(17-18: 2), P, L. Log-Linear Mod- 
els. Ronald Christensen. Texts in Stat. Springer- 
Verlag, 1990, xi + 408 pp, $49. [ISBN: 0-387-97398- 
2] Bridges gap between Fienberg’s monograph and 
Bishop, Fienberg, and Holland’s definitive reference. 
This excellent text contains many fully-examined ex- 
amples and exercises along with readable text. Dis- 
cusses theoretical and computational aspects of the 
applications. Includes full discussion of logistic lin- 
ear modelsand a brief discussion of generalized linear 
models. MK 


Applications (Biological Science), P. Some 
Mathematical Questions in Biology: Sez Allocation 
and Sez Change: Ezperiments and Models. Ed: 


TELEGRAPHIC REVIEWS 
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Marc Mangel. Lect. on Math. in the Life Sci., 
V. 22. AMS, 1990, ix + 205 pp, $41 (P). [ISBN: 
0-8218-1172-X] Proceedings of the 1989 symposium 
held at the annual meeting of the AIBS in Toronto, 
Canada, August 7, 1989. Topics include sex alloca- 
tion in plants, male allocation and cost of biparental 
sex in a parasitic trematode, population genetics of 
sex allocation through exact population models, and 
various aspects of hermaphroditism in animals. MK 


Applications (Communication Theory), T(17) 
P. Discrete Cosine Transform: Algorithms, Advan- 
tages Applications. K.R. Rao, P. Yip. Academic Pr, 
1990, xviii + 490 pp, $69.95. [ISBN: 0-12-580203-X] 
First, basic theory: definitions, general properties, 
relations to the Karhunen-Loeve transform, fast al- 
gorithms, two-dimensional algorithms, performance. 
Then, the heart of the book: over 200 pages de- 
scribing 23 major applications. Concludes with 90 
pages of computer programs, a list of appropriate 
VLSI chip manufacturers, and an extensive, up-to- 
date bibliography. Good exercise sets throughout. 
Includes many figures, including color plates. DFA 
Applications (Physics), T*(14-15), L. The 
Principles of Electromagnetic Theory. Attay Kovetz. 
Cambridge University Pr, 1990, xvii + 221 pp, 
$49.50; $19.95 (P). [ISBN: 0-521-39106-7; 0-521- 
39997-1] The title suggests that this is just one 
more introductory text in electromagnetic theory. 
However, the difference here is that the author 
presents the subject as a classical theory independent 
of the atomic constitution of matter. He begins with 
a formal presentation of the laws of e-m theory (in 
tensor form) and then establishes a link between e- 
m theory, classical mechanics, and thermodynamics. 
The last half of the book deals with applications to 
electrostatics, dielectrics, etc. The level of the text is 
appropriate for second or third-year students with a 
background in multivariable calculus. Not that many 
problems. MPR 


Applications (Social Science), P. Frontiers of 
Mathematical Psychology: Essays in Honor of Clyde 
Coombs. Eds: Donald R. Brown, J.E. Keith Smith. 
Recent Res. in Psychology. Springer-Verlag, 1991, 
xxvi + 202 pp, $35 (P). [ISBN: 0-387-97451-2] Col- 
lection of seven essays honoring Clyde Coombs. In- 
troduction and biography by Amos Tversky. Top- 
ics include axiomatic approach to measurement the- 
ory; social dilemmas, economic self-interest, and evo- 
lutionary theory; probabilistic dimensionality; judg- 
ments of relative importance in decision making; val- 
idating the dimensional structure of psychological 
spaces. MK 


Reviewers 


DFA: David F. Appleyard, Carleton; RB: Richard Brown, 
St. Olaf; LC: Laura Chihara, St. Olaf; SG: Steven Galovich, 
Carleton; MK: Michael Kahn, St. Olaf; RSK: Richard S. Kle- 
ber, St. Olaf; AO: Arnold Ostebee, St. Olaf; MLR: Margaret 
L. Reese, St. Olaf; MPR: Matthew P. Richey, St. Olaf; KS: 
Karen Saxe, St. Olaf; GMS: G. Michael Schneider, Macalester; 
JS: John Schue, Macalester; LS: Li Sheng, Macalester; LAS: 
Lynn Arthur Steen, St. Olaf. 


SUBSCRIBE TO 


UME 
TRENDS 


News and reports on 
Undergraduate 
Mathematics 
Education 


Keep up with what's happening in 
Undergraduate Mathematics 
Education. 


UME TRENDS is conducting a 
subscription drive for its third 
volume beginning March 1991. 


Whether you are receiving it now or 
not, you must subscribe in order to 
keep your issues coming. 


We must receive a minimum 
number of subscriptions in order to 
keep publishing UME TRENDS. 


SUBSCRIBE NOW! 


Subscriptions are 
$12 per year for six 
issues. 

Copy or clip the adjoining form and 


mail it to arrive by February 8, 
1991, 
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When was the last 
time a computer 
program helped 
you think about 
mathematics? 


Call or write for our latest software catalog. 
For PC or Macintosh. 


Lascaux Graphics 3220 Steuben Ave. Bronx, NY 10467 (212) 654-7429 


Saunders College Publishing Salutes 


Mathematics 


Awareness Week 
April 2.1st—27th 


and its sponsors 
The American Mathematical Society 
The Mathematical Association of America 
The Society for Industrial and Applied Mathematics 


The Powers™ Mathematics 
#& SAUNDERS COLLEGE PUBLISHING 


The Curtis Center, Independence Square West, Philadelphia, PA 19106 


Career 
Information 
from the 


MAA 


MATHEMATICAL 
SCIENTISTS AT WORK 


If you have difficulty clarifying career op- 
portunities in the mathematical sciences for 
your students, then let these 14 practition- 
ers speak for you. This attractive book- 
let provides career information about statis- 
tics, actuarial science, operations research, 
and computer science as well about math- 
ematics. Each essay is accompanied by 
a photograph of the author and by addi- 
tional information relevant to the occupa- 
tion. The booklet also contains general in- 
formation about careers in mathematics and 
sources where additional information may 
be obtained. 


This is what some of our writers have to say 
to students: 


...when seeking a mathematical posi- 
tion, don't be put off the trail by ti- 
tles of jobs or work units that may not 
seem mathematical. Mathematicians 
can pop up, and do mathematical work, 
all over the place. 


Michael Weiss, Agricultural Economist 


The attempts to build expert systems, 
which can perform tasks such as med- 
ical diagnosis or operating a computer, 
show the need for mathematical mod- 
els of how computer systems can store 
data and “reason” about knowledge. 


Maria Klawe, Computer Scientist 


This valuable booklet should find its way 
into every mathematics department and 
guidance office in the country. Be sure that 
your students get their hands on a copy. 
32 pp., 1990, ISBN 0-88385-452-X 

Price: 1-9 copies $5.00 each 

Price: 10-49 copies $4.50 each 


Price 50 or more copies, $4.00 each 
Price 100 or more copies, $3.00 
Catalog Number MSW 


CAREERS IN THE 
MATHEMATICAL SCIENCES 


This pamphlet contains condensations of 
six of the essays in “Mathematical Scien- 
tists at Work” plus sources for additional in- 
formation. It is ideally suited for distribu- 
tion to high school or college mathematics 
classes and to mathematics clubs. Take 
some with you when giving a talk to student 
groups and make them available to students 
who visit your college campus—you will find 
many uses for these handy pamphlets. 


Price: 1-99 copies 25 cents each 
Price: 100 or more copies 20 cents each 
Catalog Number CMS 


Order From 


M® 1529 18th Street, N.W. 
Washington, D.C. 20036 


Mathematical Association of America 


Recursive & iterative programming ¢ Over 70 new math functions e New manual, 500 examples & exercises 


The Joy of BERRY=)ing. 
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. 

13} £4 Assistant program. 

| mo © | “43 

| mm od PC Magazine SaySitS 
Wry] “a joy to use” and proclaims it 


“Editors’ Choice.” PC Week calls 
it “fast and capable.” The 
DERIVE® program Is de- 
livered with built-in 
standard equipment 

that delights 


EDITORS’ 
CHOICE 


May 29, 199D 
Dene, Version 16 


both 
math lovers and 
math phobics. 


High performance. 


DERIVE does numeric and symbolic 
equation solving, exact and approximate 
arithmetic to thousands of digits, calculus, 
trigonometry and matrices. It displays or- 
mulas in comprehensible 2D format using 
raised exponents and built-up fractions. It 
plots beautiful 2D curves and 3D surfaces on 
monochrome or color monitors. 


Compact and sporty. 


PC Magazine calls ita “small wonder.” All it 
takes is 512K of memory and one floppy disk 
drive. DERIVE takes to the road on PC com- 
patibles, and really gets around on laptop 
and even handheld computers! 


DERIVE is a Handcrafted 
registered trademark of Software 
Soft Warehouse, Inc. for the Mind 


Human engineering. 


You don't have to be fluent in computerese to 
use DERIVE. In fact, it’s the friendliest and 
easiest to use of any symbolic math package 
on the road today. 
lis menu-driven 
interface 


and 
on-line help 
make il easy- 
you ll soon be up to 
spced doing math. instead ol 
trying to learn how to drive the 
software. 


Freedom to maneuver. 


DERIVE’s automated expertise releases you 
from the drudgery of hand calculations. 
You can do problems you'd never attempt 
otherwise and obtain exact symbolic solu- 
tions, in addition to approximate numerical 
solutions. 


No sticker shock. 


DERIVE’s suggested retail price is $250. And 
of course DERIVE doesn’t require an expen- 
sive computer, a math co-processor, or even 
a hard disk drive. 

Order DERIVE through your favorite software 
dealer or mail-order house. For a list of 
dealers, write Soft Warehouse, Inc. at 3615 
Harding Avenue, Suite 505, Honolulu, HI 
96816. Or call (808) 734-5801 after 11 a.m. 
Pacific Standard Time. 


And happy DERIVEing! 


Soft Warchouse: 
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Another Year of Excellence 
D. C. Heath Mathematics in 1991 


The popular Brief Calculus— 
now part of a remarkable series! 


Brief Calculus with Applications 
Third Edition 
1991 Cloth 812 pages 


and . og 
Brief Calculus with Applications 
Alternate Third Edition 

1991 Cloth 648 pages 


Roland E. Larson and Robert P. Hostetler, 
both of The Pennsylvania State University, 
The Behrend College 

Bruce H. Edwards, University of Florida 


Finite Mathematics 
1991 Cloth 500 pages 


and 
Finite Mathematics with Calculus 


1991 Cloth 926 pages 


Roland E. Larson, The Pennsylvania State 
University, The Behrend College 
Bruce H. Edwards, University of Florida 


Also new for 1991: 


Understandable Statistics 
Fourth Edition 

Charles Henry Brase, Regis College 
Corrinne Pellillo Brase, 

Arapahoe Community College 

1991 Cloth 704 pages est. 


Elementary Linear Algebra 

Second Edition 

Roland E. Larson, The Pennsylvania State 
University, The Behrend College 

Bruce H. Edwards, University of Florida 
1991 Cloth 592 pages 


Arithmetic for College Students 
Sixth Edition 

D. Franklin Wright, Cerritos College 
1991 Cloth 528 pages 


Mathematics for 

Elementary Teachers: 

A Balanced Approach 

Second Edition 

Eugene F. Krause, University of Michigan 
1991 Cloth 928 pages est. 


Basic Mathematics 

Third Edition 

David Novak, Simmons College 
1991 Paper 656 pages est. 


For more information, call us toll-free: 
(800) 235-3565. 


D. C. Heath and Company 


125 Spring Street 
Lexington, MA 02173 


Just Published from HBJ 


INTRODUCTORY ALGEBRA When you asked for a 

and sensible, no-frills, main- 
INTERMEDIATE ALGEBRA stream calculus textbook at 
RONALD HATTON and a reduced cost for your 


GENE R. SELLERS 


both of Sacramento City College 


students, we heard you... 


Each Paperbound, 1991 CALCULUS 
seven acca nS es 
end Edition DENNY GULICK 
INTERMEDIATE ALGEBRA Po oot goss Ne 
YOSHIKO YAMATO and 

SINGLE-VARIABLE CALCULUS 
Naleaeeeiebine- ian WITH DISCRETE MATHEMATICS 


both of Pasadena City College 
Each Hardcover, 1991 


JOHN A. FEROE and 
CHARLES STEINHORN 


ALGEBRA FOR both of Vassar College 

COLLEGE STUDENTS Hardcover, 1991 

Third Edition 

BERNARD KOLMAN, ELEMENTARY LINEAR ALGEBRA 

Drexel University, and WITH APPLICATIONS 

ARNOLD SHAPIRO Second Edition 

Hardcover, 1991 RICHARD O. HILL, JR., 
Michigan State University 

FINITE MATHEMATICS Hardcover, 1991 

WILLIAM OWEN and 

FRED CUTLIP ABSTRACT ALGEBRA 

both of Central Washington DENNIS KLETZING, 

University Stetson University 


Hardcover, 1991 


Hardcover, 1991 


HARCOURT BRACE JOVANOVICH, INC. 
Alle College Sales Office 
7555 Caldwell Avenue, Chicago, IL 60648 


(708) 647-8822 
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An MAA 
Report 

on Teacher 
Preparation 


A CALL FOR CHANGE 
Recommendations for the 
Mathematical Preparation of 
Teachers of Mathematics 


James R. C. Leitzel, Editor 


How can we improve the teaching and 
learning of mathematics in our schools to 
better prepare our students for the future? 
We can begin by making some changes in 
the way our teachers learn and teach math- 
ematics. A CALL FOR CHANGE, a new 
MAA Report, offers a set of recommenda- 
tions that come from a vision of ideal mathe- 
matics teachers in classrooms of the 1990’s 
and beyond. It describes the collegiate 
mathematical experiences that a teacher 
needs in order to meet this vision. 


A CALL FOR CHANGE discusses stan- 

dards common to the preparation of math- 

ematics teachers at all levels; Elementary 

School Level (K—4), Middle School Level 

(5-8), and Secondary School Level (9-12). 

It shows what type of preparation teachers 

need to help them: 

m (i) view mathematics as a system of in- 
terrelated principles, 

@ (ii) Communicate mathematics accurately 
verbally, and in writing, 

mM (iii) understand the elements of mathe- 
matical modeling, 


weommendations For The 

3 ; 

ematical Preparation Of Teachers Of Mathemat 
atics 


An MAA @ Report 


Th 
2 e maematica Association Of Amenica 
In The Mathematica] Education Of Teachers 


James R C Leitze} Editor 


mM (iv) understand and use calculators and 
computers appropriately in the teaching 
and learning of mathematics, and 

™ (v) appreciate the development of math- 
ematics both historically and culturally. 


Mathematics continues to be a dynamic 
changing discipline. There is new mathe- 
matics that can be exciting for young people 
to learn and technology provides new ap- 
proaches for teachers to engage students 
in the teaching and learning of mathemat- 
ics. The mathematical preparation of teach- 
ers must adapt to these changing realities. 
Order your copy now of this important set 
of guidelines. 


60 pp., 1991, ISBN 0-88385-072-9 
List: $7.00 
Catalog Number CFC 


Order From 


4, Mathematical Association of America 
» 1529 18th Street, N.W. 
Washington, D.C. 20036 
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Recreations for the 
Mathematically Minded... 


FROM W. H. FREEMAN AND COMPANY 


CAN YOU WIN? 


The Real Odds for Casino Gambling, 
Sports Betting, and Lotteries 


MIKE ORKIN 


Mike Orkin lays out the hard, cold, and often 
amusing facts about gambling in this offbeat 
guide for novice and expert alike. He tells of 
the odds of winning and possible strategies for 
roulette, keno, slots, sports betting, blackjack, 
lotteries, horse racing, and “‘prisoner’s dilem- 
ma.’° Enjoyable reading and an invaluable 
reference. 


March, 181 pages, paper, O-7167-2155-4, $11.95 


MATHEMATICA* 
IN ACTION 
STAN WAGON 


Extend the power of Mathematica to the limits 
of the possible with alternative methods to 
generate three-dimensional graphics, iterative 
graphics, and animations. This book contains 
many valuable shortcuts, complete programs 
with line-by-line explanations, and hundreds 
of advanced examples worked in detail. 


March, 352 pages, 133 illustrations, 4 pp. color 
plates, paper 0-7167-2202-X, $29.95; 
Cloth 0-7167-2229-1, $39.95 


VISUALIZATION 


The Second Computer Revolution 
RICHARD MARK FRIEDHOFF 


Visualization explains why and how we use 
the computer to make images. Included in this 
comprehensive guide for the general reader 
are sections on computer animation, simula- 
tion of complex phenomena such as black 
holes and snowflakes, medical imaging, pro- 
cedural graphics, architecture with CAD 
systems, Numerical experiments, and the new 
“artificial realities.” 


March, 216 pages, 200 illustrations, including 136 
in full color, paper, O-7167-223 1-3, $24.95 


W. H. FREEMAN AND COMPANY 


The book publishing arm of Scientific American 


KEEP YOUR EYE ON 
THE BALL 


The Science and Folklore of Baseball 
ROBERT G. WATTS and A. TERRY BAHILL 


Could Sandy Koufax’s curve really “‘fall off a 
table?’ Did Ted Williams actually see the ball 
hit his bat when he connected? Engineers 
Watts and Bahill put some of baseball’s 
cherished myths to the test of scientific 
scrutiny in a highly informative and entertain- 
ing guide. 

March, 220 pages, 10! illustrations, paper, 
O-7167-2248-8, $12.95 


FRACTALS 


An Animated Discussion with Edward 
Lorenz and Benoit B. Mandelbrot 


A 63-Minute Video 


H.-O. PEITGEN, H. JURGENS, D. SAUPE, 
and C. ZAHLTEN 


This video presents exciting color animations 
of the world of fractals, chaos, and self- 
similarity. An accompanying 28-page booklet 
provides the algorithms and mathematical 
equations that underlie the fractals featured 
in the video. 


1990, 63-minute full-color video, VHS format, 
O0-7167-2213-5, $59.95 


THE MAGIC MACHINE 


A Handbook of Computer Sorcery 
A. K. DEWDNEY 


This is the second collection of Dewdney’s 
popular ‘Scientific Recreations’ columns, 
drawn from Scientific American. The author 
discusses some of today’s hottest topics, in- 
cluding chaos, computer viruses, and artificial 
landscapes. 


1989, 357 pages, 107 illustrations, cloth, 
0-7167-2125-2, $23.95; paper, 0-7167-2144-9, 
$15.95 


AVAILABLE AT FINE BOOKSTORES or order 
directly from: W/. H. Freeman and Company, 
4419 West 1980 South, Salt Lake City, UT 
84104. 


INTRODUCING 
E.Z. MATH 


For use with HP'S HP 48SX 


For Teachers, Students, 
and other Professionals. 


Graphs. E.Z. Math covers the entire high school and 
college graphing curriculum from elementary algebra to 
advanced calculus. The student can choose from 182 
families of equations, inequalities, functions, systems of 
equations and systems of inequalities, in rectangular, 
polar and parametric form, all laid out and arranged in an easy- 

to-use, logically organized system of menus. E.Z. Math makes the graphic 

analysis of polynomial, rational, trigonometric, hyperbolic, logarithmic and exponential 
functions quick and easy. 


HP 48SX 
$275.00 


Numbers. E.Z. Math makes it easy to handle numbers of various types. Elementary and high school 
students will find it easy to compute all the factors and the prime factorization of a natural number. In 
addition, finding the greatest common factor, least common multiple and average of any set of natural 
numbers is a snap. Any set of rational numbers, whether in whole number, fraction of mixed number 
form, may be added, subtracted, multiplied or divided yielding an answer expressed as a fraction or 
mixed number in lowest terms or as an integer. Any set of complex numbers may be added, subtracted, 
multiplied, divided, or raised to a power. Any number of terms of the sequences of perfect nth powers, 
triangular numbers, binomial coefficients, Fibonacci numbers and multiples of anumber can be easily 
computed. 


Savings. E.Z. Math makes it easy to solve problems involving a single deposit or repeated deposits to 
asavings account, certificate of deposit (C.D.), term deposit (T.D.), money market account or other such 
investment. Anyone interested in saving money can quickly and easily compute the amount accumulated, 
the amount that must be deposited, the number of years necessary to keep the money on deposit and 
the annual interest rate. 


Loans. E.Z. Math makes it easy to solve problems involving fixed rate mortgage on houses, condos, 
co-ops, and other real estate investments. Anyone wanting to purchase a home can quickly computed 
the monthly mortgage payment, the amount that can be borrowed, the number of years needed to pay 
off the loan and the annual interest rate and see a detailed breakdown of each payment in a complete 
amortization table. 


Recreation. E.Z. Math provides a couple of strategy number games and an introduction to the musical 
capabilities of the HP48SX. These are provided both for fun and as an encouragement to explore some 
of the many features of the calculator, including a bit of the built-in rich programming language. 
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The Cauchy Problem for the Wave Equation with Distribution Data: 
an Elementary Approach 


Cavin H. Witcox, University of Utah, Salt Lake City, UT 84112 


Cavin H. Witcox received his Ph.D. at Harvard under Garrett Birkhoff 
and H. Levine. He has held positions at Caltech, the Universities of 
Wisconsin, Arizona and Utah, and Denver University, and has served as 
visiting professor at the Universities of Geneva, Liege, Kyoto, Stuttgart and 
Bonn. He received an Alexander von Humboldt Senior U.S. Scientist 
Award in 1976. He has written some 60 research articles on boundary value 
problems, spectral analysis of differential operators in Hilbert spaces, wave 
propagation and scattering theory as well as three monographs: Scattering 
Theory for the d’Alembert Equation in Exterior Domains, Scattering The- 
ory for Diffraction Gratings, and Sound Propagation in Stratified Fluids. 


Introduction. The Cauchy problem for the wave equation in two space dimen- 
sions asks for a function u(x, y, t) that satisfies 


Uj, —Uy, —Uy, =h(x,y,t) W(x, y,t) © R* X [to,~) (1) 
and 
u(xX,y,to) =f(x,y), Uu,(x,y,to) =g8(x,y) W(x,y) ER’, (2) 


where f, g,h are prescribed functions on their respective domains. It was discov- 
ered in the early nineteenth century that a formal solution of the problem can be 
constructed by quadratures. The result is usually written 


u(x, y,t) 
1 ’ 
-~ ff ee 101k) 

2! D(x,y,t-t0) V(t — tp)? — (x - €)? — (y — 0)” 
Ce | ; 
o —|f egy 
Ot | 27! ! Dix, y,t-00) V(t — ty)? — (x - €)? — (y — 7)’ 
{1 h(é,1,7) 

FSD aca cane 
tol oT “Dix y.t- V(t — Tr)” — (x —- €)° -(y —- 7) 


(3) 


where D(x, y,7) = {(€, n)KE — x)? + (y — y)? < 7’) is the disk in R* with center 
at (x, y) and radius +. This relation is usually attributed to M, Parseval (?-1836). 
It can be derived from the well-known formula of S. D. Poisson (1781-1840) for 
the solution of the corresponding problem in three space dimensions; see 
Hadamard [4] for references to the original literature. In older texts, equation (3) 
is usually called Parseval’s formula, or the Poisson-Parseval formula; see, for 
example [2], [8]. 

Classical theory established that the Parseval formula defines a solution of the 
Cauchy problem whenever the data f,g,h have sufficiently many continuous 
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derivatives. This theory is too restrictive for the study of phenomena such as shock 
waves, which correspond to data that are nondifferentiable or even discontinuous. 
An extended theory which is applicable to these cases became available in 
1950-1951 with the advent of L. Schwartz’s theory of distributions [7]. This in turn 
made it possible to study physically important solutions, such as the fundamental 
solution, whose Cauchy data f, g,h are themselves distributions. 

The solvability of the Cauchy problem (1), (2) for Cauchy data f, g,h which are 
arbitrary distributions has been known for many years; see for example [5, p. 139, 
Theorem 5.6.3]. Unfortunately, the published expositions of this result that are 
known to the author stand at the end of long and intricate abstract theories which 
tend to obscure the simplicity of the final results. 

The purpose of this article is to formulate the Cauchy problem (1), (2) for 
arbitrary distribution data, and to prove the uniqueness and existence of a 
solution, by an elementary method that does not use concepts or theorems of 
functional analysis. The method is constructive and shows that Parseval’s formula 
(3), when properly interpreted, yields the solution of the Cauchy problem for all 
distribution data f, g,h. The exposition uses only the simplest concepts and facts 
of distribution theory. The necessary material may be found in the first 17 pages of 
Hormander’s book [5] of 1963. A more leisurely treatment of the same material is 
given in the first third of the book of Friedlander [1]. 


Classical Solutions of the Cauchy Problem. A classical solution of the Cauchy 
problem is a function u(x, y, t) € C?(R? X [t9, ©)) that satisfies conditions (1) and 
(2). Several methods are available to prove the uniqueness of classical solutions. 
One is the energy method; see, for example, [9], p. 280. A second method is to 
prove that any classical solution must be given by Parseval’s formula (3). Thus one 
has 


THEOREM 1. The Cauchy problem (1), (2) has at most one classical solution. 


The conditions f € C*(R*), g € C'(R’), h © C(R? X [to, ©)) are clearly neces- 
sary for the existence of a classical solution. However, it is known that they are not 
sufficient. Sufficient conditions may be derived from Parseval’s formula. The 
simplest result of this type is 


THEOREM 2. Parseval’s formula defines a classical solution u(x, y,t) of the 
Cauchy problem (1), (2) for all f € C°(R?), g € C7(R*), h © C*(R? X [t, ©)). 


Sketch of the Proof. The linearity of the Cauchy problem (1), (2) implies that if 
Uf, gh) denotes the solution for data f, g,h then 


Ucf.e,h) — Ugz,0,0) T Mo, 2,0) + 4EO,0,h): (4) 
Moreover, if the integral operator P, , ,(g) is defined by 


P.,.(8) = — || egg, (5) 
| Om PO (1? = (x =)? = (y= 0) 


then Parseval’s formula implies that 
Ue, go) X> y,t) = Pyoy ttf &)> (6) 
Ur F.0,0)( ¥> y,t) — (d/dt)P. ytd f)> (7) 
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and 
Uo,0,m)(¥s Yot) = JP. cyt, 57) dr, (8) 


where the last integrand denotes the action of the integral operator P, ,,_, on 
h(é, n, 7) with 7 fixed. The relations (7), (8) are called Stokes’s rule and Duhamel’s 
principle, respectively. The proof of Theorem 2 will now be sketched for the 
special case that f = 0, h = 0, tj = 0 and g € C?(R°). The general case will then 
follow from the decomposition (4)-—(8). 

Consider then the function v(x, y, t) which is defined for (x, y, t) € R* x [0, &) 
by 


u(x,y,t) = — ff tit (9) 
2m 11D yD Yt? — (x — €) — (yn) 


where g € C?(R°). In this classical form the integral is not well suited to proving 
the continuity and differentiability of v. The simple change of variables € > x + ¢té, 
7 2 y + tn yields the more useful representation 


t g(x +té,y + tn) 
y,t)h=— aa aannan, |, Doe 10 
69) Felwan yee 


The main tools in proving that (10) defines a classical solution of the Cauchy 
problem are Lebesgue’s dominated convergence theorem and Fubini’s theorem. 
Note that 1/ y1 —é 2 _ »* is unbounded but integrable on the disk D(0,0, 1). In 
fact it is in the Lebesgue class L,(D(0, 0, 1)) for 1 < p < 2. Thus for (x, y, t) in an 
e-neighborhood N(X%p, Yo, to) = x, y, Ox — x9)? + (y — yo)? + (t — ty)? < €7) 
the integrand in (10) is bounded by M/ y 1 — 2 — n? © L,(D(O,0, 1)) where M is 
the maximum of |g(é,7)| on the disk D(Xo, Yo, tg + 2€). The continuity of vu at 
(Xo, Yo. to) follows easily by Lebesgue’s dominated convergence theorem. 

The existence and continuity of the derivatives of v follow similarly. For 
example, if we define 


t g(x + t&,y + tn) 
v(x, y,t) = tooo Vireo dédn, (11) 


then v, € C(R? X [0, )) and Fubini’s theorem implies that 
v(x, y,t) — U(X, y,t) = if v,(é, y,t) dé, (12) 
Xo 


whence v, = v, © C(R?* xX [0, )). The same method can be used to complete the 
proof that v € C*(R? x [0,)). Finally, the verification that v satisfies (1) follows 
from a careful application of the divergence theorem to the integral representation 
for U,, — V,, — Vy,. Details are left to the reader. 

The method outlined above can also be used to prove 


CoROLLARY 3. The Cauchy problem (1), (2) has a classical solution u © 
C*(R* X [tp, ©)) if and only if f, g € C*(R?*) and h € C*(R? X [ty, ©)). 
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Distribution Solutions of the Cauchy Problem. A distribution-theoretic formu- 
lation of the Cauchy problem can be found by beginning with a classical solution 
u(x, y,t), and calculating u,, —u,, — uy, in the distributional sense. To this end 
it is natural to extend the definitions of u and h from R? X [t), ©) to all of R° by 
defining u(x, y,t) = h(x, y,t) =0 Wx, y,t) © R* X (—~,¢,). With this under- 
standing, one may regard u and h as elements of J'(R°) = the set of distributions 
on R? and f,g as elements of Y'(R*). Then, if ¢ € DB R*) = C5(R?) = the set 
of Schwartz testing functions, then the basic rules of distributional calculus and 
integration by parts give 


(uy, — Uyy — Uy, )(@) — u(¢,, — Oy — by,) 
= cr [ u(x, y,t)(by — bey — by) de dy dt 


_ fff Bony (x, ys) dedy de 


+f (Po, 9)6(x ¥.t0) 


+(x, y)b(x, y,to)} dedy 
=h(¢) + (f @ 5; )\(o) + (g @ 4, )(4), (13) 


where f ® 6;,, g ® 6,, are the tensor products of distributions f, g € J'(R*) with 
distributions 6,,,6;, © J'(R); see, for example, [1]. Thus the classical solution uw, 
when regarded as a distribution on R°, satisfies 


Uj, — Uy, — Uy, =k in D'(R?), (14) 
and 
supp u C R* x [t),%), (15) 
where 
k=h+f@6;+¢@6, © DR’), (16) 
and 
supp k C R* x [t5,%). (17) 


Note that conditions (14), (15) are meaningful for any distributions u and k in 
QJ'(R?). This suggests the 


DEFINITION. A distribution solution of the Cauchy problem (1), (2) with distribu- 
tion data f, g € D(R*) and h € D'(R°) with supph Cc R? X [tp, ©) is a distribu- 
tion u € DR?) that satisfies (14), (15) and (16). 

Note that in the distributional formulation of the Cauchy problem there are no 
explicit initial conditions. Instead, the initial conditions are included as additional 
terms f ® 6;,, g @ 6,, in the source term k. 


The Existence and Uniqueness of Distribution Solutions. The goal of this 
article is to present an elementary and constructive proof of 


THEOREM 4. The Cauchy problem (14), (15) has a unique distribution solution 
u € JR?) for every distribution k € ZY'CR°) that satisfies (17). 
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Proof of Uniqueness. The strategy of the proof is to show that any solution 
u € YR?) of (14), (15) must be given by a suitable version of Parseval’s formula 
(3). With this in mind it is helpful to consider the functions 
U4( Xx, y,t) — u( p(x’ — XX, y’ —y,t' _ t)) 


(18) 
k (x, y,t) = k(¢(x' — x,y’ —y,t’ a t)), 


where ¢ € D(R°*) and u and k act on the variables x’, y’,t’. u, and ky are the 
convolutions of u and k, respectively, with the testing function ¢(—x, — y, — ft). 
Hence a standard result of distribution theory implies that u,,k, € C*(R°) and 
differentiations with respect to the variables x, y, ¢ commute with the actions of u 
and k; see, for example, [5, p: 14]. These properties are also easy to verify directly. 

u, and ky, may be thought of as smoothed versions of u and &. It will be shown 
that uy is a classical solution of the Cauchy problem 


(a* /at? — a? /ax? — a’ /dy*)ug=k, W(x,y,t) © R*X[t,,~), (19) 
Ug(X,y,t;) =0, (A/dt)ug(x,y,t,)=0 W(x,y) eR’, (20) 


where t, = t,(@) < 0 is a suitable number. To verify these equations note that the 
rules of distributional calculus and equation (14) imply that 


(0? /at? — a? /ax* — d°/dy”)ug(x, y,t) 
= u((d? /at? — a? /ax? — a? /dy”)h(x' — x,y’ —y,t'—t)) 
= u((d?/at’? — a? /ax'? — a? /ay’)b(x' —x,y'—y,t' —t)) 
= (Uy, — Uy, — Uyy)(O(x' — x,y’ —y,t’ —t)) 
=k(o(x'-x,y’—y,t'-t)), Wx, yt) eR, (21) 
which implies (19). Next, note that 
(x'—x,y’—y,t'—t) € supp @ (x’, y’,t') © (x, y,t) + supp. (22) 
Thus if T(d¢) = Max{t'l(x’, y’, t') © supp ¢} then 
T(@) +t = Max{t'l(x', y’, t') © supp d(x’ —x, y’—y,t'—t)}. (23) 


In particular, (15) implies that supp u M supp ¢(x’ — x, y’ — y,t'’ — t) = O when 
T(p) +t <ty or t < ty — T(@). Hence, if t, = t) — T(¢) then suppu, Cc R*x 
[t,,0) and similarly supp k, C R* X [t,,%). In particular, (20) holds. Note that it 
may be assumed that T(@) > fo, and hence ft, < 0, since otherwise uy, and ky are 
identically Zero. 

Equations (19), (20) and Theorems 1 and 2 imply that u, and ky are necessarily 
related by Parseval’s formula: 


1 t k3(€,7,7) 
J») —= 2 ——— (] d d 
Ug( x, y,t) mI Nec y 1-0 Goan (gn é)jo (yon)? Edndr 
(24) 


for all (x, y,t) © R* X [t,, ©). In particular, since t, < 0 and u,(0, 0,0) = u(¢), by 
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(18), one has 


1 0 k4(é, 7,7) 
u(¢d) = inl, Vovo.0, <0 Jp? oa dé dn dr. (25) 


Thus any solution u of (14), (15) is related to the source term k by (25). This 
completes the proof of the uniqueness statement of Theorem 4. 


Proof of Existence. It is natural to use (25) to construct a distribution solution of 
the Cauchy problem (14), (15). Recall that in (25) t, = t, — T(¢é) depends on @. 
However, since supp ky C R? x [t), ©), one can write 


1 O k4(é, 1,7) 
u(¢) = Ia oio0,-n Yeon? dé dn dr. (26) 


The existence proof will be completed by showing that, for any k € D’(R?) with 
supp k C R* X [t), ©), equation (26) defines a distribution u € Z’(R°) that satis- 
fies (14) and (15). 


Proof that (26) defines a distribution. For each 6 € B(R*) one has k, € C*(R?) 
and supp k, C R* X [t,,~). Hence, V ¢ € A(R?), (26) defines a finite number 
u(d@). Moreover, the linearity of u(@) is evident from (18) and (26). It remains to 
show that u(@) is continuous in the distribution sense. This means that to each 
compact set K Cc R° there must correspond constants C = C(K)>0 and N = 
NCK) > 0 such that one has [5, p. 4] 

lu(p)l<C YY  suplDe DD?! (27) 
a,ta,+a,<N K 
for all d € DR’) with supp d C K. To verify this define T(K) = Max{t\(x, y, t) 
© K} and assume that T(K) > t, (since otherwise u(@) = 0). Then t,(K) = t, — 
T(K) < 0 and V ¢ € Q(R*) with supp ¢ < K one has supp ky C R? X [t,(K), ») 
whence 


it k4(é, 7) >» T) 
u(d) = rN. Tee dé dndr, (28) 
where 


K'= {(é,0,7)Ir + Vé2 + 7? <Oandt,(K) <7 <0} (29) 


is the solid cone with vertex at the origin and base D(0,0, — t,(K)) in the plane 
7 = t,K) < 0. Equation (28) will be used to prove (27). 
‘ The sets K and K’ are compact and so is their vector sum 
K+K'={(x4+x',y+y',t+t')i(x, y,t) © K and (x', y’,t') © K'}. (30) 
Hence, since k € B'(R*) by hypothesis, 3 constants C’ = C(K + K’), N' = 
NCK + K’') such that 
Ik( pd) <C' y, sup |D@'D%2D%4 (31) 


a, t+a,+a,<N' K+K' 


for all @¢ € DR’) with supp? CK + K'. In particular, if supp@ CK and 
(é,y,7) © K'’ then (x’ — é,y’—7,t'’ -—7) € K = (x', y',t') © K+ K'. Thus 
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b(x' — €, y'—»,t' — r), as a function of (x’', y’, t’) with (€,7, 7) € K’ fixed, has 
support in K + K' and (31) is valid with @ replaced by this function. Hence (31) 
and (18) imply that 


Iky(€,757)| =|k(o(x' _ g, y’ ~ n,t' ~ T))| 
<c sup | De'Dy?DPd(x — €,y — n,t-—7) 


@, +a ,+a,<N' K+K’ 


» (32) 


provided that supp @ C K and (é, 7,7) € K’. Moreover, for each fixed (€, n, T) € 
K', the support of (x — €, y — n,t — 7), as a function of (x, y, t), lies in (€, n, T) 
+ K c K' + K, whence (32) can be written 


lkylén7r)(<cC YF sup [De D@DB(x -é,y—7n,t—7)| 


a,ta,t+a,<N' (€,n,7)+K 


= C' > sup | D' D2 D2o(x, y,t)| (33) 


a, +a,+a,<N' 


V ¢ © A(R’) with supp @ C K and V (é, n, 7) € K'. Relations (28) and (33) imply 
(27). To see this note that the right hand side of (28) with k4(é, 7,7) = 1 is an 
elementary integral with value (1/2)t?(K). Thus, (28) and (31) imply 


1 
lu(d)l< st7C’ = sup |D De7™4), (34) 
2 a,+a,+a,;<N' K 


which is equivalent to (27). 

Proof that u satisfies (14). It must be shown that V @ © DCR?) one has 
(uj, — Uz, — Uy, Mb) = k(). Now the definition (28) and the rules of distribu- 
tional calculus give 


(u,, — Uy, — uy, )( od) = u(d,, a Py 7 ,,) 


-=< fff 77) te dmdr, (35) 


where J, is given by 
Ig(€,0,7) =kg 4-6, (6 77) 
= k((a? /at? — a? /ax* — a? /ay?)b(x — £,y — 9,t-7)) 
= k((a?/ar? — a? /ae? — 8? /dn*)b(x — Ey — 0, t — 7)) 
= (0° /dr* — 0° /aé? — 7 /dn*)kg(E, 0,7). (36) 


In particular, 1, € C*(R*) and supp/, C R? X [t,,%). Thus, k, is the classical 
solution of the Cauchy problem 


(a°/dr? — a°/aé* — 7 /dn*)k, =1, 0 (E,7,7) © R* X [t,,%), (37) 


kg(€,7,%) = 0, (d/at)k (én, ¢,) = 0 V (&, 7) e R’. 
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and Theorems 1 and 2 imply that 


1 ont l4(€,7,7) 
ky(x,y,t) = — eee dé dy 
sl ysF) = (t—7)’—(x-€)?-(y-7)’ panes 
(38) 


for all (x, y,t) € R* x [t,, ©). Finally, combining (35) and (38) gives 
(uy, —U,, — Uy,)(¢d) = k,(0,0, 0) = k(¢) Vo € B(R?). (39) 
Thus uw satisfies (14). 


Proof that u satisfies (15). It must be shown that if ¢ € D(R*) then u(d) = 0 
whenever supp ¢ C R* X (—~, f,). But in this case one has T(@) = Max{t\(x, y, t) 
© supp $} < t, and hence one may take t, = tp) — T(¢) = 0. This gives u(d) = 0 
by (25). 

This completes the proof of Theorem 4. 


The Fundamental Solution. A fundamental solution for the wave equation is a 
distribution solution of the wave equation 


Ui, — Uy, — Uy, = 4, (40) 
where 6 is the Dirac delta distribution in R°*, defined by 
5(¢) = £(0,0,0) Whe DR’). (41) 


Theorem 4 implies that there is a unique fundamental solution G that has its 
support in the half-space R* x [0, 0). Thus 


G,, — Gy, — Gyy =6€ J R°), (42) 
supp G C R? X [0,). (43) 


In some texts G is called the Green’s function, or the Riemann function, of the 
wave equation. Here G will be constructed explicitly and used to represent the 
solution of the distributional Cauchy problem (14), (15). 

The forward and backward wave cones with vertices at the origin will be 
denoted by I, and I_, respectively. Thus 


Pi = {(x, y,t)|t — yx? +y? > ot, 
r= {(x, y,t)lt+ Vx2+y?2 < of. 


Note that equation (26) which defines the solution of the Cauchy problem can be 
written 


(44) 


1 k( (x —-€,y—7,t—7)) 
u(¢d) = aN. aaa dr, (45) 


where k acts on the variables (x, y, t). Of course the integrand in (45) vanishes 
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outside a compact subset of _. On taking k = 6 one gets 


=f SS dé dn dr 


VP -2 7. 


1 (é,7,7) 
aS. 72 — £2 — yf? dé dn dr, 
since [_ is carried into [, by the mapping (é, 7,7) ~ (—é, — n, — 7). This 
result shows that the distribution G is actually a locally integrable function with 


supp G = [,. It may be written 
1 H(t — yx? +y?| 
G(x, y,t) = 
Jar 1? — x2 — y? 


G(¢) 
(46) 


(47) 


where H(t — ¥x* + y”) is the characteristic function of I’ ,. 
The principle of superposition for the Cauchy problem is expressed by 


THEOREM 5. WV k © YR?) with supp k Cc R? X [to,) the solution u of the 
Cauchy problem (14), (15) satisfies 


u=G*«k. (48) 


Proofs of equation (48) are given in most standard texts; see [5], p. 139 or [1], 
Ch. 5. It may also be verified directly from equation (45) by using the easily verified 
relation 


(G*k)(b) = (G*k,)(0,0,0). (49) 


Concluding Remarks. A great advantage of the distributional formulation of 
the Cauchy problem is that it permits a definitive discussion of the existence, 
uniqueness and construction of the solution without a priori assumptions about the 
Cauchy data f, g,h. Questions concerning the regularity of the solution are then 
secondary, to be answered by means of the solution formulas (45) or (48). As a 
concluding example, conditions will be given which guarantee that the original 
Parseval formula (3) define a locally integrable solution. Parseval’s formula with 
f =g =0can be written 


oi. h(&,7,7) 
99° WI wooo leo ow 
(50) 


where ['_(x, y, t) is the backward wave cone with vertex (x, y, t), defined by 


P_(x,y,t) = {(én,7)M(r — 0) + V(x - €) + (y—2)° < Oh. 


Formally, (50) is just the convolution equation (48). Now it is easy to verify by 
direct integration that G(x, y, t) € LiP*(R°*) for 1 < p < 2, where 


Live = fies y, NPP pC y,t)|" dxdydt < V compact K CR}. 
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It follows by Hélder’s inequality that if h © Li*(R*) with 2<q < + (and 
supp h C R* X [t,, )) then the integral in (50) converges absolutely for all (x, y, t) 
© R°. In this case the function u(x, y,t) defined by (50) is in L'°°(R*) and 
coincides with the distribution solution of the Cauchy problem. A proof may be 
obtained from equation (45) by writing 


k(o(x —é,y—7,t-7)) — [ff h(x,y,t)d(x —é,y —n,t —1) dxdydt 
R3 
and using Fubini’s theorem to write u(@) as 


u(d) = [JJ ule, y,t)o(x,y,t) dxdydt. 


A similar argument shows that Parseval’s formula with g € Li(R*), 2<q< +, 
and f = 0, h = 0 defines a function uo, 9%, y,t) € LP(R*) which coincides 
with the distribution solution with source term k =g ®6,. If f © LO*(R*), 
2<q< +, and g = 0,h = 0, one can only say that u,¢ 9 9) 1s the distributional 
t-derivative of a function P, , ,_,(f) in LP°CR?). 
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Continuous Nowhere-Differentiable Functions — 
an Application of Contraction Mappings 


HipeEFumM! KatTsuuRA, San Jose State University 


HipDEFUMI KaTSUURA was born and raised in Wakayama about 300 miles 
south of Tokyo, Japan. He came to the United States to study at the 
College of Charleston, where he received his B.A. He earned his Ph.D. at 
the University of Delaware under Professor David P. Bellamy. Currently, he 
is teaching and doing research in point set topology and in analysis. 


1. Introduction. The existence of a continuous nowhere-differentiable function 
is well known, and many such functions are known. Most of them are variations of 
the following examples. 


Examples of nowhere differentiable functions: 


ae | 
If f(x) = YL 5m COs(3"x), (1) 
n=0 
then f is a continuous nowhere-differentiable function. (This is due to 
Weierstrass. See [2, p. 195].) 
(2) Let g(x) be the distance from x to the nearest integer. If 


2 | 
f(x) = L ze2"*), 
n=0 2 
then f is also a continuous nowhere-differentiable function. (See [3, p. 115].) 

The above examples have concise definitions and establish the existence of 
continuous nowhere-differentiable functions. However, it is not easy to visualize or 
guess what their graphs look like, let alone to see intuitively why they work. Our 
continuous nowhere-differentiable function f: [0,1] ~ R is the uniform limit of a 
sequence of piecewise linear continuous functions f,: [0,1] - R with steep slopes. 
In constructing the sequence ¢ f,), we will be using a contraction mapping w from 
the family of all closed subsets of X = [0,1] x [0,1] into itself with respect to the 
Hausdorff metric induced by the Euclidean metric. (A contraction mapping on a 
metric space (Y,d) is a function g: Y — Y for which there is a positive constant 
k <1 such that d(g(x), g(y)) < kd(x, y) for every x, y © Y.) It is interesting to 
note that, in our example, if A is any non-empty closed subset of the square X, 
even if it is a singleton set, the iterated sequence (w"(A)) of sets converges to the 
graph of f in the Hausdorff metric. If A is the diagonal of slope 1 in the square X, 
then w”(A) is the graph of f,, and hence, a reader can obtain an intuitive idea of 
the graph of f. This idea first occurred to me when I attended Mr. Gary Church’s 
master’s thesis defense at San Jose State University at which he talked about 
attractors of contraction mappings (see [4]). 


II. The construction of a continuous nowhere-differentiable function. Let X = 
[0, 1] x [0,1] be the closed unit square in the plane with the usual Euclidean 
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metric. Let w,;: X — X, i = 1,2,3 be contraction mappings defined by 


x 2y 
w(x) = (5.5), 
2-x 1t+y 
wy) = (=, ]. 
and 
2+x 1+2y 
s(n y) = | 3 ) 3 | 


(See Ficure 1.) 


Fic. 1 


In Ficure 1, w, shrinks X onto w,(X), with the fixed point (0,0), w, flips X 
about the line x = 1/2 and shrinks it onto w,(X’), with the fixed point (1/2, 1/2), 
and w; shrinks X onto w,(X), with the fixed point (1, 1). 

Let F(X’) be the collection of all non-empty closed subsets of X. We define a 
function w from FCX) into itself by 


w(A) =w,(A) Uw,(A) Uw,( A) 
for every A in F(X). For every A, B in FLX), let 
d,,(A, B) = inf{e > 0: N,(A) > B and N.(B) > 4}, 


where N.(A) and N.(B) are e-neighborhoods of A and B, respectively. Then d,, 
is a complete metric on F(X), and is called a Hausdorff metric. It is interesting to 
note that the function w is a contraction mapping on F(X) under this metric d, 
(see [5]). 

Let Dy = {((x, x) € X} be the diagonal in X. For each n = 1,2,3,..., let 
D,, = w(D,,_,). Then for every n = 0,1,2,..., D, is the graph of a continuous 
function, call it f,, from [0, 1] onto itself. 

Note that if m <n, then w"(X) 2 D,,, and w"(X) is the union of 3” rectangles of 
heights < (2/3)". Hence, we have 


sup{|fin(t) — fat)l: ¢ © [0, 1]} < (273)". 
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1 1 


Fic. 2. w(X) and D,. Fic. 3. w*(X) and Dy. 


Fic. 4. w3(X) and D3. 


This implies that the sequence of continuous functions <f,,) is uniformly Cauchy. 
Thus there is a continuous function f from [0, 1] onto itself such that f,, converges 
to f uniformly on [0, 1]. We will prove that this function f is nowhere-differentiable 
on the interval (0, 1). 

Since w is a contraction mapping on FCX), it has a unique fixed point D in 
FCX) (see Theorem 4.48 on page 92, [1]). Note that if A is an arbitrary element of 
F(X), then the sequence (w”(A)) converges to D with respect to the metric 
d,,. Hence, D, converges to D and D is the graph of f. If we let B= 
{(0, 0,(1/2, 1/2), (1, 1)} be the set of fixed points of w,, w,, and w,, then the 
sequence (w"(B))> in FLX) also converges to D. Moreover, D > w"(B). Hence, if 
one plots the points of w"(B) for a large n, then not only does it resemble the 
graph of f, but actually it is a part of the graph of f. 


III. Proof of the nowhere-differentiability of f. Let 7 be the set of all ternary 
rationals in the interval (0,1), and for each n = 1,2,3,..., let J,, be the set of 
n-digit ternary rationals. 


Lemma 1. Let (x,) and <y,) be sequences in T such that, for eachn = 1,2,..., 
(1) x, and y, are in T,, 

(2) y, — x, = 1/3", and 

(3) either x, =X,41 OF Yn =Ynet- 
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414 
Then 
_ | fy,) — f(+,) 
lim | ————————_ | = 
no yy, — Xp 
Proof. We claim that 
fC y,) — f(x, 
fn) = fen) > 2"! (n = 1,2,3,...). 


Yn — XxX, 


First, note that if x © 7,, then f(x) = f,(x). If n = 1, then 


2 
3 
—=2 if (x, = 0, and y, = +) or(x, = 2 and 
fo.) Ax) | 4 (ar Strand es sD ors sans 
7” 1 
y,—x 3 
mt ~=-1 ifx,=4andy, =2. 


Hence, if m = 1, we have proved the claim. 
Suppose, for some integer k > 1, the claim is true, i.e., 


|f(y,) — f(x,)| > 2k! |¥_ — X,|. 


Then 
f( yx41) — f(%,41) _ SlfCy,) — f(x,)| 
Ve+1 — *k+1 31Vx — x,| 


(see Ficure 5 for the case x, = x,,4,) 
2*- "ly, — x, 


lV, — xX, 


by the induction hypothesis 
= 2(k+I-1 


which proves the lemma, by induction. 


(2/3) fCy,) — f(x,)) 


Xx Ye+l Vy 


N \ayany, = x) 


XK+1 


Fic. 5 


[May 
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LemMa 2. Let (x,) and <y,) be sequences in T such that for each n = 
1,2,...,x,,y, <= T,, and y, — x, = 1/3”. Suppose x, #xX,4, and y, #y,4, for 
infinitely many n = 1,2,... . Then 


. f(y,) a f(x,,) 
lim ———_—_ 


n—oco yy, _— Xn 
does not exist. 


Proof. First, we claim that 


f(y,) — f(x,) 


Yn — Xn 


>1 foreach n = 1,2,3,... 


From the proof of the previous lemma, this is surely true if n = 1. First, if 
X, =Xzp4, OF Vy = Yz41, then as in the previous lemma, we have 


fC yea) — Fx,41) slf( y,) — f(x,)| 


1 
Ve+i  *k+1 31V x — X,| 


(see Figure 5 for the case x, = X,,1) 
>2°:1 
by the induction hypothesis 
> 1. 


On the other hand, if x, #x,,, and y, # y,,,, then 


FO Yn41) — £% 441) _ 3lf( y,) — f(x,)| 


1 
Ve+i — *k+1 31Ve — Xql 
(see Figure 6) 
> 1 


by the induction hypothesis, which proves the claim. 


Now, if 1 is an integer such that x, #x,,, and y, #y,,,, then 
Ynt1 *n+1 5(¥, — Xn) 
(see Figure 6) 
_ ff) — £C,) 
ie 


So surely if the limit exists it must be 0. But by the claim proved above, it can’t be 0 
either. This means that the limit does not exist. 
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(1/3) fy,) — fx,)) 


Xp Xk41 Ye+1 Yk 


Fic. 6 


The following is well known. 


LemMa 3. Let g be a real valued function on [0,1] differentiable at t,0 <t < 1. 
If, for every n = 1,2,3,..., 0<x, <t<y, <1, and ifx, > tandy, — t then, 


g(y,) — g(x,) 
m — ee 
—X 


no yy, n 


g'(t). 


THEOREM. The function f: [0,1] — [0, 1] is not differentiable at any x € (0, 1). 


Proof. lf x € T, for some integer n, then let y, =x + 1/3"** for every 
k =1,2,.... Then lim, _,.. y, =x and 


f(y,) — f(x,) 


Ve Xk 


lim 


k—- a 


by Lemma 1. Hence, f is not differentiable at x. 

On the other hand, if x ¢ T, then there are sequences (x, and ¢<y,,) such that, 
for each n = 1,2,..., 

(1) x, and y, are both in T,, 

(2) y, — x, = 1/3”, and 

(3) a, <x <b,. 

Then by Lemmas 1 and 2, we know that 


_ f(y,) — f(x,) 
lim. ———_——_ 
no yy, —X, 


does not exist. Hence, by Lemma 3, f is not differentiable at x. 
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LETTERS TO THE EDITOR 


Correction to “Overdetermined Systems of Linear Equations” 
(This MontTnuLy, 97 (1990) 511-513) 


The result announced in this paper is false. It says that “the least squares 
solution to an overdetermined system of linear equations AX = Y is the point 
such that the sum of the squares of the distances from the point to each of the 
subsets defined by the equations is a minimum.” This conclusion followed from the 
erroneous step (A'D2A)'14'D*Y = (D74'A) 'D*A'Y, where D was a positive 
diagonal matrix. 

If the assertion were true, it would imply that the systems AX = Y and 
DAX = DY, where D is any diagonal matrix, have the same least squares solution, 
but that is not true. However, there are interesting connections between two such 
systems. 

The system DAX = DY is commonly called a weighted least squares system, 
and the weights d,,; modify the least squares solution. Such row scaling operations 
defined by D are nevertheless useful. Let R be the residual vector R = Y — AX. 
Assigning a relatively large weight to the 7th equation will cause the corresponding 
component of the residual vector to be smaller. Thus, if some components of the 
data vector Y are known with more absolute accuracy than others, one can 
introduce relatively larger weights to these components [2]. 

This example in fact thus provides an interesting illustration for a linear algebra 
class of a “solution” (least squares) to a linear system that is not invariant under 
elementary row operations of the augmented matrix, and where this lack of 
invariance turns out to be very useful. 
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Gareth Williams 


To the Editor: 


1. The article, ‘“Conway’s Challenge Sequence” (January 1991), opens with the 
statement that J. H. Conway “introduced” the sequence 1,1, 2,2,3,4,... in “a 
lecture at AT&T Bell Labs [in July, 1988]”. The author later notes (p. 19) that a 
generalization of Conway’s sequence is the subject of Monthly Problem E3274. I 
think it should be noted that the appearance of E3274 (June 1988) preceded 
Conway’s lecture. 

2. Newman and Parsons (January 1988, p. 44) give a short proof that every 
sequence of reals has a monotone subsequence; they then say (p. 45), ““We have 
been unable to find [the proof] elsewhere [than in Newman’s 1982 book, A 
Problem Seminar |’. My colleague, George W. Booth, has found a similar proof in 
W. Maak’s An Introduction to Modern Calculus, p. 29 (Holt, Rinehart & Winston, 
1963). 


Sincerely yours, 


David M. Bloom 
Brooklyn College of CUNY 
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Lemma 4. Let (i,j, Aj;;) and (k,l, A,,) be two distinct entries of H having 
minimum and maximum values, respectively. By querying A,, and doing a constant 
amount of extra computation, we can reduce the size of H by one, while preserving 
properties P1 and P72. 


Proof. By property Pl, i # k and j #1. We say a row or column is remaining if 
it has a representative in H. We divide the analysis into three cases depending on 
the value of A,,. 

Case 1. Aj, < Aj; < A,,. Any strict saddlepoint in column / is no larger than 
A,,. However, A;, < A,;, so by Lemma 2 column / cannot contain a strict SP, and 
we can eliminate this column entirely. By Lemma 3, the only possible strict SP in 
row k is A,,, which we have already ruled out. Consequently, we can delete the 
entry (k,/, A,,) from H. 

Case 2. A;; < Ay, < Aj. This case is symmetric to case 1; here we eliminate 
row i and column j and, consequently, delete the entry (i, j, A,,;) from H. 

Case 3. A;; < Aj, < A,;. By Lemma 3, the only possible strict saddlepoints in 
column j or row k are A;; and A,,. The first inequality rules out A;;; the second 
rules out A,,. Hence, the column j and the row k can be eliminated. The row i 
and the column 7, however, cannot be eliminated yet. We, therefore, delete 
(i, j, A,,) and (k, 1, A,,) from H but insert (i, /, A,,). This preserves the properties 
of H while decreasing its size by one. 

This completes the proof. 


Once we have eliminated all but one entry as a possible saddlepoint, then to test 
whether this last entry really is a saddlepoint, we make comparisons with all other 
entries in its row and column; this requires 2m — 2 additional queries. Finally, we 
can store H as a min-max heap so that the operations Delete-Min, Delete-Max, 
Find-Min, Find-Max and Insert can be performed in worst-case time O(log n) [1]. 
This proves our main result. 


THEOREM 5. Given anim Xn matrix A, where m > Nn, we can determine whether 
A has a strict saddlepoint, and report such an entry, by querying O(m) matrix entries 
and doing O(m log n) additional computation. 
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Circumscribed Circles 


ROBERT OssERMAN 
Mathematics Department, Stanford University, Stanford, CA 94305 


In a paper giving a new derivation of the four-vertex theorem [1], I stated 
without proof some elementary lemmas about circumscribed circles. The argu- 
ments needed are familiar to those who work in the field, but are not completely 
obvious. In rethinking those arguments, I noticed that the statements hold in far 
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greater generality than 1s generally realized. Since I do not know of a reference, I 
thought it worthwhile to provide proofs. 


DEFINITION. For any set £ in the plane, a circumscribed circle is a circle C such 
that E is included in the closed disk determined by C and not in any disk of 
smaller radius. 


THEOREM 1. Every bounded set E in the plane containing at least two points has a 
unique circumscribed circle. 


LEMMA 1. Let C, be a circle of radius r, and center c,, for k = 1,2. Let 
r = max(r,,r,}, and let d be the distance between c, and c,. If d <r, + r,, then the 
intersection of the disks D,, D, corresponding to C,,C, lies in a disk of radius h, 
where 


h? =r? — (S) (1) 


Proof. Since D, M D, is included in the intersection of the disks of radius r 
centered at c,,C,, it is sufficient to prove the lemma when r, =r, =r. Then 


d < 2r, and the picture is 


The assertion is clear from the picture and also follows immediately from the 
equations; assuming the centers are at (0,0) and (d,0), a point (x, y) in D, ND, 
must satisfy 


and 
(x-d)t+y2<r?. 
Adding the inequalities, we find that (x, y) satisfies 
2 2 
[x - 5] ty? <r? - a 
so that it lies in a disk centered at (d/2, 0) with radius h given by (1). This proves 
the lemma. 


CorOLLARY. If a circumscribed circle exists, it is unique. 


1991] NOTES 421 


Proof. If C, and C, were both circumscribed circles of E, they would have to 
have the same minimum radius r, and the corresponding disks D,, D, would have 
nonempty intersection, since E C D, 1 D,. Hence the distance d between the 
centers must satisfy d < 2r. By lemma 1, £ lies in a disk of radius h <r, 
contradicting the definition of a circumscribed circle. 


Proof of Theorem 2. Since E is bounded, it lies in some disk of radius M. Let 


S = {r <M: E lies ina disk of radius r} 


and 
r, = inf r. 
° S 
Since FE contains at least two points whose distance apart is m, say, it follows that 
” 0 
rn z=  ~— > UV. 
°* 2 


Choose a sequence of r, in S such that 
lr, > a n>, 


The corresponding closed disks D, of radius r, and center c,, all include E. We 
shall show first that the points c, tend to a limit, cy, and second that E lies in the 
closed disk D, of radius rg centered at Cp. 

Given e > 0, there exists N such that 


r,<%ote for non. 


For m,n > N, let r = max{r,,,r,} and let d be the distance between c,, and c,,. 
Since E CD, 1 D,, we have 


d<r,+r,<2r. 


By lemma 1, EF lies in a disk D of radius h, where 
2 
he =r*— (5 
Then h € S, and therefore h > r,. Hence 
d 2 
(5 =r*—-h’ <r*?-r6=(r—ro)(rt+ro) < 2Me. 


If the coordinates of c, are (x,, y,), it follows that for m,n > N, 
Ix, —X,,l 


<d<2vV2Me. 
lv, — Vol 


Hence {x,} and {y,} are Cauchy sequences, and must converge tO X9, yo; SO C, 
converges tO Cy = (Xp, Yo). 

To show that the circle of radius r, centered at cy, is a circumscribed circle of 
E, it is sufficient to show that E lies in the corresponding closed disk Dp, since E 
cannot lie in any disk of smaller radius, by the definition of r). But any point (x, y) 
outside D, satisfies 


(x —x))° + (y —y,)* = R’, R> ro. 
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If 6 = R — ry, choose N so that n > N implies 


(Xn — Xo)” + (Yn = Yo)” < (>) 


and 


6 
Slot 5: 


Then since E Cc D,, by the triangle inequality every point of E has distance to cy 
at most 


6 
rn,t>< 


0 0 
n 5 rota} tgrR 


2 


so that (x, y) is not in E. This proves the theorem. 


THEOREM 2. Let E be a closed bounded set in the plane containing more than one 
point, and C its circumscribed circle. Then EQ C either consists of two points at the 
ends of a diameter of C, or else contains at least three points. 


LEMMA 2. Under the hypotheses of theorem 2, every closed semicircle of C 
contains a point of E. 


Proof. Let C be the circle x? + y? = r’, and let S denote the closed semi-circle: 
{(x, y) € C: x > 0}. Suppose that E MS is empty. Then, since E and S are both 
closed and bounded, there is a positive distance 6 between them. Let C’ be the 
circle C shifted a distance 6/2 to the left. Since any point inside C and outside C' 
is at a distance at most 6/2 from the point of S with the same y-coordinate, it 
follows that such points cannot be in FE. Hence E lies in the intersection of the 
disks determined by C and C’, and by lemma 1, E lies in a disk of radius less than 
r. But that contradicts the assumption that C is the circumscribed circle, and 
hence the lemma is proved. 


Proof of Theorem 2. The proof is immediate from Lemma 2, since if EMC 
were empty, contained a single point, or consisted of two points lying in an open 
semi-circle, then there would be a closed semi-circle of C containing no points 
of E. 


Remark. The following variant of Theorem 2 also follows from Lemma 2, with a 
bit more argument. 


THEOREM 2'. Under the hypotheses of Theorem 2, if E A C does not contain a 
pair of diametrically opposite points of C, then E Q C contains three points with the 
property that any open semi-circle of C contains at least one of them. 


REFERENCE 
1. R. Osserman, The four-or-more vertex theorem, Amer. Math. Monthly, 92 (1985) 332-337. 
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Stirling’s Series and Bernoulli Numbers 


EuiAs Y. DEEBA and DENNIs M. RODRIGUEZ 
Department of Applied Mathematical Sciences, University of Houston —Downtown, 
Houston, TX 77002 


Introduction. In an article published in the MONTHLY (see [4]), Professor Victor 
Namias gave a simple derivation of Stirling’s asymptotic series 


1 1 1 
— n —A 2 ——_—_— —_—__ — + ———————— + o ee 1 
ees ee Nem exp| 12n 360n? —-:1260n° | (1) 


or equivalently 


[(n) =n" 'e~-"y2a0n ex > | , (2) 


k=1 
where the a,’s were to be determined. 
Professor Namias’ elegant derivation was based on the Gauss Legendre duplica- 
tion formula 
1 
vir 


His derivation gives a recurrence relation for the coefficients a, in the Stirling 
series (2). The same method applied to the triplication formula 


[(2n) = T(n)T(n + . 22m) (3) 


1 (3n—1/2) I 2 
M(3n) = 5-3 P(n)P[n + =|P[n+ = 
7 3 


>|: (4) 


yields another recurrence relation for the coefficients a, of the Stirling series. As a 
result he was able to derive the following two recurrence relations for the 
Bernoulli numbers B,: 


1 m-—1 u(m 

Pn = arm amy Ee) o 
| — k m—-ky\(™ 

Bn = samy D3 (1+ 2) (7) By. (6) 


Formulas (5) and (6) are based on the multiplication formulas (3) and (4), 
respectively, and on the well-known formula (see [1] and [3]), 


T(n) =n" 'e "210 cx > aye} (7) 


may mm — 1)n"—! 


He conjectured that an infinite number of recurrence relations for Bernoulli 
numbers can be obtained. He also conjectured that an infinite number of distinct 
recurrence relations for the a,’s can be found, all giving the coefficients of the 
same Stirling series (1). 

In this note we present a very elementary procedure, without use of the Gauss 
multiplication formula, to obtain an infinite number of recurrence relations for the 
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Bernoulli numbers (as conjectured by Namias). In particular, we prove that 
1 m-1 m n—-1 
Bn mY > n*( |B, » jn * (8) 
n(l—n™) y= \* j=l 
is valid for any positive integer m and any positive integer n > 1. 

Notice that (8) generalizes Namias’ results, since (5) and (6) can be obtained 
from (8) by letting n be 2 and 3 respectively. Formula (8) was proposed as a 
problem in the MonTHLY (see [2]). Our proof of (8) is based only on the definition 
of the Bernoulli numbers as given by the formal power series expansion 


— » Be: (9) 


Using (7) and (8) we also show that there exist an infinite number of distinct 
recurrence relations for the a,’s (as conjectured in [4]). 


Derivation of Recurrence Relations. We shall first give the proof of the recur- 
rence relations contained in (8). 
Let n be any positive integer greater than 1. Noticing that (1 — e”*)/( — e*) is 
the sum of a finite geometric series, we have 
1 — e”™* S —1 n-1l © 
= die! 
l—e J= » J=0 m=0 


co n—l 

J x 
=_v L 
m=Q(0 j=0 


Multiplying both sides by x/(1 — e”*), we obtain 


m 


: :-+(7== jroo 


m=0 j;=0 


From the definition of the Bernoulli numbers, we now have 


fama (1 § all ss S| 


! 
n 0 mM. 


00 B, 12 m B,n*x* n—1 jm kym—k 
E m= 5 ¥ _ 
m=o0 MM: NM m=0k=0 kt jap (m—k) 
© 1 m B,n* n~1 
= L yy —— Yi * |x. 
~0 n o k!(m — k)! j2 L 


Because a power series expansion is unique, we have 


1 om n—-1 
B,=-b% n'(7) Be in * 


MN K=0 j=0 


1 m—1 n—-1 
=— n'(E)B, Dim * + —(n"B,, | 
NM K=0 j=0 j=0 

] m-1 m n-1 
_— _ n*(7)B, Dim * + n™B 
n - 
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Therefore, 
1 m-1 m n—-1 
B, =——_—~- ¥ n*( )B y y"-* forall m > 1, 
n(l—n™) 42 k mt 


which establishes (8). Next we shall exhibit distinct recurrence relations for the 
coefficients a, in the Stirling series. 
Equating the coefficients in (2) and (7), gives the relationship 
Bn+1 
A» = ——— form = 1,2,3,.... (10) 
m(m + 1) 
Using (8), we get 
1 1 m 


a, = ——__ —_____ ¥ ne(™ ') 2, y jmtink: 


m(m + 1) n(1—n™*") J 
Since By = 1 and B, = —-1/2, 


1 1 nai n(m +1) 72! 
nd ny | -— 7 bhi 


Om m(im+1)n(1 - 


Using (10) once again, and letting 
nal n(m + 1) "=! 


f(m,n) = Li a Li", 


j= j=l 
we get 
1 1 
mm m(m + 1) n(1 — nt!) 


«(fom + nk (7 FEC = tka, OS imei 4 


k=2 jJ=1 
1 1 
~~ m(m +1) n(1 — n™*!) 


«(rom n+ ¥ onbe(™ 4 Pek + ta, 5 3" +) 


k=1 j=1 
Hence 


1 | f(m,n) fi 


Om (1—n™*!) | m(m+1)n 


m-1 n*a, nly 
nN OE ES ] 


Notice that for each positive integer n, (11) gives a recursion formula for the 
sequence {a,,}. Furthermore, each recursion formula generates (1). 
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For example, letting n = 2 in (11), we get the recursion formula 
1 “1 2*a, 

= ——______ | — ————_—_ + — 1)! ——_—__——__—_—_—_—_—— |. (12 

om (1 —2™7*!) | 2(m + 1) (m ) x m—k)'\k-— 1)! (12) 


Letting n = 3 in (11) we get the recursion formula 
1 (1 — 3m)2™ — (1 + 3m) 
om (7 — 3") 6m(m + 1) 


(13) 


m-1 (2™-* + 1)3*a, 
+(m— Dt Lik = 1)! 


Formulas (12) and (13) are precisely the two recursion formulas Namias obtains 
in [4] using the Gauss multiplication formula. 

In summary, we have obtained, in an elementary fashion, a generalization of the 
results in [4], and have added an infinite number of recurrent relations for the 
Bernoulli numbers to the ones that already exist. 


REFERENCES 
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THE TEACHING OF MATHEMATICS 


EpItrED BY MELVIN HENRIKSEN AND STAN WAGON 


A New Scheme for Multiple-Choice Tests 
in Lower-Division Mathematics 


Bruce R. JOHNSON 
Department of Mathematics and Statistics, University of Victoria, Victoria, B.C., Canada V8W 2Y2 


With the class enrollment figures for my lower-division mathematics and statis- 
tics classes rising over the hundred mark, I reluctantly made the decision to 
experiment with multiple-choice tests. I began with a 50-minute midterm test 
consisting of 20 multiple-choice questions, each in the standard five-choice format. 
It was a pleasure to experience the speed and accuracy of computer grading with 
the added bonus of question-by-question statistical analysis of class performance. I 
felt reasonably comfortable about the additional emphasis on error-free work due 
to no partial credit awards. In our high-tech age it seems that small technical 
errors are often harder to spot and potentially more damaging than ever before, so 
it probably is appropriate to put more weight on accuracy in first-year and 
second-year mathematics courses. Also, I found that multiple-choice provided a 
framework for broader, if less deep, coverage of the concepts because of its 
Suitability for many short questions rather than a few long ones. However, I found 
much that troubled me more than the lack of depth. Setting the test was a slow, 
painful, mind-straining chore. It was very tiresome trying to anticipate, for each 
question, the four most common incorrect numerical answers to accompany the 
one correct answer. I was further troubled by the thought that some students 
would work backwards from the choices using a process of elimination and that 
guessing as a last resort would yield at least a 20% chance of success. But what 
bothered me most of all was the realization that the student who arrived at an 
answer that was not listed would be warned automatically of a mistake, while the 
student who arrived at a listed wrong answer would not. To warn some and not 
others is unfair. I rejected the possible solution of always using “none of the 
above” for the fifth choice, because any question having this phrase as the correct 
choice would reward every numerical answer except the four listed wrong answers. 

But there is an approach that substantially reduces the shortcomings of stan- 
dard multiple-choice tests; it is especially suitable for lower-division mathematics 
and Statistics where most of the questions have numerical answers. For each 
multiple-choice question requiring a numerical answer, ten numbers are given in 
numerically increasing order. The correct answer might be one of the ten choices, 
but usually is not. The students are to select the choice nearest their computed 
answer. In the event of a student’s answer being equidistant from the two nearest 
values given, the rule is to select the larger of these two choices. 

Questions are much easier to set in the new format. The choices should be 
spaced over a feasible range at intervals narrow enough to provide a good chance 


427 


1991] THE TEACHING OF MATHEMATICS 429 


1. Find the first row, second column entry of M~! where 


_[4 3 
M = 4 3}. 

(A) Does not exist (B) -—1.0 (C) —0.5 (D) 0.0 (EB) 05 
(F) 1.0 (G) 1.5 (H) 20 dd) 25 (J) 3.0 
-1 _ 0.4 —0.6 
M = Ey 0.8] 


2. A research team consists of 9 men and 6 women. A project outline is to be 
developed by a group of 3 men and 2 women from the team. How many such 
groups are possible? 

(A) 100 (B) 200 (C) 300 (D) 400 (E) 500 
(F) 600 (G) 900 (H) 1200 (I) 2400 (J) 4800 


The number of groups is. (2)(¢} = 1260. 

3. A certain fabric has an average of 1.5 flaws per square meter. In a 6-Square- 
meter piece of this fabric, what is the probability that there are no more than 11 
flaws? 

(A) 0.0 #(B) 0.1 (C) 0.2 WM 03 #+( 04 
(F) 05 GW 06 WH 07 =@W 08 (J) 0.9 


Since 


the correct choice is (1). 


4. Let X,, X,, X;, X, be a random sample from a distribution with mean uw = 2.4 
and standard deviation o0 = 3.0. Find the standard deviation of X. 


(A) 10 @®B 15 (© 20 WM £25 (2) 3.0 
(FF) 35 (G 40 (Gd 45 @W 50 06) 5.5 


A Simple Proof of the Weierstrass Approximation Theorem 


J. H. Linpsey II 
Department of Mathematical Sciences, Northern Illinois University, DeKalb, IL 60115 


Let f(x) be a continuous function on J = [a,b]. The Weierstrass theorem 
states that for given e > 0 there exists a polynomial p(x) with | f(x) — p(x)| < 
on I. The general idea of the proof in this note is to split / into intervals J, on 
which f(x) is nearly the constant c;, define k(x) to be c; on J, and 0 elsewhere, 


1991] THE TEACHING OF MATHEMATICS 429 


1. Find the first row, second column entry of M~! where 


_[4 3 
M = 4 3]. 

(A) Does not exist (B) -—-1.0 (CO —-05 (D) 00 (BE) 05 
(F) 1.0 (G) 1.5 (H) 2.0 dd 2.5 QW) 3.0 
-1 0.4 —0.6 
M> | 03 0.8] 


2. A research team consists of 9 men and 6 women. A project outline is to be 
developed by a group of 3 men and 2 women from the team. How many such 
groups are possible? 


(A) 100 (B) 200 (C) 300 () 400 (BE) 500 
(F) 600 (G) 900 (H) 1200 () 2400 (GJ) 4800 


The number of groups is. (2)(¢} = 1260. 

3. A certain fabric has an average of 1.5 flaws per square meter. In a 6-square- 
meter piece of this fabric, what is the probability that there are no more than 11 
flaws? 

(A) 00 #£«(B) 0.1 (C) 0.2 WM 03 #(€ 04 
(F) 05 @G 06 (© 07 £=@® 08 (GJ) 0.9 


Since 


the correct choice is (1). 


4. Let X,, X,, X3, X, be a random sample from a distribution with mean w = 2.4 
and standard deviation o = 3.0. Find the standard deviation of X. 


(A) 1.0 (B) 1.5 (C) 2.0 (D) 2.5 (E) 3.0 
(F) 3.5 (G) 4.0 (H) 4.5 (1) 5.0 (J) 5.5 


A Simple Proof of the Weierstrass Approximation Theorem 


J. H. Linpsey II 
Department of Mathematical Sciences, Northern Illinois University, DeKalb, IL 60115 


Let f(x) be a continuous function on J = [a,b]. The Weierstrass theorem 
states that for given e > 0 there exists a polynomial p(x) with | f(x) — p(x)| < 
on /. The general idea of the proof in this note is to split / into intervals J; on 
which f(x) is nearly the constant c;, define k(x) to be c; on J; and 0 elsewhere, 
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approximate f(x) by the sum of the k,(x), approximate kx) by a normal 
distribution, and approximate these normal distributions by their Taylor polynomi- 
als. This gives a polynomial f,(x) approximating f(x), but not closely enough. 
Then we similarly approximate f(x) — f,(x) by f,(x) and repeat this procedure 
until we have the desired p(x). The details follow. The idea of successive 
approximations is also used by E. Schenkman [2] in his proof. The classical proof 
by Bernstein polynomials (i.e., binomial distributions) is given in [1]. 

Extend f to J =[a — 1, b+ 1] by defining f(x) = f(a) for x < a and f(b) for 
x > b. Let M = max,|f(x)|. Assume M > 0. By uniform continuity there exists 6 
with 0 < 6 < 0.1 such that | f(x) — f(t)| < 0.01M if |x — t| < 26. Define a(x) = 
e~/8 and b(x) = a(x + 6) + a(x — 8). Then for 0<x <6, e!=0+ 
a(O — 6) < a(x + 6) + a(x — 6) < a0 +: 8) +. a6 — 6) =e! + 1 (because a(x) 
is increasing on negative real numbers and decreasing on positive real numbers). 
The same is true for —5 <x < 0 since b(x) is an even function. Thus |b(x) — 1| 
<1l-—e™! for —65<x<65. Let K={2i6: i€ Z and 216 © J} and g(x) = 
Lyexs(y)a(x — y). Let x © J. Then x €[y, y + 26] for some y € K. Let h(x) = 
fly)a(x — y) + fly + 28)a(x — y — 28). Then 


| f(x) b(x — y — 6) — h(x)| 
=|( f(x) — f(y))a(x —y) + (f(x) — fly + 26))a(x — y — 26)| 
<|f(x) —f(y)| +| f(x) — fly + 26)| 


<0.01M + 0.01M = 0.02M 
and 


| f(*) — h(x)| <| f(x) — f(x) b(x — y — 6)| +] f(x) b(x — y — 6) — h(x) | 
<|f(x)|( —e7') +0.02M < M(1.02 —e7'). 


In absolute value, the contribution to g(x) of the terms of g(x) after the two 
terms of A(x) is at most 

ore) 0 —4 

MY e728 <MYe*= ue = <0.1M. 

i=1 i=1 l—e 
Allowing this 0.1M and another 0.1M for the terms before those of h(x) 
| f(x) — g(x)| < M(1.22 — e~'). Some number of terms in the series for e“/!/® 
approximates it within 0.01 /|K|. The same number of terms approximates a(x — y) 
as Closely for y © K and x €/. Replacing each a(x — y) by such terms converts 
g(x) to a polynomial f,(x) with | f(x) — f,(x)| < MQ.22 — e~") + |KIM(.01)/ 
|K| = M(1.23 — e~') < 0.9M for x € J. Letting f(x) — f,(x) play the role of f we 
can find a polynomial f,(x) with | f(x) — f,(x) — f,(x)| < 0.9 (0.9M) for x € J. 
Recursively, we find f,(x) with | f(x) — Di, f(x) < 0.9‘M until this is suffi- 
ciently small. 
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Statistics Within Departments of Mathematics at 
Liberal Arts Colleges 


THomas L. MoorE 
Department of Mathematics, Grinnell College, Grinnell, IA 50112 


JEFFREY A. WITMER 
Department of Mathematics, Oberlin College, Oberlin, OH 44074 


In his important article “The Science of Patterns” [9], former MAA president 
Lynn Steen provides us with a state-of-the-discipline report on modern mathemat- 
ics. Steen divides the mathematical sciences into three parts “of roughly compara- 
ble size’’: statistical science, core mathematics, and applied mathematics. Currently, 
core mathematics dominates both the faculty and curriculum of departments of 
mathematics at liberal arts colleges. Although Steen claims that distinctions 
between his three divisions may be less “intrinsic differences” than ‘differences in 
style, purpose, and history,” we feel recognition of these differences can be an 
important benefit to the community of liberal arts colleges, where practicalities 
dictate that all three divisions of the mathematical sciences be housed in a single 
department. In particular, a broadened view of the mathematical sciences can 
energize a mathematics department by increasing enrollments, the number of 
majors, the number of students going on to graduate school, and the general level 
of enthusiasm for studying the mathematical sciences. Below we outline some of 
the essential differences between statistics and core mathematics and discuss some 
practical curricular implications that these differences have for a mathematics 
department at a liberal arts college. We concentrate on statistics rather than 
applied mathematics because we are statisticians. 

Statistics is underrepresented among mathematics faculty at liberal arts colleges 
in the United States. A recent survey [8] of mathematics departments at liberal arts 
colleges suggests that approximately half of all such departments have no one with 
an advanced degree in statistics and only 12.5% have more than one such person. 
And, unlike the situation at many universities, if statisticians are employed at 
liberal arts colleges they will generally be housed in the department of mathemat- 
ics, since mathematics is the traditional liberal arts discipline most closely aligned 
with statistics. Because the responsibility of statistics education at most liberal arts 
colleges rests with the mathematics department, it is imperative that the depart- 
ment-recognize the fundamental differences between statistics and core mathemat- 
ics and ensure that their statistics curriculum reflects these. differences. 

Core mathematics and statistics differ in two fundamental ways. Both fields look 
for structure and patterns, but core mathematics looks in the abstract arena of 
space and number while statistics looks at data from other, nonmathematical 
subject areas. Hence core mathematics and statistics differ in their objects of study. 

They also differ in their methods. The emphasis in mathematical thinking is on 
deduction. For example, the axioms of classical group theory form a framework 
from which theorems can be derived. Naturally, in the discovery of new truths 
inductive reasoning and exploration are essential tools. In developing a new 
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theorem in group theory one may observe patterns in known groups with certain 
characteristics. If all such groups examined exhibit a certain property then a 
conjecture may emerge and a proof be sought. Nevertheless, in this endeavor the 
emphasis is on the deduction that establishes the theorem. 

Statistical thinking, on the other hand, focuses on a dialog between models and 
data. For example, suppose one wants to develop a predictive relationship between 
a student’s SAT score and his or her college GPA, using linear regression. One can 
postulate a simple linear relationship with independent, normally distributed 
deviations from the line and from this model infer many features of the population 
from which the data came. For example, one could construct an interval estimate 
of the GPA of a student with an 1150 SAT. However, the linear regression model 
is only a model of reality. All statistical models are tentative. If diagnostic plots 
show that the model does not fit the data, then a more complicated model will 
have to be considered, the inferences redrawn, and the new model scrutinized for 
appropriateness in its own right. This dialog between model and data is a 
fundamental feature of statistical thinking. 

A critical aspect of this dialog between model and data is the quality of the 
data. For example, by properly choosing the sample or by designing a good 
experiment the statistician can usually simplify analysis and strengthen the inter- 
pretability of the model. It is therefore very important to present these ideas when 
teaching statistics. Even an introductory course should give attention to ideas such 
as sampling and experimental bias, the role of randomization, the idea of pairing 
observations, etc. Unfortunately many introductory courses and textbooks ignore 
these concepts. 

Current trends in statistical research reflect the fundamental difference be- 
tween statistics and core mathematics. Consider first the area of regression 
diagnostics. Often in linear regression one or more points may be anomalous. The 
fitted model may not fit these points well (i.e., they may be outliers) or these points 
may have enormous influence on the fitting process itself. Much recent work has 
gone into developing diagnostic procedures for identifying outliers and influential 
points in complicated regression problems (see, e.g., [3]). These procedures play a 
key role in the iterative cycle of data analysis in which a statistician fits a model, 
then checks the fit using diagnostic methods, then if necessary fits a modified 
model, and so on. 

A typical regression analysis using diagnostics will depend heavily on the 
computer. Nevertheless, other areas of modern statistical research make even 
greater computing demands. Classical statistical inference rests upon a precise 
model specification. For example, in the SAT/GPA example above we need both 
the underlying straight-line and the normally distributed errors to create the 
interval estimate. The bootstrap is a method for making a statistical inference 
without standard model specifications. Using the bootstrap one repeatedly draws 
pseudo-samples from the data at hand and uses these pseudo-samples as the basis 
of, for example, an interval estimate of a parameter. The bootstrap is particularly 
useful in situations where the model-based theory is poorly developed. A good 
survey of this technique is given by Efron and Tibshirani [4]. 

The area of dynamic graphical data analysis provides a third example of 
statistics research that is quite distinct from core mathematics. The thrust of this 


1991] THE TEACHING OF MATHEMATICS 433 


research is using modern computer graphics to develop methods that allow the 
data analyst to discover the structure in complex, multidimensional data sets. 
Becker, Cleveland, and Wilks [1] give a good survey of this area. 

For each of the past few summers, a small group of statisticians from liberal arts 
colleges has met to discuss the nature of statistics within liberal arts colleges. 
These Statistics in the Liberal Arts Workshops (SLAW) have considered three 
aspects of statistics within the liberal arts setting: the teaching of statistics, the role 
of statistics within the mathematics major, and the role of a statistician as a general 
campus resource (see [8]). A key conclusion was that a vibrant statistics curriculum 
includes real data and applied statistics and that every mathematics department 
should offer at least one data-driven statistics course that counts for mathematics 
major credit and that preferably can be taken early in a student’s career. Several 
models are possible for such a course: (1) Make an existing introductory course 
data-driven and count it for the major. (2) Add a (one- or two-credit) supplement 
to the traditional mathematical statistics course. (3) Teach new and different 
applied and data-driven statistics courses at the introductory level for mathematics 
majors. 

Model 1 may be the easiest to implement. Most liberal arts college mathematics 
departments teach a course in introductory statistics. Why not allow credit for the 
mathematics major for such a course? The traditional response to this question is 
that such a course is not mathematical enough, where “mathematical” is taken to 
mean ‘“‘core mathematical.’ However, once statistics is given equal status as a 
mathematical science, this objection will vanish if such introductory courses are 
taught with statistical rigor, that is, if they are data-driven explorations of the 
discipline of statistics, as opposed to the dry presentations of formulas that 
students often see. Model 1 together with the traditional mathematical statistics 
course then becomes similar to the model most of us now use for teaching calculus: 
an introduction that teaches basic techniques and the flavor of the subject followed 
later by a theoretical real analysis course. 

It is natural for mathematicians teaching statistics to emphasize the mathemati- 
cal (i.e., theoretical, deductive) side of the discipline. Such an approach makes 
things easier for the teacher, but gives only a partial picture of statistics. For within 
such a curriculum students who might be drawn to statistical thinking see little of it 
and potential mathematical scientists (and potential mathematics majors!) are lost. 

A good data-driven course uses real data to teach statistical thinking. Experi- 
ence suggests that real data are a far more powerful motivating force than are 
artificial data. Statisticlans—and their students—are interested in understanding 
and solving real problems. The analysis of data should be the focus of an 
introductory course. Students should discuss the real-world problems the data 
were collected to solve, the quality of the data, and their effective analysis. 
Exercises should be built around real data sets and students might even design 
projects that require them to produce and analyze their own data. The emphasis of 
such a course is on understanding the broad concepts that apply to most statistical 
problem solving. 

At least some portion of the course should deal with larger data sets and their 
analysis using a statistical computer package. It is important to include large data 
sets and computers for several reasons. First, a good interactive package is a basic 
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tool for the modern data analyst and it is good for the student to learn about one. 
Second, most real data sets are too large for hand calculations. Third, larger data 
sets can better motivate some techniques than smaller ones. For example, finding 
outliers in a small data set can be done by scanning a listing of the data, but with a 
larger data set one turns naturally to statistical graphics. 

The widespread availability of computers and easy-to-use statistics packages 
(e.g., Minitab, as opposed to SAS or SPSS-X, which are powerful but not 
user-friendly) makes it possible to avoid tedious calculation when analyzing real 
data and, instead, to concentrate on the relevant statistical concepts. When the 
computer is used, students can spend less time on calculation and more time on 
the important task of learning what the numbers mean. This in turn gives the 
teacher more room to test understanding of concepts. Dealing with statistical 
concepts can be difficult, but students should not be able to avoid thinking by 
hiding behind formulas (and neither should their teachers). We have found, after a 
period of adjustment, that de-emphasizing formulas helps us think more clearly 
about the concepts we are trying to teach. 

Teaching a data-driven course can be a challenge. Such a course will require a 
different style of teaching for most faculty, with more discussion and less lecture. 
More time will be spent on the computer with a statistical package and, possibly, 
simulations. Office time will be spent helping students work out the very nonmath- 
ematical aspects of the design and analysis of their own projects. Teaching a 
data-driven course will be difficult the first time, but the incorporation of the 
preceding ideas can be approached in steps and can result in a very rewarding 
teaching experience. 

A good textbook that teaches statistical thinking can be of great help. Three 
good examples, aimed at progressively more sophisticated audiences, are [6], [5], 
and [7]. A recent article by Cobb [2] provides excellent guidelines on what to look 
for in a textbook and also reviews several. We present a short list of resources and 
readings in the appendix to aid those who want to learn more about incorporating 
real examples and applications into statistics courses. A longer bibliography can be 
found in [8]. 

Model 2, the supplement to the mathematical statistics, is currently being used 
at Oberlin College, where the two-semester, six-credit sequence in probability and 
mathematical statistics is augmented with a one-credit supplement offered during 
the second semester. The course, called Data Analysis, meets once per week and 
covers various topics from applied statistics, including the tools of exploratory data 
analysis, normal probability plots and transformations of data, control charts, 
applied linear regression, and analysis of variance. All of the topics are introduced 
with computer applications to real data. In the spring of 1989 the course centered 
on’ an open-ended class project of analyzing a set of over 200 variables from a 
survey of college librarians at 97 liberal arts colleges, so that the students could 
gain first-hand experience as statisticians working on a real problem. 

Several implementations of model 3—new, applied, and introductory level 
courses for mathematics majors—currently exist and surely others are possible. 
Mt. Holyoke College teaches an applied regression course that carries a calculus 
and linear algebra prerequisite. The students are required to do individual projects 
in which they find or produce their own data. They must write a final report and 
present their project findings to the class. St. Lawrence University offers an 
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applied time series and forecasting course with a year of calculus as a prerequisite. 
This course uses much real data that are analyzed using a computer package. Both 
Carleton College and Swarthmore College offer courses in applied multivariate 
analysis that require linear algebra as a prerequisite. Although few theorems, if 
any, are proved in these courses, they all require some mathematical sophistication 
of their audiences so that proper understanding of rather elaborate statistical 
models is possible. 

When statistics secures a greater role in the mathematics department of a 
liberal arts college, everyone gains. The department becomes more properly a 
department of mathematical sciences as described by Steen. Students more in- 
clined toward statistics than core mathematics have an additional point of entry 
into the mathematics major and the college may send more students on to careers 
in the mathematical sciences. At the same time, these benefits require that the 
differences between statistics and core mathematics be appreciated and allowed 
for. Ideally, a department will hire a statistician (or two) to develop a solid 
Statistics curriculum. Short of that, mathematicians presently teaching statistics 
should develop a data component in their statistics curriculum along the lines 
outlined above. A data-driven, concept-oriented approach can breathe new life 
into your statistics courses. 


We gratefully acknowledge the help of our fellow SLAW participants in formulating the ideas in this 
paper: Donald L. Bentley of Pomona, George W. Cobb of Mt. Holyoke, Homer T. (Pete) Hayslett, Jr. 
of Colby, Gudmund Iversen of Swarthmore, Anju Joglekar of Smith, Robin H. Lock of St. Lawrence, 
David S. Moore of Purdue, Rosemary Roberts of Bowdoin, and Frank Wolf of Carleton. 

We also thank the referee for helpful comments. 


Appendix 


The following are introductory textbooks with a special strength or focus. Even 
experienced teachers should find them interesting. 


Items 1-6 list textbooks at the elementary level. 


1. J. Devore and R. Peck, The Exploration and Analysis of Data, 1986, West 
Publishing, St. Paul. 
This introductory textbook is loaded with real data from all areas of applica- 
tion. 

2. L. Koopmans, Introduction to Contemporary Statistical Methods, 2nd ed., 1987, 
Duxbury, Boston. 
Koopmans was among the first authors to thoroughly integrate the modern 
topics of exploratory data analysis, graphics, and robust methods into a 
coherent textbook. 

3. B’ F. Ryan, B. L. Joiner, and T. A. Ryan, Jr., Minitab Handbook, 2nd ed., 
1985, PWS-Kent, Boston. 
The Handbook is an excellent companion textbook to the highly popular 
Statistical package Minitab. The book discusses the several larger data sets that 
come with the package and includes many good exercises about them. 

4. D. S. Moore, Statistics: Concepts and Controversies, 1989, W. H. Freeman, 
New York. 

5. D. Freedman, R. Pisani, and R. Purves, Statistics, 1978, W. W. Norton, New 
York. 
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. D. S. Moore and George McCabe, Introduction to the Practice of Statistics, 
1989, W. H. Freeman, New York. 
We referred to these three in the article. 


Items 7 and 8 are textbooks for more advanced students. 


7 


. J. A. Rice, Mathematical Statistics and Data Analysis, 1988, Wadsworth, 
Belmont, Calif. 
This introductory textbook aimed at advanced undergraduates provides that 
audience with a true appreciation of the range of statistics and features the use 
of real data, exploratory data analysis and graphics, and a concern for model 
assumptions. 

. G. E. P. Box, W. G. Hunter, and J. S. Hunter, Statistics for Experimenters, 
1978, Wiley, New York. 
This textbook provides a lucid, conceptual introduction to statistics and experi- 
mentation for an audience of engineers, chemists, and other industrial scien- 
tists. 


Items 9 and 10 are excellent supplemental reading for students and teachers. 


9. W. S. Peters, Counting for Something: Statistical Principles and Personalities, 
1987, Springer-Verlag, New York. 
This charming and casual introduction to elementary statistical concepts gives 
the reader an excellent sense of the history of the discipline. 

10. J. M. Tanur, F. Mosteller, W. H. Kruskal, E. L. Lehmann, R. F. Link, R. S. 
Pieters, and G. R. Rising, Statistics: A Guide to the Unknown, 3rd ed., 1989, 
Wadsworth, Belmont, Calif. 

This is an anthology of short, nontechnical articles on a wide variety of 
Statistical applications. 
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ELEMENTARY PROBLEMS 


E 3439. Proposed by Jane Friedman, Widener University, Chester, PA. 
If M and WN are nonnegative integers, prove that 
Mi) x (Mog -1(Nea)+ y (ree eee, 
| M 0<a<(M-1)/2 a zat] O<acm/2\ © 2a 


E 3440. Proposed by William P. Wardlaw, U.S. Naval Academy, Annapolis, MD. 


Let A be a 3 by 3 magic matrix with real elements; i.e., there is a nonzero real 
number s such that each row of A sums to s, each column of A sums to s, the 
main diagonal of A sums to s, and the counter-diagonal of A sums to s. 

(i) Show that if A is also nonsingular, then A~! is magic. 
(ii) Show that A has the form 


s/3+u s/3-utuv S/3— UD 
s/3 -u-D s/3 sS/3+utuv}], 
S/3 +0 s/3 +u-vD s/3—-—u 


where u and v are arbitrary, and is nonsingular if and only if v? # uw’. 
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E3441. Proposed by Xu Chenglong, Shanghai University of Science and Technol- 
ogy, China. 


Suppose 0 < p < 1 and q > O. Put 


Determine lim, _,,.U,. 


E 3442. Proposed by Ray Wylie, Furman University, Greenville, SC. 


Given a sequence {b,}”_, of real numbers such that b, = 0 for n sufficiently 
large, put B, = LP _pby, +, (S = 0,1,...,m — 1), and let us say that the sequence 
{b,}°_. has property P,,, where m is a positive integer, if 


By = By = -** =By-1- 
Suppose d,,a5,...,a, are positive integers not necessarily distinct, and let C(n) 
be the number of r-tuples (1,,,,...,n,) of integers such that 
n=n,+n,+°:: 4+n,, O<n,; <a; fori =1,2,...,r. 


Prove that the k sequences 


{C(n)};,=0 {nC(n)}j=0> rey {n*-'C(n)},-0 


all have property P,, if and only if at least k of the integers a,,a5,...,a, are 
congruent to —1 modulo m. 


E 3443. Proposed by Calin Popescu, St. Michiels Brugge, Belgium. 


Let A,, i = 0,1,...,5, denote the vertices of a hexagon inscribed in a circle and 
let B, denote the intersection of the straight lines A,;A;,, and A;,,A;,3, for 
i=0,1,...,5, the indices being computed modulo 6. Prove that, if the triangles 


A,A,A, and A,A,A; have the same orthocenter, then the straight lines B,B;, 3, 
i = 0,1,2, are concurrent. (The orthocenter of a triangle is the intersection of its 
three altitudes.) 


E 3444. Proposed by Wilbur Jénsson, McGill University, Montreal, Canada and 
Gérard Letac, Université Paul Sabatier, Toulouse, France. 


Let E be a finite-dimensional vector space over a field K and let L(E) be the 
space of endomorphisms of E. Suppose L, and L, are subspaces of L(E) such 
that L(E) = L, + L, and xy + yx = 0 for all (x, y) in L, X L. 

(i) If char(K ) # 2, prove that either L, = 0 or L, = 0. 
(ii) If char(K) = 2, show that the conclusion of (i) need not hold. 
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Editorial comment. We remark that trivially 
R=S=T=Q when n = 1,2, 
R=Q,S =T=Q(V3) when n = 3, 
R=S = Q,T is undefined when n = 4. 


Several solvers noted that the problem is partially solved in D. H. Lehmer, “A 
Note on Trigonometric Algebraic Numbers,” this MONTHLY 40(1938) 165-166 and 
in Ivan Niven, Irrational Numbers, Carus Monograph No. 11, pp. 37-41. 


Solved also by S. F. Barger, D. Callan, M. R. Darafsheh (Iran), H. M. W. Edgar, D. W. Koster, 
D. E. Manes, R. A. Mena, National Security Agency Problems Group, S. Preis, J. Spencer, R. Stong, 
S. V. Ullom, and the proposer. 


- A Threshold for Divergence? 


E 3354 [1989, 838]. Proposed by V. A. Alexandrov and N. S. Dairbekov, 
Institute of Mathematics, Novosibirsk, USSR. 


Suppose M: [1, ©) —> (e,) is a nondecreasing function such that 


o at 
I my >” ©) 
(a) Prove that 
00 dt 


remy ~* 
1 t log M(t) 
(b) Show that there exist nondecreasing functions M satisfying (*) such that 


00 dt 
(——_<« 
1 t log M(t)loglog M(t) 


Solution by Kenneth Schilling, University of Michigan, Flint. For (a) suppose 
that ff dt/{t log M(t)} < . By the substitution u = log t we obtain 
fo du/log M(e“) < &, Since 1/log M(e“) is nonincreasing, we have 


(v /2) /log M(e’) < [_ du/roe M(e“) > 0 


as vu ~ ©, This implies v/log M(e’) < 1/2 for large v, which is equivalent to 
e*? < M(e”) or e’/M(e’) <e~” for large v. Hence, [%e’ dv/M(e’) < , which 
yields {> dt/M(t) < © via the substitution t = e’. Thus (a) is proved. 

For (b) we construct an example by letting k, = exp{exp[exp(n)]} for n = 
0,1,2,... and putting M(t) =k, for k,_,<t<k, for n = 1,2,.... Then 


[Uat/M(t) = XS kek, — kya) =, 
ef n=1 
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Editorial comment. We remark that trivially 
R=S=T=Q when n = 1,2, 
R=Q,S =T=Q(V3) when n = 3, 
R=S = Q, T is undefined when n = 4. 


Several solvers noted that the problem is partially solved in D. H. Lehmer, “A 
Note on Trigonometric Algebraic Numbers,” this MONTHLY 40(1938) 165-166 and 
in Ivan Niven, Irrational Numbers, Carus Monograph No. 11, pp. 37-41. 


Solved also by S. F. Barger, D. Callan, M. R. Darafsheh (Iran), H. M. W. Edgar, D. W. Koster, 
D. E. Manes, R. A. Mena, National Security Agency Problems Group, S. Preis, J. Spencer, R. Stong, 
S. V. Ullom, and the proposer. 


- A Threshold for Divergence? 


E 3354 [1989, 838]. Proposed by V. A. Alexandrov and N. S. Dairbekov, 
Institute of Mathematics, Novosibirsk, USSR. 
Suppose M: [1, ©) — (e,) is a nondecreasing function such that 
o dt 
J May.” (*) 


(a) Prove that 


00 dt 
—_————————- = 0 
J t log M(t) 


(b) Show that there exist nondecreasing functions M satisfying (*) such that 


00 dt 
(—*—__ ce. 
1 tlog M(t)loglog M(t) 


Solution by Kenneth Schilling, University of Michigan, Flint. For (a) suppose 
that ff dt/{t log M(t)} < «©. By the substitution u = log t we obtain 
fo du/log M(e“) < », Since 1/log M(e“) is nonincreasing, we have 


(v/2) /log M(e’) < [_ du/os M(e“) > 0 


as vu + ©, This implies v/log M(e’) < 1/2 for large v, which is equivalent to 
e*” < M(e") or e’/M(e’) <e~” for large v. Hence, [%e’ du/M(e’) < , which 
yields {y dt/M(t) < © via the substitution ¢ = e’. Thus (a) is proved. 

For (b) we construct an example by letting k, = exp{explexp(n)]} for n = 
0,1,2,... and putting M(t) =k, for k,_,<t<k, for n =1,2,.... Then 


[UarM(t) = kek, — ky) = 
ef n=1 
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since k,_,/k, ~ 0as n — ». On the other hand, 


Ee [* de/(tet"er} 


[aye log M(t)log log M(t)} 
e* n=1 Kn 


oO 


Y, (e* —e" )/(e@"e") < x 1/e” <0, 


n=1 n=1 


Solved also by L. Shepp, R. Stong, W.-B. Zhang, and a referee. Part (a) was solved by M. Falkowitz 
(Israel) and the proposer. One incorrect solution was received. 


The Limit of a Dirichlet Integral 
E 3357 [1989, 927]. Proposed by N. I. Alexandrova, Institute of Mining, Novosi- 
birsk, USSR. | 
Evaluate 
+o Sin x? 


lim / —— sin( xy) dx. 
0 Xx 


yoat+o 


Solution by K. F. Andersen, University of Alberta, Edmonton, Canada. The value 
of the limit is 7/2. To see this define a function f on (0, ©) by putting 


sinx? 1 sin x? 
—— for0d<x <1, f(x) = 
x 


for x > 1. 


f(x) = 


Since —x?/6 < (sin x7)/x*? — 1/x <0 for 0 <x <1, we see that f] L(0,~). 
Hence, we may write 


x3 


osin x? | 20 | 1 dx 
i 3 sin(xy) de = f f(x)sin(xy) dx + f sin( xy) —. 


The first term on the right tends to zero as y ~ © by the Riemann-—Lebesgue 
lemma, while the second term equals 


y . dt 
J (sin t)—, 
0 f 
which, as is well known, tends to 7/2 as y > ~., 


Edjtorial comment. Allen Pedersen and Kee-Wai Lau used arguments like that 
above to reduce the problem to other known integrals. Such integrals (and the 
integral fjt~ ‘(sin t) dt) are often evaluated via the Cauchy residue theorem; 
however, Michael Fichter was the only solver who used the residue theorem 
directly. 

Klaus Schurger and the Chico Problem Group both obtained the generalization 


lim J x~@*” sin x’ sin( xy) dx = 17/2 
yo“ 


for any fixed positive number r. 
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Several solvers calculated the limit to be 5.4415643575 ... . The early terms of the 
sequence are x, = 3, X, = 4.754887..., x3 = 5.419336... . D. B. Tyler remarked 
that x, and the limit agree to about 15 decimal places, but that x, and the limit 
agree to over 10!? decimal places. The latter fact follows from the inequality (* *) 
of solution II if we replace 5 by 5.4. 


34 correct solutions, 5 incorrect solutions, and one incomplete solution were received. 


Falsity of a Sharper Form of Carmichael’s Conjecture 


E 3361 [1990, 63]. Proposed by William P. Wardlaw, U.S. Naval Academy, 
Annapolis, MD. 


Prove or give a counterexample to the following conjecture concerning the 
Euler totient function: For every even positive integer n there is an odd positive 
integer kK such that 


p(k) = 9(n). 


Solution I by L. L. Foster, California State University, Northridge. We show that 
the equation 6(x) = 2° - 257 has even solutions but no odd solutions. The even 
solutions are 


x= 99-90-26; —4e2 . 3£0 . 561 . 1 7&2 . 2577, 


where each ¢e, is 0 or 1. Suppose there were an odd number & such that 
b(k) = 2'° - 257. If p is a prime factor of k, then either p — 1 = 2/ for some j 
with 1 <j < 16 or p — 1 = 2/: 257 for some j with 1 <j < 16. But 2’ - 257+ 1 
is composite for j = 1,2,..., 16 (being divisible by 67 when j = 7, divisible by 1123 
when j = 15, and divisible by 3, 5, 7, or 17 for the other values of j). Thus the only 
possible prime factors of k are 3,5, 17, 257, 65537, with the prime 257 occurring 
exactly twice and any other of these primes occurring at most once. But clearly 
65537 cannot occur in conjunction with 257 and, if €9, €,, and €, are each 0 or 1, 
then 


(3° - 5&1 - 1782 - 2577) = 28teot 21+ 4e2 . 957 < 216 . 257. 


Thus there is no odd k such that 6(k) = 2!'°- 257 = 6(2° - 2577), so that 
n = 2” - 2577 = 33817088 is a counterexample to the conjecture of the problem. It 
is not difficult to show that 33817088 is in fact the minimal counterexample; this 
can also be verified by computer. 


Solution II by R. J. Simpson, Curtin University, Perth, Australia. We shall prove 
the following rule. A necessary and sufficient condition for the equation é(x) = 
2” = 6(2”*') to have an odd solution x is that any 2/ occurring in the binary 
expansion of m must be such that 2?’ + 1 is prime. To prove this rule suppose that 
k is odd and that 6(k) = 2”. Then k must be the product of distinct primes of the 
form 2° + 1. Now a necessary (but not sufficient) condition for 2° + 1 to be prime 
is that s be a power of 2. Thus we must have 


k= [[(2%41), O<j, <i,< --- <i, 
i=1 
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where each factor in the product is a prime. But then 


2m = ack) = 112 
i=1 
or 
m= )) 2%, 
i=1 
and the necessity is proved. The sufficiency is immediate by the same argument. 
In particular, since 23* + 1 = 641 - 6700417, it follows that if the binary expan- 
sion of m contains 2°, then (x) = 2” = 6(2”*') has no odd solution x. 


Editorial comment. Most solvers settled the problem by observing that the 
equation #(x) = 2°” has even solutions, namely, 


x= 733 —€9— 26; — 4€2— 8e3— 16e4 - 3f0- £1 - 17&2- 25783 - 65537*4, 
where each ¢, is 0 or 1, but that (by an argument similar to that of Solution I) the 


equation ¢(x) = 2” has no odd solutions. By contrast, if 0 <m < 31 and the 
binary expansion of m is given by 


m = 6, + 26, + 46, + 86, + 166,, 
where each 6, is 0 or 1, then (x) = 2” has the odd solution 


xX = 3°0 - 591 - 1792 - 25793 - 655374. 
Using the fact that 27’ +1 is composite for 5 <j < 21, Eugene E. Ilgenfritz, 
David W. Doster, and James G. Marickel each noted that @(x) = 2” has no odd 
solutions for 32 <m < 2”* — 1 = 4194303. (This assertion also follows from the 
rule of Solution II.) 

A counterexample to the conjecture of the problem is given in Exercise 14 on 
page 234 of W. Sierpinski’s Elementary Theory of Numbers, Monografie Mat. 
Volume 42, Warsaw, 1964. Sierpifski essentially proved that if m = 32(mod 64), 
then there are no odd solutions x of 6(x) = 2” = @(2™*!). Again this is a special 
case of the rule of Solution II. 

No other solver found the minimal counterexample given by Foster in Solution 
I. Richard Stong, Douglas B. Tyler, and Jack V. Wales, Jr. each gave a solution 
similar to Solution II. 

Several solvers remarked that the conjecture mentioned in the problem is a 
stronger form of Carmichael’s conjecture that the equation (x) = C never has 
exactly one solution, but that the demonstrable falsity of this conjecture does not 
really help to decide Carmichael’s conjecture. 


Solved also by 63 other readers and the proposer. 


An Equation in Towers of Powers 


E 3364 [1990, 64]. Proposed by H. W. Lenstra, University of California, Berkeley. 


Determine whether there exist positive integers a,b,n,m such that a #)b, 


n >2,m > 2, and 
a b 


qt bn =p? \ im. 


1991] PROBLEMS AND SOLUTIONS 445 


Solution by James Duemmel and Allen Mauney, Western Washington University, 
Bellingham, WA. There is no solution. Suppose there is a solution; we may assume 
1 <a <b. If the relation is true, then we have an equation of the form a* = b’. 
Here a and b must have exactly the same prime divisors. Let a = II}_,p** and 
b = T1/_, p?, with all a, and B, positive and a,x = B,y for all i. Letting z = x/y, 
we have 8, = za, for all i, and hence b = a’. 

Next we show that z must be an integer. Since x = a“ and y = b” for some 
positive integers u and v, we have z = JIj_,p‘*“~*'”. These exponents are 
integers, and if they are all positive z will be an integer. From a* = b”, we have 
z = log b/log a. Also x > y since a < b, and hence a“ > b’. This implies u/u > 
log b/log a = z = B,/a;. Hence a,;u — B,v > 0 for all i. 

From the expression for z, we have z = a”, where w = u — zu > 0. We also 
have u = a’ for some positive integer s, since a < b implies that n > 3. Substitut- 
ing for z in the expression for w, we obtain w + a”v =u =a’. Since vu > 1, the 
left side is larger than a”. Hence s > w and a” divides u. Hence a” must also 
divide w. With a > 1, this is not possible. 


Editorial comment. Edward B. Burger and Peter D. Kohn pointed out that the 
equation has solutions when a and b are not integers. For example, (1/2)'”? = 
(1/4)'74. 


Solved also by B. Beckmann, E. B. Burger & P. D. Kohn (students), D. Callan, K.-W. Lau (Hong 
Kong), O. P. Lossers (The Netherlands), J. McHugh, J. G. Merickel (student), J.-M. Monier (France), 
T. S. Norfolk, E. Preissman (Switzerland), R. Stong, J. S. Sumner & K. L. Dove & E. Toro, W.-B. 
Zhang, Central Michigan University Problem Group, and the proposer. 


ADVANCED PROBLEMS 


6658. Proposed by L. Van Hamme, Free University of Brussels, Belgium. 


Define a sequence of integers by 


n-1l 


a(0) =1, a(n) = x (7 Jac) for n> 1, 


r=0 


so that L?_,a(n)x"/n! = (2 — e*)7' for |x| < log2. (This is sequence 1191 in 
N. J. Sloane’s Handbook of Integer Sequences, New York, Academic Press, 1973.) 
Prove that if p is a prime number and mi is an integer not divisible by p, then 


a(mp* + s) =a(mp*"'+-s) (mod p*) 


for k a positive integer and s a nonnegative integer. 


6659. Proposed by Abraham Ungar, North Dakota State University, Fargo. 
Let R2 be the subset of the Euclidean 3-space R? given by the equation 


R? = {x € R®: kl <c}, 


Cc 


where c is a positive constant. In the special theory of relativity c represents the 
speed of light, and the elements x of R? are admissible velocities. The relativistic 
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velocity composition law is given by the equation 


ie : ———_—— , 
1t+x-y/e> c* y,+1 14+x:-y/c? 


x+y 1 Vx x X (x X y) 
-5 


where y, is the Lorentz factor 
1 


ii PT ae 
yl-—x-x/c 


It is known that the space R? is closed under the relativistic velocity composition: 
if x,y © R? then x+y © R?. 
For given a,b € R® solve each of the two velocity composition equations 


axx =b (1) 
and 
xxa=b (2) 


for the unknown x € R?. 


6660. Proposed by Richard Alan Gillman, Valparaiso University, Valparaiso, IN. 


Let f be the multiplicative arithmetic function defined by f(1) =1 and 
f(p?) = pf(a) for all primes p and all positive integers a. 
(i) Prove that f(m) <n for all n. 
(ii) Prove that the ratio 


Ef) yj 
j= j=] 


has a limit C as n > ©. 
(iii) Estimate the difference 


x fi) CX). 


j=l j=l 


SOLUTIONS OF ADVANCED PROBLEMS 
How Often Is nd(n) < x2? 
6588 [1988, 963]. Proposed by Walther Janous, Ursulinengymnasium, Innsbruck, 
Austria. 


As in E3246 put W(n) = nd(n), where @ denotes the arithmetic function of 
Euler. For positive real x let F(x) denote the number of positive integers n for 
which w(n) < x*. Prove that lim, _,.. F(x)/x exists and find its value. 


Composite Solution by Joachim Herzog, Johann Wolfgang Goethe University, 
Frankfurt, Germany, Kee-Wai Lau, Hong Kong, and the editors. For Res > 1 we 


446 PROBLEMS AND SOLUTIONS [May 


velocity composition law is given by the equation 


x+y 1 Vs x X (x X y) 


 14+x- Je @ +1 1+x-y/c?’ 
y Yx y 


x*y x,y © R?, 


where y, is the Lorentz factor 
1 


‘eS 
yl-x-x/c 


It is known that the space R? is closed under the relativistic velocity composition: 
if x,y € R? then x*y € R®. 
For given a,b € R2 solve each of the two velocity composition equations 


axx =b (1) 
and 
x*a=b (2) 


for the unknown x € R?. 


6660. Proposed by Richard Alan Gillman, Valparaiso University, Valparaiso, IN. 


Let f be the multiplicative arithmetic function defined by f(1) =1 and 
f(p?) = pf(a) for all primes p and all positive integers a. 
(i) Prove that f(m) <n for all n. 
(ii) Prove that the ratio 


Ef) yj 
j= j=l 


has a limit C as n > ~. 
(iii) Estimate the difference 


x fi) -C¥ i. 


j=l j=l 


SOLUTIONS OF ADVANCED PROBLEMS 
How Often Is nd(n) < x2? 
6588 [1988, 963]. Proposed by Walther Janous, Ursulinengymnasium, Innsbruck, 
Austria. 


As in E3246 put W(n) = nd(n), where @ denotes the arithmetic function of 
Euler. For positive real x let F(x) denote the number of positive integers n for 
which w(n) < x*. Prove that lim, _,.. F(x)/x exists and find its value. 


Composite Solution by Joachim Herzog, Johann Wolfgang Goethe University, 
Frankfurt, Germany, Kee-Wai Lau, Hong Kong, and the editors. For Res > 1 we 


448 PROBLEMS AND SOLUTIONS [May 


The result of this problem may also be obtained by the methods of [4], [5], or [6]. 
An estimation of the difference F(x) — f(1)x can be obtained by the methods of 
[1], [2], or [3]. 


REFERENCES 


1. R. Balasubramanian and K. Ramachandra, On the numbers of integers m such that nd(n) < x, 
Acta Arithmetica 49 (1988) 313-322. 

2. Michel Balazard and Abdelhakim Smati, Elementary proof of a theorem of Bateman, Analytic 
Number Theory, Proceedings of a Conference in Honor of Paul T. Bateman, Progress in Mathematics, 
Volume 85, Birkhauser, 1990, pp. 41-46. 

3. Paul T. Bateman, The distribution of values of the Euler function, Acta Arithmetica, 21 (1972) 
329-345. 

4. Robert E. Dressler, A density which counts multiplicity, Pacific J. Math., 34 (1970) 371-378. 

5. Paul Erdés, Some remarks on Euler’s ¢-function and some related problems, Bull. Amer. Math. 
Soc. 51 (1945) 540-544. 

6. Jean-Louis Nicolas, Distribution des valeurs de la fonction d’Euler, L’Enseignement Math., 30 
(1984) 331-338. 


No other solutions were received. 


Asymptotic Formula for the Solution of a Difference Equation 


6610 [1989, 744]. Proposed by A. W. Goodman, University of South Florida, 
Tampa. 


Suppose the sequence x,,x,... is defined by x, = 1 and x, =x,_, + kx, 5, 


for n > 1, where k and ¢ are given positive constants. Find an asymptotic formula 
for x,,. 


Solution (jointly) by H. Kéditz, L. Mattner, E. Mues, and M. Reinders, 
Universitat Hannover, Germany. For n — © we have 
x, = ((c + 1)kn)'/*? + O(A). 


This is a consequence of the following result. 
Let the sequence {x,,} be defined by 


x, =1, Xne+i =X, + B(x,), n> 1, 


where g(x) is a continuous, positive, and nonincreasing function for x > 1. Let x(t) 
be the solution of the initial value problem 


x(1) = 1, x'(t) = g(x(t)), t>1. 
Then x, > © and x, =x(n) + OM) forn > ~., 

To prove this, first observe that with the conditions imposed on g the differen- 
tial equation has a unique solution x(t) that is defined for all t > 1. The function 
x(t) is increasing, and clearly x(t) ~ ~ as t > o, 

For t > 1 define p(t) by 


p(t) =x, + (t-—n)(x,41 —-,); n<t<ntl. 


This is the usual Euler-Cauchy polygonal approximation, with stepsize 1, for the 
differential equation. Clearly p(1) = 1 and 


p(t) =X,4; —X, = 8(x,) = 8(p(t)) 
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1974, 187]. It was proposed by J. J. A. M. Brands; the solution published was by 
F. W. Steutel. 

Solved also by Robert A. Agnew, Pull Cull, Thomas N. Delmer, Mordecai E. Falkowitz (Israel), 
Zachary Franco, Serge Gaignoux (France), Todor G. Genchev (Bulgaria), Murray Klamkin (Canada), 


Jean-Charles Leccia (France), Jean-Marie Monier (France), Bernard Pinchuk, Arthur J. Rosenthal, 
Richard Stong, Nora S. Thornber, and Douglas B. Tyler. 


Contracting Mappings in a Compact Metric Space 


6611 [1989, 744]. Proposed by O. Hajek and D. Singer, Case Western Reserve 
University, Cleveland, OH. 


Suppose X is a compact metric space with metric d and suppose T: X — X isa 
homeomorphism of X. If d(x, y) => d(Tx, Ty) for all x, y in X, prove that T is an 
isometry (i.e, d(x, y) = d(Tx, Ty) for all x, y in X). 


Solution by John Alexopoulos, Kent State University, Ohio. We replace the 
homeomorphism assumption by the weaker assumption that T is onto. For any 
fixed x and y in X and any positive integer nm there are points x,, y, in X such 
that 


T"(x,) =x, T'(y,) =y. 
Since X is compact there is a strictly increasing sequence of positive integers n(k) 
and a pair of points x’ and y’ in X such that 
lim, XK) =X’; lim, Yak) = Y'- 
Now 
d(T™(x'), x) = d(T" (x'), T7 (dng) < d(2', Xa): 


Thus lim T”(x') = x and (similarly) lim T”(y’) = y. 
Since the sequence d(T"(x'), T"(y')) is bounded and decreasing, it converges to 
d(x, y). Hence 


d(x, y) = lim d(T”™(x’'), T”™(y’)) 
= lim d(TT"™( x'), TT”™(y')) 


= d(T(x),T(y)). 

Editorial comment. Many solvers gave references for this result in the literature, 
including exercises in the textbooks of Bourbaki, Dugundji, Engelking, and 
Kaplansky. The two oldest references were to A. Lindenbaum, Contribution a 
étude de l’éspace metrique, Fund. Math., 8 (1926) 209-222, and H. Freudenthal 
and W. Hurewicz, Dehnungen, Verkiirzungen, Isometrien, Fund. Math, 26 (1936) 
120-122. A. Witkowski (Poland) mentioned that the result is the case G = T~! of 
the following. If X is a compact metric space, G: X — X is continuous, and 


d(Gx,Gy) > d(x, y) 


for all x and y in X, then G is an isometry. He informs us that the folklore 
version of the above is “if a hippopotamus closes its eye, its skin must expand 
somewhere.” 


Solved also by the proposer and 28 others. 
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Solved also by the proposer and 28 others. 
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Mahler’s Inequality 


6613 [1989, 846]. Proposed by Jeff Vaaler, University of Texas, Austin. 


Suppose P is a polynomial of degree n with complex coefficients, leading 
coefficient a and zeros a,,...,a,. Define the Mahler measure M(P) of P by 


M(P) = exp [ log|P(e77"')| dt =a [[ max(1, lal), 
j- 


where the latter equality follows by Jensen’s formula. If P’ is the derivative of P, 
prove that 


M(P') <nM(P). 


Solution by David W. Boyd, University of British Columbia, Vancouver. First 
observe that if O(z) is a polynomial of degree n with leading coefficient b having 
all its zeros inside the unit circle |z| < 1, then M(Q) = |b|. The leading coefficient 
of Q’ is then nb and all zeros of Q' lie inside the unit circle, by the Gauss-Lucas 
theorem (see p. 22 of Morris Marden, Geometry of Polynomials, Mathematical 
Surveys No. 3, Amer. Math. Soc., 1966). Thus 


M(Q') = nl|b| = nM(Q), 


so the desired inequality holds with equality for such Q. 

Now suppose P is an arbitrary polynomial of degree n. Then there is a 
polynomial Q of degree n with all zeros in |z| < 1 and such that |O(z)| = |P(z)| 
for all |z| = 1. Simply define 


Q(z) = P(z) TI —2. 


la,|>1 © ~ Aj 


Then Q has the property claimed since \((1 — @,z)/(z — a,)| = 1 for all |z| = 1. 
The integral formula for the measure then shows that M(Q) = MCP). 

Finally, we recall an important theorem of Bernstein (ibid, p. 23) which states 
that if |P(z)| < |Q(z)| for all |z| = 1, and if Q(z) does not vanish outside the unit 
circle, then |P'(z)| < |Q’(z)| for all |z| = 1. Thus 


M(P') < M(Q') =1M(Q) =nM(P). 


Remark 1. Mahler (Proc. Roy. Soc. Lond. Ser. A, 264 (1961) 145-154) gave a 
rather involved proof of this inequality. In a postscript, he pointed out that a 
referee had remarked that the result could be proved by observing that 


M(P’) 


1 n 
loe——— = [1 dt 
°F M(P) J 08 | 


1 
» eemit _ a; 


j=l 


is a subharmonic function of the zeros of P and hence the maximum is attained 
when they lie on the unit circle. The proof of the theorem of Bernstein used in the 
above solution uses only the maximum modulus principle for the function 
P(z)/Q(z) and the Gauss-Lucas theorem, and hence the above proof is somewhat 
more elementary. 
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Remark 2. For 0 < p < @, let 


1 op 1/p 
Pll, = | fice) ar| 


be the L? mean of P on the unit circle. Then ||P||, ~ MCP) = |IPllo as p > 0 
and 


IPI, - max,,,.,|P(z)| = II Plow 


as p — ©, The well known inequality of Bernstein states that ||P'll. < n|lPll. if P 
is a polynomial of degree n. This was extended to ||P'l|, < nl|Pll, for p = 1 by 
Zygmund (Trigonometric Series, Cambridge University Press, Volume II, 1968 
p. 11). The inequality of Mahler, proved above, shows that this holds also for 
p= 0. 

It is conjectured that the inequality ||P’ll, < nllPll, continues to hold for 
0 <p <1. In this range, the best result known is due to Maté and Nevai (Annals 
of Math., 111 (1980) 145-154), who showed that the inequality holds with the 
factor n(4e)'/? in place of n. 


One incomplete solution was also received. 


REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


Representations and Characters of Finite Groups. By M. J. Collins, Cambridge 
University Press, 1990. xii + 242 pp. 


MARK STEVEN MaAzuR 
Department of Mathematics and Computer Science, Duquesne University, Pittsburgh, PA 15282 


Representation theory and character theory have been a major tool in the study 
of finite groups. Without them the classification of the finite simple groups might 
never have been achieved, or at the very least, might still be underway. The author 
of this text is keenly aware of the central role that representation and character 
theory have played in the classification problem. In fact, the main objective of his 
presentation is a development of the representation and character theory neces- 
sary for tackling the classification problem. 

Representation and character theory began with Frobenius’s work, nearly a 
century ago. His theorem on transitive permutation groups along with Burnside’s 
theorem on the solvability of groups of order p%q? are major results in the theory. 
While more recently a proof of Burnside’s theorem has been given which is purely 
group theoretic, no such proof of Frobenius’s result exists. Notable, too, is Feit and 
Thompson’s odd order paper (the solvability of groups of odd order) which relies 
upon the developments of representation and character theory. Unlike many 
mathematicians, the author of this book has gone to considerable lengths to inform 
the reader of the theory’s historical importance and of the scholars who have 
figured significantly in the development of the theory; he ascribes the important 
theorems to their authors, as is customary, but also includes the year in which 
results were published. Such a technique gives even the casual reader a clear 
picture of how the many threads of the theory developed and how they were woven 
together and applied to the classification problem. 

Stated as simply as possible, the basic objects of study are representations and 
characters. A representation is a homomorphism from an abstract group G to the 
general linear group GL(V) where V is a finite-dimensional vector space over a 
field K. If the dimension of V is n and a basis is chosen for V, then V may be 
regarded as the vector space of n-tuples over K, and GL(V) may be viewed as the 
group GL(n, K) of nonsingular n X n matrices over K. Historically, these matrix 
representations of a group were considered first since the groups themselves were 
well understood. For each representation of a group there is an associated 
character defined as the trace of the matrix representation. These characters are 
class functions, because they are constant on the conjugacy classes of the group. 
Hence, the study of the characters of a finite group is often sufficient even though 
information is sacrificed (since only the trace of the matrix is used rather than the 
entire matrix itself). 

In terms of the classification problem one often wishes to deduce global 
information about the group G from local information (i.e., from knowledge of 
certain subgroups H of G). Many results in representation and character theory 
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give conditions under which representations and characters of a subgroup may be 
“lifted” to the group. This sometimes forces the group to have a particular 
isomorphism type. Such a strategy is employed often in the classification problem. 

Chapter 1 of Collins’s book contains an exposition of representation theory. 
Chapter 2 develops character theory. These two chapters contain all the standard 
basic theory from these areas. The author also includes a few specialized results 
which he will have occasion to apply later in the book. 

The remainder of the book is more specialized. Chapter 3 contains Suzuki’s 
theory of exceptional characters. This chapter deals with conditions under which 
the knowledge of the characters of a subgroup H of G provides information about 
the characters of a group G. As mentioned before, this knowledge may force G to 
have a certain isomorphism type. Chapter 4 discusses coherence and the excep- 
tional characters. The chapter’s goal is to study CN-groups of odd order and 
Zassenhaus groups. The study of these groups was important in the early stages of 
the classification of the finite simple groups. Chapter 5 contains Brauer’s charac- 
terization of characters: which gives necessary and sufficient conditions under 
which a complex-valued class function is a character in terms of the elementary 
subgroups of the group. Collins follows the Brauer-Tate proof rather than the 
more recent Goldschmidt-Isaacs proof. Leaving the characterization of characters 
to this late stage in a book on character theory is unusual. The author delays its 
inclusion because he makes a very interesting use of this result in chapter 6. He 
introduces block theory in a nonstandard way. For instance, he obtains a proof of 
the nonsimplicity of groups with homocyclic Sylow 2-subgroups of rank 2 by means 
of an isometry constructed using Brauer’s characterization of characters rather 
than using Brauer’s method of columns. Because of the success in applying 
character ring methods to situations in which modular characters have been 
applied in the past, the author speculates in the book’s preface that perhaps one 
day non-modular proofs may be obtained for Brauer’s second and third main 
theorems which are fundamental results in the area of modular characters. 

The author concludes the book with a brief appendix containing Glauberman’s 
Z*-theorem because of its importance to the classification problem. 

In summary, Collins’s book is a well-written text with a very clear purpose in 
mind—a self-contained development of representation and character theory with 
the classification of finite simple groups as the driving force behind the develop- 
ment. The book is written for the specialist or the graduate student with a strong 
background in algebra. I must disagree with the author who says that the text 
could be used with gifted undergraduates if “‘a judicious choice of material’’ is 
made. I certainly do not believe that most American undergraduates would possess 
the necessary background. 

The value of this book lies not in its encyclopedic approach to the 
subject—Curtis and Reiner is the standard here—nor in its plethora of examples 
—the text is written concisely. The text’s greatest attraction lies in its novel 
approach to the theory in light of the classification problem. 
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Theorist. By Prescience Corporation, 814 Castro Street, San Francisco, CA 94114. 


FRANK WATTENBERG 
Department of Mathematics and Statistics, University of Massachusetts, Amherst, MA 01003 


The Prescience Corporation describes Theorist as “the first symbolic algebra 
system to make computer mathematics easy.” The following table was included 
with Theorist. 


program retail recommended minimum required 
name price memory memory disk space 


Theorist 


Mathematica 
Maple 
Milo 


*Macintosh II version sold separately from regular version, which is $495. 
Other programs’ prices include both versions. 

With the list price of the new Mac Classic under $1,000 is Theorist the program 
we've all been waiting for? Do we and our students now have affordable access to 
a serious computer-based mathematical environment with graphics, numerics and 
symbolic algebra? Unfortunately, the answer is no. Although Theorist does some 
things very well, it has shortcomings that severely limit its usefulness to both 
working mathematicians and students. 

This review compares Theorist to Mathematica and the Hewlett-Packard HP- 
48SX calculator. Except for the unavoidable limitations of its checkbook size—its 
display is only 131 pixels by 64 pixels, and its tiny keyboard—the HP-48SX is a 
worthy, affordable, and portable competitor. In this league, in addition to the 
usual desiderata for computer programs—power, flexibility, reliability, robustness, 
and user-friendliness—we look for 


e An integrated environment for doing mathematics. 
¢ The ability to treat problems in a natural and direct way. 
¢ The ability to construct powerful mathematical modules. 


Running Theorist after using Mathematica is like driving a bright red, two-seater 
sports car—dquick, lively, and eye-catching. From a standing start on the hard drive 
it’s up and ready to go in six seconds!. The much larger Mathematica requires 
more than four times as long. Theorist graphs the function y = sin5t¢/2‘ in under 
two seconds; Mathematica requires eight seconds; and the HP-48SX requires 22 
seconds for the same graph. 

Theorist is optimized for speed and efficient use of resources. For example, its 
default values produce graphs with relatively few subdivisions and its goal for 


Timings for Theorist and Mathematica are based on a Mac IICX. 
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calculations is 19 digits of precision,” sufficient for all but the most unusual 
circumstances and faster than the ‘‘arbitrary precision” claimed by other programs. 
Theorist produced the figure below in just a few seconds. 


7. tf Pd, y 
LETT 
ALLTT TT] 


zZ = sin xy 


The speed and responsiveness of Theorist’s graphics make for a wonderful 
interactive graphing environment in which it is easy and enjoyable to adjust 
viewing angles and perspective in real time to produce good-looking and informa- 
tive 3D graphs. Mathematica’s graphics are more powerful but its user interface is 
hot as responsive as Theorist’s. For example, in Theorist one adjusts the viewing 
angle for a 3D graph by clicking and dragging a box enclosing the graph. Both 
programs have the ability to produce animated sequences, although Mathematica’s 
is much more flexible. The HP-48SX, of course, is not in the same graphics league. 
Its size rules out a truly useful graphics screen, but it can produce rough 
2-dimensional graphics suitable for experimentation and has surprisingly powerful 
and useful graphics commands. 

Formulas in Theorist can be entered with enough pointing-and-clicking to keep 
the most hyperactive mouse happy although the documentation wisely encourages 
users to switch quickly to the faster keyboard entry. While Mathematica’s formula 
display harkens back to the days of teletypes, both Theorist and the HP-48SX 
display formulas the way we usually write them. 

Mathematica and the HP-48SX have extensive and well thought-out program- 
ming capabilities. Both encourage top-down programming and have intelligent and 
powerful data types. Both can treat functions as easily as other data types and both 
have much of the power and elegance of LISP. Using either the HP-48SX or 
Mathematica one rapidly builds up a library of powerful programs and, hence, of 
powerful concepts that enable one to deal with mathematics on a new level. This is 
where Theorist falls short. It has only a few very limited programming capabilities 
and one is forced to deal with mathematics at a very basic level. 

The HP-48SX and Mathematica encourage the treatment of problems in a 
natural and clean way. Learning either one is mentally stimulating and intellectu- 
ally rewarding. Both provide powerful and often new ways of conceptualizing 


*But on page 19 of the Theorist reference manual we find “‘To avoid unreasonable performance, we 
had to limit the precision of some algorithms to much less than nineteen digits, although the result 
should be sufficient for most purposes.” 
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mathematical problems. By contrast Theorist 1s stuck on a high school view of 
mathematics as mechanical symbolic manipulation. For example, the Theorist 
learning guide devotes eleven pages to integrating 


[va + bsin? @ cos 6 dé 


with the user guiding Theorist step-by-step through all the gory details. With 
Mathematica one types 


IntegratelLSqrtla + b SinltJ “2] Cos(tj], tJ 


and in less than three seconds 


alog(vb sin(r) + ya + bsin(t)” | . sin(t) Va + b sin(t)” 
eS 


appears. 


Case Studies: Doing Mathematics 


The Fibonacci sequence. The Fibonacci sequence is one of the standard exam- 
ples illustrating the power and computational pitfalls of recursion. It is discussed in 
the documentation for all three of the systems under discussion. The sequence is 
defined by 

Fo=F,=1, F,=F,_, + F_>. 
The obvious program in any language 


FibCO] = 1 
FibCi] = 1 


FibCn_J] Fib£€Cn - 1] + Fib£En - 2] 


runs in exponential time since it requires the computation of F,_, and of F,_, 
from scratch to determine F,. This problem is very general. For example, the 
obvious program for simulating the discrete logistic model 
Pr — ap, (1 a bp,,—1) 
also requires exponential time since it makes two function calls for p,_, in the 
computation of p,. With both Mathematica and the HP-48SX there are easy ways 
to avoid this and these ways are discussed in the documentation. In Mathematica, 
for example, the construction 
Fib€n J :=Fib€n] = Fib€n - 1] + FibE€n - 2] 


saves the result of each computation for future use. Perhaps the best way to look 
at the problem for the logistic model is by rewriting it in the form 


f(p) = ap(1 — bp) 
Pr — f( Pr _ 1) 
The obvious program corresponding to this formulation 


FCp J := a p (1 - b p) 
PCn_ J == FCPCn - 1)] 
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runs very quickly and focuses attention where it belongs—on iteration of the 
function f(p). 
We can treat the Fibonacci sequence in the same way using vectors. 


G(X) = (%2, x, + x2) 

U. 2~ (1, 1) 

U, — G(v,,_,). 
Besides solving the computation problem this solution focuses attention on the 
two-dimensional nature of the problem. This solution is very natural using either 
Mathematica or the HP-48SX and appears to be the only solution using Theorist?, 
but the Theorist reference manual gives up. It suggests computing the Fibonacci 
sequence by using the obvious recursion to compute F, and F, and then adding 
these as “initial conditions” to compute the next few terms in the sequence. In this 


way one could theoretically compute as many terms as desired, seven at a time. 
The following Theorist notebook shows the more elegant solution. 


(oq) = lava 


| (a s 2) 
(2) (@) - sont 1) (2<@) 


50 
0 <— (100 - x) —> 
x-> 100 
Obj -50 


Refraction problem 


A Typical Calculus Problem—Refraction. Suppose that we want to calculate 
the path followed by a light ray traveling from an underwater object to an 
abovewater eye as shown in the figure above. More generally, we can suppose that 
the eye is located at (a,b) and the object at (p,q). According to Fermat’s 
Principle, light will follow the path that minimizes the total travel time from the 
object to the eye. Since the speed of light in air is roughly 30 cm/ns and the speed 
of light in water is roughly 22.5 cm/ns we find this point by minimizing the 


function 
V(p-x)°+q@ V¥(x—a)° +b? 
+ 


T — 
(x) 22.5 30 


>Theorist, however, has a very low limit for the depth of recursion. 
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The following is a Theorist notebook solving this problem. 


(e)a=0 
(e) b = 50 
(e} p = 100 
(e}q = —50 
a(v[p-x] +@ V[x-a]’ +0? 
yal as * 3 


a 
EE 
E 


This notebook does not show the required user interaction. The user must 
request a graph by pointing-and-clicking, then choose suitable x and y limits 
(Theorist does not even provide automatic y-axis scaling) and, having visually 
isolated the root, request a root. The following printout shows a Mathematica 
notebook solving the same problem. 


TEx , a, b, p,q J := Sqrtl(p-x) 2 + q*2)]/22.5 + 
SqrtlC(x-a) “2 + b“2)]/30 


Tprimelx , a,b, q_ J >= DCTLCx,a,b,p,ql, x] 


p_», 
Answerla_ , b , p,q _]: 
FindRoot CTprimel[x, a,b,p,q] ==0, {x, (at b) / 2}] 


AnswerLlO0O, 50, 100, - 50] 
{x -> 63.5253] 


These two solutions show some of the differences between Mathematica and 
Theorist. The Mathematica solution is broken down in the natural way 


¢ Define the function T(x) that describes the total travel time if the light ray 
intersects the surface of the water at the point x. 

¢ Differentiate the function T(x). 

e Solve the equation T’(x) = 0. 


The structure of Mathematica encourages this solution because of the facility 
with which it handles functions. Theorist handles functions in a very muddy way. 
One cannot in a natural way define a function 7(x), then differentiate it to obtain 
T(x), and then solve the equation T’(x) = 0 as we did in Mathematica. Indeed, 
Theorist cannot differentiate functions—only expressions. It requires that we 


define a new variable y in one line by differentiating the expression defining the 
function T(x). 
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All the user involvement with Theorist is nice the first few times but it prevents 
progressing further. For example, I like to ask my students to solve real problems. 
One particularly nice problem is finding the apparent position and shape of an 
underwater object as viewed by an abovewater person using binocular vision*. With 
Mathematica or the HP-48SX it is easy and natural to solve more difficult 
problems like these using the function 


answerLla,b,p,qJ] 


as a building block. With Theorist a student would spend most of his or her time 
repeating the same sequence of calculations. 


Conclusions 


Although Theorist is a worthwhile program with fast and powerful graphics 
which are a joy to use interactively, both the HP-48SX and Mathematica are more 
powerful environments for doing useful mathematics. Mathematics progresses in 
large part by building new concepts that allow us to work in broader strokes. 
Because it is impossible to build mathematical modules within Theorist, the user is 
confined to a basic level. Theorist is not flexible, in large part because it lacks 
powerful and general programming constructs. It is often difficult or impossible to 
approach a problem in Theorist in the most natural way. Theorist does symbolic 
manipulation well but because of the massive user interaction required to solve 
even the most straightforward problems, this ability is useful for solving schoolbook 
problems rather than doing serious work. 

For fast, responsive graphics and interactive mechanical algebraic manipulation 
choose Theorist. For serious work choose Mathematica or the HP-48SX. 


“If the viewer’s two eyes and the object determine a plane perpendicular to the surface of the water 
then the apparent position of the object is shifted toward the viewer and upward. If the viewer’s two 
eyes are the same height above the surface of the water and the same distance from the object then the 
apparent position of the object is shifted directly upward from its actual position. Generically, the two 
apparent lines of sight do not intersect. The mathematics predicts interesting optical phenomena easily 
observable with a ten gallon aquarium. 
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General, T(14), S, C, L. Microcomputers and 
Mathematics. J.W. Bruce, P.J. Giblin, P.J. Rippon. 
Cambridge Univ Pr, 1990, xvi + 425 pp, $90; $29.95 
(P). (ISBN: 0-521-37515-0; 0-521-31238-8] An in- 
teresting collection of topics to be investigated by 
students using computers. The mathematical pre- 
requisites vary from high school algebra up through 
calculus, and is accessible to freshmen and sopho- 
mores. Topics include elementary number theory, 
curves, differential equations, and iteration of func- 
tions. Abundant examples, exercises, and sample 
computer programs. The programs are written in 
Basic, but are easily transferrable to other environ- 
ments (such as Mathematica). Overall, an excel- 
lent reference book for anyone wishing to incorporate 
computing into the lower-level mathematics curricu- 
lum. Note the price. MPR 


General, P, L**. A.J. Lohwater’s Russian-English 
Dictionary of the Mathematical Sciences, Second 
Edition, Revised and Expanded. Ed: R.P. Boas. 
AMS, 1990, xi + 343 pp, $35 (P). (ISBN: 0-8218- 
0133-3] Long-awaited update of this important ref- 
erence work, first published thirty years ago. Open- 
ing grammar synopsis has been rewritten; stress 
marks have been added to Russian words; and the 
vocabulary has been “extensively enlarged” to reflect 
contemporary Russian writing in the mathematical 
sciences. Intended only to aid translation from Rus- 
sian to English, not vice versa. LAS 


General, C, P. User Manual for DERIVE, Ver- 
ston 2: A Mathematical Assistant for Your Personal 
Computer. (IBM PC Software.) Albert Rich, Joan 
Rich, David Stoutemyer. Soft Warehouse, 1990, viii 
+ 244 pp, $250 (P). The most significant enhance- 
ment in this new version is programmability. Users 
can now augment the collection of built-in functions 
with user-defined functions. The new programming 
capability allows the definition of iterative and recur- 
sive functions. In addition to internal improvements 
in the software, the User Manual has been exten- 
sively revised and expanded. Despite the increased 
functionality, DERIVE continues to run on any IBM 
PC or compatible computer with at least 512K of 


memory and MS-DOS 2.1 or later. AO 


Elementary, T(13: 1). Elementary Geometry, 
Third Edition. R. David Gustafson, Peter D. Frisk. 
Wiley, 1991, ix + 446 pp, $43.95. (ISBN: 0-471- 
51002-5] Changes in this edition include revision 
of the chapter on logic, additional material on an- 
alytic geometry, surface area and volume, more al- 
gebraic problems, and new end-of-chapter review ex- 
ercises. Remains a very traditional presentation of 
high school Euclidean geometry, complete with two- 
column proofs. (First Edition, TR, April 1973; Sec- 
ond Edition, TR, February 1986.) JNC 


Elementary, S*(7-13). Polynomials. Tom M. 
Apostol. Project Mathematics! California Institute 
of Technology, 1991, 28 minute videotape. Program 
Guide and Workbook, 30 pp, (P). A half-hour video- 
tape motivated by basketball shots that uses effec- 
tive video graphics to illustrate linear, quadratic, 
cubic, and higher polynomials. An animated hand 
crank simultaneously varies parameters and shapes 
of graphs; on-screen algebraic choreography helps re- 
late algebraic forms to visual representations, espe- 
cially of factors and zeros. Accompanying Workbook 
provides reinforcement, extensions, and exercises. 
Available from state Departments of Education, from 
the MAA, and from the Cal Tech bookstore; freely 
reproduceable for educational purposes. LAS 


Mathematics Appreciation. For All Practical 
Purposes: Introduction to Contemporary Mathemat- 
ics, Second Edition. COMAP. WH Freeman, 1991, 
xvii + 626 pp. (ISBN: 0-7167-2115-5] The new edi- 
tion of this popular liberal arts survey of contempo- 
rary mathematics contains a welcome doubling of the 
exercise sets, together with key revisions in selected 
chapters. The optional accompanying videotape se- 
ries (not revised) remains well-coordinated with the 
text. Updated instructor, telecourse guides avail- 
able. (First Edition, TR, February 1988; Extended 
Review, May 1989.) RB 


Mathematics Appreciation, S, C. Cube and 
Tess. (Macintosh Software.) Atlantic Software (PO 
Box 299, Wenham, MA 01984), 1988, 21 pp, $24.95 
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(P). A computer simulation of Rubik’s Cube and a 
four-dimensional version called Tess (for tesseract). 
Moves are made by clicking; sequences of moves can 
be recorded into procedures for repeated execution 
and saved in files for later use. Includes two such 
files of pre-written procedures for each puzzle. Runs 
on any Macintosh. LAS 


Mathematics Appreciation, S(9-13). The Story 
of x. Tom M. Apostol. Project Mathematics! Cali- 
fornia Institute of Technology, 1989, 24 minute video- 
tape. Program Guide and Workbook, 30 pp, (P). 
A visual, historical introduction to x, produced in 
a thoroughly professional manner. Simple applica- 
tions, Archimedes’ discovery, computation, and ex- 
tensions (lattices, random numbers, Buffon needle). 
Useful at many levels, from junior high to college 
classes in quantitative literacy. Workbook contains 
commentary and exercises. Available from state De- 
partments of Education, from MAA, and from the 
Cal Tech bookstore. LAS 


Mathematics Appreciation, T(13-15: 1). The 
Nature of Mathematics, Sizth Edition. Karl J. 
Smith. Brooks/Cole, 1991, xv + 816 pp. (ISBN: 0- 
534-13914-0] A warm, “student-friendly” approach 
to mathematical literacy (reasoning, computers, 
numbers, algebra, geometry, probability, statistics) 
for those who learned little (and remember less) of 
high school mathematics. Three problem levels make 
the text suitable for courses at a wide range of prereq- 
uisites, from none to intermediate algebra. Countless 
marginal asides enrich the exposition with histori- 
cal, social, and humorous tidbits. Numerous supple- 
ments are available to aid instructors. LAS 


Precalculus, S*, L. Functions and Graphs. I.M. 
Gel’fand, E.G. Glagoleva, E.E. Shnol. Birkhauser 
Boston, 1990, ix + 105 pp, $14.50 (P). [ISBN: 0- 
8176-3532-7] First in a series of self-study book- 
lets written in 1966 (in Russian) to support Moscow 
State University’s mathematical “Correspondence 
School,” an outreach effort to students aged 12-17 in 
small, remote villages of the USSR. This pamphlet is 
designed to help students learn to “see” formulas and 
functions and observe the way changes in functions 
are reflected in changes in their graphs. First trans- 
lated into English by the University of Chicago; orig- 
inal English edition was published in 1967 by MIT 
Press (TR, October 1969). This new printing makes 
these attractive books available to a new generation 
of students. LAS 


Education, P*, L*. Professional Standards for 
Teaching Mathematics. NCTM, 1991, viii + 196 
pp, $25 (P). (ISBN: 0-87353-307-0] A companion 
to NCTM’s influential 1989 Standards for curricu- 
lum and evaluation (TR, June-July 1989), this vol- 
ume describes 24 standards for teaching mathemat- 
ics, for evaluation of the teaching of mathematics, 
and for the support and development of mathematics 
teachers. Emphasizes the need for teachers—and the 
colleges who educate teachers—to create classroom 
communities, to promote reasoning, to encourage 
conjecturing, and to build connections. Expounds 
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a bold vision of what teachers should know and be 
able to do, how they should be educated, evaluated, 
and sustained as professionals. LAS 


Education, §(17-18). Algebra for Everyone. Ed: 
Edgar L. Edwards, Jr. NCTM, 1990, vi + 89 pp, 
$7 (P). [ISBN: 0-87353-297-X] Seven essays based on 
the premise that algebra can and should be taught 
to all students. Explores issues, delineates funda- 
mental algebraic concepts and processes, and pro- 
vides teaching ideas. Includes annotated resource list 
organized into categories that include expectations, 
equity, learning styles, cooperative learning, mate- 
rials, manipulatives, software, successful programs, 
and funding sources. MW 


Education, P, L. Technology in Education: Look- 
ing Toward 2020. Raymond S. Nickerson, Philip P. 
Zodhiates. Lawrence Erlbaum Assoc, 1988, xviii + 
330 pp, $22.50 (P); $59.95. [ISBN: 0-8058-0297-5; 
0-8058-0214-2] A collection of fourteen papers on 
the long-term role of technology in the teaching of 
mathematics, science, and computing at the primary 
and secondary school levels. The papers in this vol- 
ume were originally prepared for a conference held 
October 15-17, 1986. AO 


Education, T(16-17). Mathematics for the Young 
Child. Ed: Joseph N. Payne. NCTM, 1990, xii + 
306 pp, $39.50. [ISBN: 0-87353-288-0] Research- 
based strategies for teaching mathematics to children 
from pre-school through grade four. Major goals 
are to enable children to construct their own knowl- 
edge and understandings, and to develop procedural 
knowledge from a strong conceptual knowledge base. 
Makes connections with recent research, reform doc- 
uments. Covers whole number concepts, fractions 
and decimals, geometry and measurement, proba- 
bility, graphing, and microcomputers, all from a 
problem-solving and constructivist perspective. MW 
History, S(17-18), P, L. Ein Jahrhundert Mathe- 
matik 1890-1990. Gerd Fischer, et al. Dokumente 
zur Geschichte der Mathematik, Band 6.  Friedr 
Vieweg & Sohn, 1990, xii + 830 pp, DM 198. (ISBN: 
3-528-06326-2] Nineteen papers surveying the de- 
velopment of eighteen special fields and the history 
of the Deutsche Mathematiker Vereinigung during its 
first hundred years. Contains a fascinating account 
of mathematics under the Nazis. JD-B 


History, P, L*. John von Neumann and the Ori- 
gins of Modern Computing. William Aspray. Hist. 
of Comput. MIT Pr, 1990, xvii + 376 pp, $35. (ISBN: 
0-262-01121-2] A revealing archive-based portrait 
of the many dimensions of von Neumann’s pioneer- 
ing work in computing. As a central leader in 
the complex post-war interactions of science, mili- 
tary, and industry, von Neumann not only laid down 
the principles for stored program computers, but he 
also helped initiate theories of automata, reliability, 
self-replication, and memory in biological systems. 
Text reveals close ties to numerical meteorology and 
applied mathematics. Well-illustrated with pho- 
tographs from family and government archives. LAS 


History, P, L*. An Introduction to the His- 
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tory of Structural Mechanics. Edoardo Benvenuto. 
Springer-Verlag, 1991, $49 each. Part I: Statics and 
Resistance of Solids, xxi + 306 pp, [ISBN: 0-387- 
96227-1]; Part II: Vaulted Structures and Elastic Sys- 
tems, xxi + 245 pp. (ISBN: 0-387-97187-4] An out- 
growth of but not a translation of La Scienza delle 
Costruzioni e tl suo Sviluppo Storico, Sansoni Pub- 
lishers, 1981. In a Foreward, Clifford Truesdell says, 
“The first to show how statics, strength of materials, 
and elasticity grew alongside existing architecture 
with its millenial traditions, its host of successes, its 
ever-renewing styles, and its numerous problems of 
maintenance and repair—(Benvenuto) thinks clearly, 
clearly organizes his material, difficult and compli- 
cated as it seems, and writes clearly with direct and 
masterly expression—a general, pioneering treatise— 
any place you open this book and read in it, you will 
be fascinated by what is there.” The translation from 
Italian to English is especially well done, without loss 
of charm. JDEK 


Logic, P, L. The Semantic Foundations of Logic, 
Volume 1: Propositional Logics. Richard L. Epstein. 
Nijhoff Intern. Philosophy Ser., V. 35. Kluwer Aca- 
demic, 1990, xxii + 386 pp, $99. (ISBN: 0-7923-0622- 
8] Presents history, philosophy, and mathematics of 
major systems of propositional logic: classical logic, 
dependence logics, modal logic, many-valued logics, 
intuitionism, paraconsistent logic. Proposes general 
framework for semantics for propositional logics and 
theory of translations between logics. KES 


Graph Theory, P. Field Guide to Simple Graphs. 
Peter Steinbach. Design Lab (6101 Sequoia, NW, Al- 
buquerque, NM 87120), 1990, vii + 134 pp, $20 (P). 
Drawings of all nonisomorphic, simple, connected 
graphs with at most seven vertices and all such regu- 
lar graphs with at most ten vertices, each with basic 
properties identified. Drawings of all trees with at 
most twelve vertices; summary tables of numbers of 
graphs with specified property. Drawings generated 
“by eye.” JPH 

Graph Theory, P, L. Paths, Flows, and VLSI- 
Layout. Eds: Bernhard Korte, et al. Algorithms & 
Combinatorics, V. 9. Springer-Verlag, 1990, xxii + 
383 pp, $69. (ISBN: 0-387-52685-4] Survey chap- 
ters based on a 1988 workshop on graph-theoretic 
aspects of very-large-scale-integrated circuits. BC 


Combinatorics, P. Mathematics of Ramsey The- 
ory. Eds: Jaroslav Nedetiil, Vojtéch Rodl. Algo- 
rithms and Combinatorics, V. 5. Springer-Verlag, 
1990, xiv + 269 pp, $69. [ISBN: 0-387-18191-1] Ex- 
pository surveys and research on classical and recent 
resultyin Ramsey theory of sets (finite and infinite), 
graphs, numbers, Euclidean geometry, and topolog- 
ical spaces. Also complexity and independence re- 
sults, and techniques of ergodic theory. Articles by 
leaders in the field. JPH 


Combinatorics, P. Finite Geometries and Combi- 
natorial Designs. Eds: Earl S. Kramer, Spyros S. 
Magliveras. Contemp. Math., V. 111. AMS, 1990, 
xviii + 312 pp, $53. (ISBN: 0-8218-5118-7] Pro- 
ceedings of an AMS special session held in Lin- 
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coln, Nebraska during October 29-November 1, 1987. 
Contains 23 papers. AD 


Number Theory, T*(15-18: 1, 2), S*, L**. 
An Introduction to the Theory of Numbers, Fifth 
Edition. Ivan Niven, Herbert S. Zuckerman, Hugh 
L. Montgomery. John Wiley, 1991, xiii + 529 pp, 
$50.95. (ISBN: 0-471-62546-9] A classic re-born. 
Earlier editions appeared in 1960 (original), 1966 
(Second), 1972 (Third, TR, June-July 1972), and 
1980 (Fourth, TR, December 1980). This edition re- 
flects many of the remarkable new developments in 
number theory of the past decade (e.g., public key 
cryptography, elliptic curves and Falting’s theorem, 
pseudoprime tests, Lenstra’s factorization) as well as 
other enrichments (e.g., geometry of numbers, sys- 
tems of linear Diophantine equations, numerous cal- 
culational issues). Extensive notes tie each section to 
historical and bibliographic roots. Can be used either 
for a one- or two-term course: selected early sections 
provide a gentle pace suitable for novices. LAS 


Number Theory, S(15-16), C, P. Number The- 
ory Package. Donald E.G. Malm. Oakland Uni- 
versity (E-mail: malm@argo.acs.oakland.edu), 1990, 
(P). Three implementations (for Turbo Pascal on 
IBM clones and on Macintosh, and for UBASIC on 
IBM clones) of a suite of forty research-level subrou- 
tines and function calls for number theory, imple- 
menting many current algorithms for efficient com- 
putation. Includes examples showing how the rou- 
tines can be put together, and a list of all procedures 
with descriptions and limitations. Samples: Baille- 
Wagstaff Lucas pseudo-prime test; elliptic curve 
method to factorize; Lehmer’s algorithm for linear 
congruences, Preferred distribution by anonymous 
ftp to vela.acs.oakland.edu, in /pub/numbdroid 
subdirectory. Also available on disk. LAS 


Linear Algebra, P, L. Nonnegative Matrices in 
Dynamic Systems. Abraham Berman, Michael Neu- 
mann, Ronald J. Stern. Wiley, 1989, xix + 167 pp, 
$36.95. [ISBN: 0-471-62074-2] 


Group Theory, S*, C. Ezploring Small Groups: 
A Tool for Learning Abstract Algebra. (IBM PC 
Software.) Ladnor D. Geissinger. Harcourt Brace 
Jovanovich, 1989; User’s Manual, vi + 31 pp, (P). 
(ISBN: 0-15-526011-1] Award-winning package pro- 
viding numerous examples of operation tables and 
groups with on-screen questions (about e.g., order, 
commuting, elements, subgroups, conjugates, com- 
mutativity, associativity, inverses). New tables can 
be created randomly or by typing. Information gen-’ 
erated by the programs is saved for display and re- 
view in various screens. The User’s Manual contains 
a series of seventy substantial exercises to help guide 
a student through the exploration of a full comple- 
ment of properties. LAS 


Algebra, T(15-17: 1, 2), L. Fundamental Con- 
cepts of Abstract Algebra. Gertrude Ehrlich. PWS- 
Kent, 1991, xi + 340 pp, $38. (ISBN: 0-534-92455- 
7] Written to serve as a text for either one or two 
semesters, there is some substantial classical algebra 
here. Begins with a brief, helpful history of alge- 
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bra and some set theory. Chapter on group theory 
includes structure of finite Abelian groups and Sy- 
low theorems. Chapter 3 starts with rings includ- 
ing principal ideal domains and Euclidean domains, 
concludes with modules and linear algebra. Conclud- 
ing chapter treats fields, Galois theory, solvability by 
radicals, and geometric constructions. Numerous ex- 
ercises (no answers), references, index. JS 


Algebra, T(13: 1). Investigations in Algebra. Al- 
bert Cuoco. MIT Pr, 1990, 623 pp, $29.95. (ISBN: 0- 
262-53071-6] Logo-based experiments in functions, 
mathematical induction, combinatorics, and number 
theory for students who have a background in in- 
termediate algebra and a basic knowledge of Logo 
programming. “Investigations” is the key word: the 
author considers the exploration process much more 
important than the content. Suitable for bright high 
school or beginning college students as an alternative 
to calculus. MW 


Calculus, T(13-14: 1, 2), S, L. Ezplorations in 
Calculus with a Computer Algebra System. Donald 
B. Small, John M. Hosack. McGraw-Hill, 1990, xiv 
+ 226 pp, (P). (ISBN: 0-07-058267-X] A collection 
of about forty brief, computer-oriented units, each 
with open-ended exercises, on topics in elementary 
calculus. Topics are chosen to illustrate the power of 
“computer algebra systems” for graphical, numeri- 
cal, and symbolic calculations. The CAS Maple is 
used in most examples, but the book is not tied 
to any one system: “translation” tables are given 
for parallel commands in Maple, Mathematica, Ma- 
cysyma, and Derive. In addition, an introductory 
chapter treats CAS’s in general, and suggests a ratio- 
nale for their use in undergraduate mathematics. PZ 


Calculus, S(13), C. A Guide to Calculus T/L: 
A Program for Doing, Teaching, and Learning Cal- 
culus. J. Douglas Child. Macintosh Software and 
User Guide. Brooks/Cole, 1990, x + 181 pp, (P). 
(ISBN: 0-534-11817-8] A user-friendly graphical in- 
terface to the symbolic mathematical package Maple 
designed to enable calculus students to explore stan- 
dard topics (functions, graphs, derivatives, integrals, 
limits, sequences, series) on an electronic whiteboard 
where objects can be written, moved, combined, and 
manipulated much as on a sheet of paper. Com- 
mands (e.g., Define f by f(x) = sin x) are ex- 
pressed in full, understandable English, displayed for 
students to choose—so commands don’t have to be 
remembered or typed precisely. Graphs (of functions 
of one variable only) are easy, as are zooms and scal- 
ing. An “authoring” environment enables instructors 
to pfepare scripts to be redisplayed and explored in 
class or labs. By clicking “Talk to Maple” one can 
pre-empt Calculus T/L and interact directly with the 
underlying Maple symbolic system. Requires a Mac 
Plus or later model, at least 1M of memory, and a 
hard disk. LAS 


Calculus, T(13: 3). Caleulus, Fifth Edition. Earl 
W. Swokowski. PWS-Kent, 1991, xxiii + 1159 pp, 
$48, (ISBN: 0-534-92492-1]; Calculus of a Single 
Variable, 1991, xxi + 765 pp. [ISBN: 0-534-92584- 
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7] A revision of Calculus with Analytic Geometry, 
Alternate Edition; this edition expands discussions, 
includes guidelines for solving problems, and adds 
applications to the design of computers and climatic 
effects. Attractively redesigned with new graphics, 
some changes in presentation, and some new ap- 
plications; this remains a traditional calculus text. 
Calculus of a Single Variable consists of the first 
thirteen chapters of Calculus, Fifth Edition. (First 
Edition, TR, December 1975; Second Edition, TR, 
June-July 1979; Third Edition, TR, November 1984; 
Fourth Edition, TR, December 1988.) JNC 


Real Analysis, T(16-17: 2), L. Modern Analysis: 
Measure Theory and Functional Analysis with Appli- 
cations. William H. Ruckle. PWS-Kent, 1991, xv + 
265 pp. (ISBN: 0-534-92164-7] Intended for a first- 
year analysis course for graduate students. Chap- 
ter one is a review of set theory, the real line, and 
linear algebra. The other chapters concern metric 
spaces, measure and integration, functional analy- 
sis (e.g., Hahn-Banach Theorem), function and se- 
quence spaces, and applications (e.g., Picard’s The- 
orem, ergodic theory). MLR 


Real Analysis, P. Lecture Notes in Mathematics- 
1446: Methods of Nonconvez Analysis. Ed: A. Cel- 
lina. Springer-Verlag, 1990, v + 206 pp, $24 (P). 
[ISBN: 0-387-53120-3] Lectures given at the first 
session of the Centro Internazionale Matematico Es- 
tivo (CIME) held at Varenna, Italy, June 1989. Con- 
tains seven lectures. MLR 


Real Analysis, P. Lectures on Singular Integral Op- 
erators. Michael Christ. CBMS Reg. Conf. Ser. in 
Math., No. 77. AMS, 1990, ix + 132 pp, $27 (P). 
[ISBN: 0-8218-0728-5] Expansions of lectures given 
at the NSF-CBMS conference on singular integral 
operators held at the University of Montana, August 
1989. Topics include the Cauchy integral on Lips- 
chitz curves and the T(1) theorem. MLR 

Complex Analysis, T**(16-17: 1, 2), S, P, 
L. Complez Variables, Second Edition. Stephen D. 
Fisher. Math. Ser. Wadsworth, 1990, xiv + 427 
pp, $49.95. (ISBN: 0-534-13260-X] A light revision 
of the First Edition (TR, December 1986). An at- 
tractive, readable, and enthusiastic—yet rigorous— 
introduction to theory, methods, and applications 
of complex analysis in one variable. Although for- 
mal prerequisites (multivariable calculus, mainly) are 
few, exposition is at advanced undergraduate or be- 
ginning graduate level. Excellent, varied problem 
sets will challenge anyone, and point to a wealth 
of related topics. Many figures support an unusu- 
ally concrete, geometric viewpoint. Most changes 
from previous edition are local—rewriting for clar- 
ity, new exercises, etc. One more substantial change: 
Cauchy’s theorem, assuming continuity of deriva- 
tives, is obtained more directly than formerly using 
Green’s theorem. PZ 


Differential Equations, T(18: 1), S, P. Sturm- 
Liouville and Dirac Operators. B.M. Levitan, I.S. 
Sargsjan. Math. & Its Applic., V. 59. Kluwer Aca- 
demic, 1991, xi + 350 pp, $149. [ISBN: 0-7923-0992- 


1991] 


8] Parts I and II are parallel] developments of the 
theory and some applications for Sturm-Liouville and 
one-dimensional Dirac operators. The pattern is sim- 
ilar, proceeding through the spectral theory for the 
regular and singular cases, the spectrum, distribution 
of eigenvalues, and inverse problems. References, in- 
dex. JS 


Differential Equations, P. Theory of Impulsive 
Differential Equations. V. Lakshmikantham, D.D. 
Bainov, P.S. Simeonov. Ser in Modern Appl. Math., 
V. 6. World Scientific, 1989, x + 273 pp, $58. [ISBN: 
9971-50-970-9] 

Partial Differential Equations, P. Stochastic 
Evolution Systems: Linear Theory and Applications 
to Non-linear Filtering. B.L. Rozovskii. Math. & 
Its Applic., V. 35. Kluwer Academic, 1990, xviii 
+ 315 pp, $129. [ISBN: 0-7923-0037-8] A com- 
prehensive treatment of stochastic partial differen- 
tial equations—a combination of differential equa- 
tions and probability theory—with applications in 
physics, chemistry, and control theory. BC 

Partial Differential Equations, T(17: 1, 2). 
Principles and Techniques of Applied Mathematics. 
Bernard Friedman. Dover, 1990, ix + 315 pp, $8.95 
(P). (ISBN: 0-486-66444-9] Unabridged and unal- 
tered republication of work first published by John 
Wiley in 1956 on how the study of abstract systems 
can help in the solution of concrete problems. For 
a one-year graduate course; prerequisites include lin- 
ear algebra and complex integration. Emphasis on 
techniques derivable from abstract theory, but stress 
is on the ideas and not upon a rigorous develop- 
ment. JDEK 


Numerical Analysis, S(18), P, L. Solving Linear 
Systems on Vector and Shared Memory Computers. 
Jack J. Dongarra, et al. SIAM, 1991, x + 256 pp, 
$19.75 (P). (ISBN: 0-89871-270-X] Written by alge- 
braists, not computer scientists, with applied math- 
ematicians in mind. Includes basic aspects of vec- 
tor and parallel processing, overview of current high- 
performance computers, and some aspects of imple- 
mentation and performance, but emphasis is on algo- 
rithms and software for dense coefficient systems and 
direct and iterative méthods for sparse ones. Block 
algorithms prevail. Useful glossary and extensive 
bibliography. DFA 

Operator Theory, T(18: 2), S, P. A Primer 
on Spectral Theory. Bernard Aupetit. Universi- 
text. Springer-Verlag, 1991, xii + 193 pp, $29.80 
(P). (ISBN: 0-387-97390-7] A (short) introduction 
to the techniques of subharmonic functions and an- 
alytic multifunctions in spectral theory. Assumes 
background in subharmonic functions, capacity, and 
functions of several complex variables. A friendly 
book. MLR 

Functional Analysis, T(17: 2), S, L. Ergodic 
Theory. Karl Petersen. Stud. in Adv. Math., V. 
2. Cambridge Univ Pr, 1989, xi + 329 pp, (P). 
(ISBN: 0-521-38997-6] The author’s point of view 
is that “ergodic theory consists of examples, conver- 
gence theorems, the study of various recurrence prop- 
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erties, and the theory of entropy.” Each is treated at 
a basic and advanced level. Assumes measure the- 
ory and functional analysis. Many exercises and a 
sizable bibliography. (1983 hardcover edition, TR, 
December 1984.) TAV 


Functional Analysis, P. Algebraic Ideas in Ergodic 
Theory. Klaus Schmidt. CBMS Reg. Conf. Ser. in 
Math., No. 76. AMS, 1990, v + 94 pp, $30 (P). 
(ISBN: 0-8218-0727-7] Presents the ideas that were 
the basis for the 1989 CBMS Conference at Seattle. 
Two topics are developed: the influence of opera- 
tor algebras on dynamics, with some emphasis on 
Markov shifts, and higher dimensional Markov shifts 
where little structure exists. TAV 

Functional Analysis, T*(16-17: 2), L*. Func- 
tional Analysts and Linear Operator Theory. Carl L. 
DeVito. Addison-Wesley, 1990, x + 358 pp, $45.25. 
(ISBN: 0-201-11941-2] Beautifully motivated ele- 
mentary introduction to basics of functional analysis. 
Topics include compact operators, spectral theorems, 
unbounded operators, and non-normalizable eigen- 
vectors. Good selection of problems with selected 
solutions at end. A very nice book! BH 


Functional Analysis, L. Approzimation Theory 
and Functional Analysts. Ed: Charles K. Chui. Aca- 
demic Pr, 1991, x + 247 pp, $49.95. [ISBN: 0-12- 
174583-X] Compilation of papers to honor George G. 
Lorentz on his 80th birthday. Eleven invited papers 
(mainly survey) on approximation theory and func- 
tional analysis presented at a two-day conference at 
Texas A & M in February 1990. Includes a brief au- 
tobiography, lists of Lorentz’s publications, Lorentz’s 
doctoral students, and participants. JDEK 
Analysis, S(18), P, L. Topics in Metric Fized 
Point Theory. Kazimierz Goebel, W.A. Kirk. Stud. 
in Adv. Math., V. 28. Cambridge Univ Pr, 1990, viii 
+ 244 pp, $49.50. (ISBN: 0-521-38289-0] Assumes 
basic knowledge of analysis and topology (e.g., Ba- 
nach spaces are not defined but provide the usual 
setting for the theory presented). First two-thirds of 
the text deals with classical theory of nonexpansive 
mappings, last one-third deals with variety of more 
contemporary topics. Extensive bibliography. BH 


Analysis, P. Computer Aided Proofs in Analysis. 
Eds: Kenneth R. Meyer, Dieter S. Schmidt. Math. & 
Its Applic., V. 28. Springer-Verlag, 1991, xvi + 251 
pp, $32. [ISBN: 0-387-97426-1] Twenty-one papers 
from a 1989 conference at the University of Cincin- 
nati. BC 


Analysis, P. Recent Advances in Fourier Analysis 
and Its Applications. Eds: J.S. Byrnes, Jennifer L. 
Byrnes. NATO ASI Ser. C, V. 315. Kluwer Aca- 
demic, 1990, x + 686 pp, $174. [ISBN: 0-7923-0875- 
1] Forty-one papers from a 1989 conference. Ap- 
plications run the gamut from X-ray crystallography 
to neurocomputer architectures. Special attention to 
polynomials with restricted (e.g., unimodular) coef- 
ficients. BC 

Analysis, T(14-17). Real Analysts. Norman B. 
Haaser, Joseph A. Sullivan. Dover, 1991, ix + 341 
pp, $8.95 (P). (ISBN: 0-486-66509-7] Intended to 
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provide a smooth transition from calculus to abstract 
analysis. Covers metric spaces, normed linear spaces, 
inner product spaces, and develops the Lebesgue in- 
tegral. Also includes topics involving approximation 
theory, the Banach fixed point theorem and its appli- 
cations, and Stieltjes integrals. (1971 Van Nostrand 
Reinhold edition, TR, August-September 1971.) KS 
Algebraic Geometry, T(17-18: 1), S, P, L. 
Algebraic Geometry for Scientists and Engineers. 
Shreeram S. Abhyankar. Math. Surveys & Mono., 
No. 35. AMS, 1990, xiii + 295 pp, $87. [ISBN: 
0-8218-1585-0] From the Preface: “I am simply 
resurrecting the concrete and ancient methods of 
Shreedharacharya (500 A.D.), Bhaskaracharya (1150 
A.D.), Newton (1660), Sylvester (1840), Salmon 
(1852), Max Noether (1870), Kronecker (1882), Cay- 
ley (1887), and so on.” Not a bad bunch to bring 
back. BC 


Differential Geometry, P. Lecture Notes in 
Mathematics-1451: Global Geometry and Mathemat- 
tcal Physics. L. Alvarez-Gaumé, et al. Springer- 
Verlag, 1990, ix + 197 pp, $24 (P). (ISBN: 0-387- 
53286-2] Proceedings of a 1988 conference in Mon- 
tecatini Terme, Italy. Includes papers on moduli 
spaces, conformal field theory and string theory, and 
the geometry of the KP equation. JO 


Differential Geometry, T?(17-18: 1), S, P. In- 
troduction to Differential Geometry for Engineers. 
Brian F. Doolin, Clyde F. Martin. Pure & Appl. 
Math., V. 136. Marcel Dekker, 1990, xii + 163 pp, 
$45. [ISBN: 0-8247-8396-4] Written to provide con- 
trol engineers with the necessary background and 
vocabulary in differential geometry to read research 
papers in geometric and nonlinear control. Topics 
include manifolds, tangent spaces, vector fields, dif- 
ferential forms, Lie groups, and Grassmannian tech- 
niques for control theory. Some background in differ- 
ential geometry and topology is assumed since terms 
like diffeomorphism and second countable Hausdorff 
are used, but not defined. Only ten exercises; refer- 
ences. OJ 


Differential Geometry, T(17-18), P. Rieman- 
nian Geometry, Second Edition. Sylvestre Gallot, 
Dominique Hulin, Jacques Lafontaine. Universitext. 
Springer-Verlag, 1990, xiii + 284 pp, $29 (P). (ISBN: 
0-387-52401-0] This edition adds sections on sur- 
faces with constant negative curvature and on con- 
formal geometry, and makes some corrections. (First 
Edition, TR, June-July 1988.) OJ 


Geometry, T(18: 2), S, P. Lecture Notes 
in’ Mathematics-1441: Géométrie et théorte des 
groupes. M. Coornaert, T. Delzant, A. Papadopou- 
los. Springer-Verlag, 1990, x + 165 pp, $18 (P). 
[ISBN: 0-387-52977-2] Based largely on Gromov’s 
work on hyperbolicity, especially his 1988 paper 
“Hyperbolic Groups.” Discusses hyperbolic metric 
spaces and hyperbolic groups. Other subjects in- 
clude quasi-geodesics, isoperimetric inequalities, and 
isometries. Reviews basic facts of automata theory, 
and shows that hyperbolic groups are automatic. In 
French. AD 
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Geometry, T*(15: 1, 2). Geometry: A Metric 
Approach with Models, Second Edition. Richard S. 
Millman, George D. Parker. Undergrad. Texts in 
Math. Springer-Verlag, 1991, xiii + 370 pp, $39. 
[ISBN: 0-387-97412-1] Revised edition of work first 
published in 1981 (TR, May 1983). Topics covered 
are neutral geometry, theory of parallels, hyperbolic 
geometry, classical Euclidean geometry, Bolyai ap- 
proach to theory of area, and isometries. Metric 
approach. Emphasis on illustrating definitions and 
axioms with models. Added material in this edition 
includes some expository exercises. Basic prerequi- 
site is mathematical maturity; calculus helpful. Cur- 
rently available (from the second author): a program 
written in Pascal which allows graphical explorations 
and calculations in the Poincaré plane. Authors be- 
lieve that mathematics is learned by doing and not 
by reading. JDEK 


Geometry, S, C, P. Designer Fractal: Mathematics 
for the 21st Century. (Macintosh Software.) Peter 
Van Roy, Linda Garcia, Bernt Wahl. Dynamic Soft- 
ware (PO Box 7534, Santa Cruz, CA 95061), 1988. 
User Manual, 15 pp, (P). A design tool called Frac- 
taSketch that creates, saves, and prints fractal de- 
signs by replication of a template. Includes tools to 
create designs at several different levels of iteration; 
to magnify, reduce, and relocate the design once cre- 
ated; to save in the Clipboard or Scrapbook; to print 
on a laser printer at good resolution; and to create 
new templates. Calculates and displays the fractal 
dimension of each pattern. Includes several dozen 
prepared templates to produce innumerable designs, 
including all classic fractals. LAS 


Geometry, S, C, P. The Desktop Fractal Design 
System. (Macintosh Software.) Michael F. Barnsley. 
Academic Pr, 1990, Handbook, v + 40 pp, $39.95 
(P) set. (ISBN: 0-12-079064-5] An implementation 
of the Iterated Function System (IFS) for encod- 
ing fractal-generating affine transformations into ef- 
ficient codes (typically, a few lines, each containing 
six digits) from which the “chaos algorithm” can 
reproduce fractal pictures of arbitrary complexity. 
Software displays these fractal images, permiting dy- 
namic modification of IFS code to make changes in 
images visible. Also contains an implementation of 
the escape-time algorithm that computes attractor 
sets, Julia sets, and the Mandelbrot set. Handbook 
explains not only how to use software but also how 
IFS codes generate fractals, and how the two key 
algorithms can be implemented in simple Basic pro- 
grams. A useful supplement to Barnsley’s 1988 Frac- 
tals Everywhere (TR, April 1989; Extended Review, 
March 1990). Software fails to run on some newer 
Macintosh models. LAS — 


Geometry, S(9-13). The Theorem of Pythagoras. 
Tom M. Apostol. Project Mathematics! Califor- 
nia Institute of Technology, 1988, 22 minute video- 
tape; Program Guide and Workbook, 30 pp, (P). 
A motivational and instructional video introduction 
to right triangles: opens with typical survey prob- 
lems, covers standard algebraic formulation (includ- 
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ing Pythagorean triples), and then gives three visual 
proofs (Chinese, Euclid, dissection). Concludes with 
extensions to other shapes and dimensions. Intended 
for middle school students. Accompanying Work- 
book contains discussion, extensions, and exercises. 
Available from state Departments of Education, from 
MAA, and from the Cal Tech bookstore. LAS 


Algebraic Topology, T(17-18), S, P. Modern 
Geometry— Methods and Applications, Part III: In- 
troduction to Homology Theory. B.A. Dubrovin, 
A.T. Fomenko, S.P. Novikov. Transl: Robert G. 
Burns. Grad. Texts in Math., V. 124. Springer- 
Verlag, 1990, ix + 416 pp, $59. (ISBN: 0-387- 
97271-4] Develops singular homology and cohomol- 
ogy theory together. Emphasizes geometric aspects 
and avoids homological algebra where possible. Also 
presents de Rham and sheaf cohomologies. Wide 
range of topics, e.g., classification of surfaces, ho- 
motopy groups of spheres, Bott periodicity. Includes 
chapters on Morse theory and cobordism. Assumes 
background from Part II on smooth manifolds, ho- 
motopy theory, and fiber spaces. Few exercises; bib- 
liography. OJ 


Topology, T(18), S, P. Topological Fields and 
Near Valuations. Niel Shell. Pure & Appl. Math., 
V. 135. Marcel Dekker, 1990, xiii + 233 pp, $89.75. 
[ISBN: 0-8247-8412-X] A survey of the theory of 
topological fields which introduces the notion of a 
prevaluation as a tool to study locally bounded 
topologies. Requires a solid background in alge- 
bra and point set topology. Does not include ex- 
ercises. CEC 


Topology, T?(17), P, L. Introduction to Uniform 
Spaces. I.M. James. London Math. Soc. Lect. 
Note Ser., V. 144. Cambridge Univ Pr, 1990, 148 
pp, $24.95 (P). (ISBN: 0-521-38620-9] A uniform 
structure on a set X is a collection ] of relations 
on X such that 1) 0 is a filter on X x X; 2) each 
D € Q contains the diagonal of X x X; 3) DEQ 
implies D-1 € 2; 4) D € Q implies Eo E C D for 
some E € {. Topological spaces are generalizations 
of uniform spaces. The author examines the relation- 
ship between the two and also develops the theory of 
uniform transformation groups. Forty exercises at 
end of book. Rather dry presentation. BH 


Operations Research, T(15-16: 1). Introduc- 
tion to Linear Programming: Applications and Ez- 
tensions. Richard B. Darst. Pure & Appl. Math., 
V. 141. Marcel Dekker, 1991, xii + 353 pp. (ISBN: 
0-8247-8383-2] An elementary introduction to the 
theory and practice of solving linear programming 
problents. Also covers techniques for some closely 
related problems including quadratic programming, 
minimization of quadratic functions, and a variety of 
network problems. AO 


Operations Research, T(14-15), L. Linear Pro- 
gramming and Applications: A Course Tezt. Will 
McLewin. Input-Output (3 Wyndham Place, Lon- 
don), 1990, xiv + 216 pp, $33. [ISBN:0-904870-11-1] 
An elementary linear programming text. Uses both 
tableau-based and matrix-based approaches. Con- 
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tains the standard coverage of the simplex method, 
duality, sensitivity analysis, and the usual applica- 
tions to network, assignment, and game theory prob- 
lems. Discusses Khachian’s Ellipsoid Method, but 
does not mention Karmarkar’s technique. Limited 
number of exercises, mostly theoretical in nature. 
(1981 edition, TR, August-September 1982.) SM 


Operations Research, T(18: 1), P, L. Nonlinear 
Programming: Sequential Unconstrained Minimiza- 
tion Techniques. Anthony V. Fiacco, Garth P. Mc- 
Cormick. Classics in Appl. Math., V. 4. SIAM, 1990, 
xvi + 210 pp, $26.50 (P). (ISBN: 0-89871-254-8] A 
corrected, unabridged republication of the original 
classic, discussing techniques of solving constrained 
nonlinear programming problems using a sequence of 
unconstrained problems. Very well-written; part of 
a very important series published by SIAM. (1968 
John Wiley edition, TR, May 1969.) SM 


Operations Research, T(14-15: 1), S, L. Math- 
ematics Applied: An Introduction to Mathematical 
Modeling. William J. Adams. Pace & Pace, 1990, 
viii + 215 pp, $15.75 (P). An undergraduate text- 
book with minimal prerequisites. A semester of cal- 
culus and some knowledge of deductive logic is suffi- 
cient. Four approaches are covered: linear program- 
ming and random processes (the bulk of the book), 
geometry and differential equations. Plenty of exer- 
cises, some with solutions. Inexpensive. SM 


Dynamical Systems, 5*(12-16). The Beauty and 
Complezity of the Mandelbrot Set. John H. Hubbard. 
Science Television (Box 2498, Times Square Station, 
New York, NY 10108), 1989. School Edition, 47 
minute videotape, $49, (ISBN: 1-878310-03-8]; Uni- 
versity Edition, 73 minute videotape, $59. (ISBN: 
1-878310-02-x] A polished, well-motivated lecture by 
John Hubbard, beginning with Newton’s method ap- 
plied to quadratic iterations from a high school stu- 
dent’s perspective and concluding with a full explo- 
ration of Julia sets and the Mandelbrot’s set. Hub- 
bard lectures “the old fashioned way,” effectively and 
elegantly, using a whiteboard to draw suggestive pic- 
tures that help the viewer interpret many beautiful 
still and moving color graphics. Computer meth- 
ods are integrated very effectively: mouse movements 
demonstrate how different seed points create differ- 
ent orbits; an incredible zoom reveals the iterated 
complexity of the Mandelbrot set. Brief concluding 
section raises issues of complexity in analogy with 
DNA, offering Mandelbrot’s theory as a “message of 
hope for biology” that the genetic code may not be 
as complex as it now appears. The longer University 
Edition contains an extensive section on equipoten- 
tial lines, field lines, and fractional labels as a means 
of explaining in precise terms the relation between 
Julia sets and the Mandelbrot set. The incredible 
richness of ideas conveyed in these tapes would re- 
quire repeated viewings (with interspersed study) for 
full appreciation. LAS 


Dynamical Systems, S, C. Chaos: Mathemat- 
ics for the 21st Century. (Macintosh Software.) 
Bernt Wahl. Aerial Pr, 1989. A series of programs 
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that display graphs of various dynamical processes 
(Lorenz attractor, logistic map, Henon maps, pen- 
dulum, Van der Pol, predator-prey) with various op- 
tions for changing parameters, both in the underlying 
equations and in the form of the graphical represen- 
tation. Each process starts with default parameter 
values; other recommended values are available via 
buttons, as are user-provided parameters. LAS 


Dynamical Systems, S. Fractals on Hewlett- 
Packard Supercalculators, Preliminary Edition. Yves 
Nievergelt. New Directions in Math Ser. Wadsworth, 
1991, vi + 74 pp, $8 (P). [ISBN: 0-534-15037- 
3] A workbook on fractals (Julia sets, von Koch’s 
snowflake, Hausdorff dimension) with exercises (and 
sample programs) for HP series of programmable 
calculators (HP-28C, HP-28S, HP-48SX). Concludes 
with exploratory term projects. Provides mathemat- 
ical exposition for students at all levels—beginning 
with those who do not yet know complex numbers. 
Published without binding or cover to keep the price 
down! LAS 


Dynamical Systems, S**(12-16), P*. Fractals: 


An Animated Discussion. Heinz-Otto Peitgen, et al. 
WH Freeman, 1990, 63 minute videotape, $59.95 (In- 
dividual); $149.95 (Institutional). Documentation, 
20 pp. (ISBN: 0-7167-2213-5] Interviews with Ed- 
ward Lorenz and Benoit Mandelbrot set the stage for 
a wide-ranging visual and scientific introduction to 
fractal behavior (coast lines, self-similarity, butter- 
dy effect). Central exemplar is a pendulum swing- 
ing over three magnets illustrated via videotape and 
computer simulation. Effective historical and con- 
textual presentation; little direct mathematics (al- 
though much is given, with references, in the brief ac- 
companying Documentation). Mandelbrot discusses 
the origin of his discoveries, his philosophy of geom- 
etry and applied mathematics, and his perspective 
on mainstream mathematics of his generation. Fab- 
ulous films illustrate the relation between Julia sets 
and the Mandelbrot set; visual landscapes blend ar- 
tificial worlds with real footage; even the music is 
synthesized and composed from fractral algorithms. 
A true delight for the eye, for the mind, and for the 
mind’s eye. LAS 


Dynamical Systems, S**, C*, P*. The Beauty 
of Fractals Lab. (Macintosh Software.) Hartmut 
Jurgens, Heinz-Otto Peitgen, Dietmar Saupe. Mac- 
intosh Software by Thomas Eberhardt and Marc 
Parmet. Springer-Verlag, 1990, $49. [ISBN: 0-387- 
14205-3] A toolkit to construct full color Julia sets 
and the Mandelbrot set in 2, 2.5, or 3-dimensional 
perspectives using a variety of algorithms (some for 
speed, some for accuracy). Supports multiple win- 
dows with different images; full control over number 
of iterations, color (or gray-scale) mappings, distance 
estimates, image size, and light sources. Cursor tools 
(in a palette on each window) permit insertion of 
new seeds, blow-ups of selected regions, construction 
of Julia sets corresponding to points selected from 
the Mandelbrot set, display of first orbit steps of se- 
lected initial values, and visual display of distance 
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estimates. Images can be saved in Scrapbook for 
importation into other graphics or word processing 
packages. Includes several interesting demos of clas- 
sic images; requires a Mac II with at least 1 megabyte 
of memory running system 6.0.2 or later. LAS 
Control Theory, T(18), P, L. Decentralized Con- 
trol of Complez Systems. Dragoslav D. Siljak. Math. 
in Sci. & Eng., V. 184. Academic Pr, 1991, xiv + 
527 pp, $74.95. [ISBN: 0-12-643430-1] Discusses 
the control of large scale systems by a di.ide-and- 
conquer strategy. The central complex system is par- 
titioned into subsystems, each of which is subject to 
a less complex controller. The hope is to apply the 
ideas of distributed parallel processing to complex 
control problems. SM 

Probability, T(17-18: 2), S. Probability: The- 
ory and Ezamples. Richard Durrett. Wadsworth, 
1991, x + 453 pp, $53.95. [ISBN: 0-534-13206-5] 
Intended for a year course for students familiar with 
measure theory. In addition to the usual topics and 
numerous examples, the author includes material on 
stochastic processes including Markov chains on gen- 
eral state space, renewal theory, and central limit 
theorems for martingales. TAV 


Probability, T(14-15). A Primer in Probability, 
Second Edition, Revised and Ezpanded. Kathleen 
Subrahmaniam. Stat.: Textbooks & Mono., V. 111. 
Marcel Dekker, 1990, xi + 318 pp, $45. (ISBN: 0- 
8247-8348-4] Undergraduate, post-calculus course 
in probability. Includes introduction to statistical 
inference (simple testing theory). Numerous exam- 
ples and exercises as well as a discussion of various 
interpretations of probability and its role in scien- 
tific investigations. Includes standard topics. (First 
Edition, TR, March 1980.) MK 


Probability, P. Lecture Notes in Mathematics- 
1442: Quantum Probability and Applications V. Eds: 
L. Accardi, W. von Waldenfels. Springer-Verlag, 
1990, vi + 413 pp, $45 (P). [ISBN: 0-387-53026-6] 
Proceedings of a 1988 workshop held in Heidelberg. 
Thirty short papers on miscellaneous topics. JO 


Stochastic Processes, T(18: 2), P. Stochastic 
Flows and Stochastic Differential Equations. Hiroshi 
Kunita. Stud. in Adv. Math., V. 24. Cambridge 
Univ Pr, 1990, xiv + 346 pp, $69.50. (ISBN: 0-521- 
35050-6] The author attempts to develop the rela- 
tionship between stochastic flows of diffeomorphisms 
and stochastic differential equations, using the lat- 
ter to study the properties of the flows. Contains 
many examples and exercises, and a large bibliogra- 
phy. TAV 

Stochastic Processes, P. Stochastic Integrals: An 
Introduction. Heinrich von Weizsacker, Gerhard 
Winkler. Adv. Lect. in Math. Friedr Vieweg & Sohn, 
1990, ix + 332 pp, DM 78 (P). [ISBN: 3-528-06310-6] 
A nicely written introduction to stochastic integrals 
and the Ito calculus for semi-martingales. The ma- 
terial is deep and the reader should expect to spend 
effort grasping the details, but the subject is funda- 
mental in modern stochastic theory. TAV 


Stochastic Processes, P. Dependence with Com- 
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plete Connections and Its Applications. Marius 
Iosifescu, Serban Grigorescu. Tracts In Math., V. 
96. Cambridge Univ Pr, 1990, xiv + 304 pp, $69.50. 
[ISBN: 0-521-33331-8] Dependence with complete 
connection is a generalization of Markovian depen- 
dence. The book presents a theoretical development 
and several applications. Contains appendices to as- 
sist the readers with prerequisites and a very exten- 
sive bibliography. TAV 

Stochastic Processes, T(18: 2), P, L. Stationary 
Stochastic Models. Andreas Brandt, Peter Franken, 
Bernd Lisek. Wiley, 1990, 344 pp, $89.95. [ISBN: 
0-471-92132-7] Considers both discrete and contin- 
uous time models. Emphasizes queueing theory in 
the last half of the book. Tersely written in text- 
book style. No exercises. Recommended for readers 
with some prior experience in the field. Extensive 
references. SM 


Stochastic Processes, P. Stochastie Processes and 
their Applications in Mathematics and Physics. Eds: 
Sergio Albeverio, Philippe Blanchard, Ludwig Streit. 
Math. & Its Applic., V. 53. Kluwer Academic, 1990, 
xili + 402 pp, $129. [ISBN: 0-7923-0894-8] Twenty- 
two papers—revised versions of papers given in 1985 
at Bielefeld. On three general areas: geometric struc- 
ture of diffusion processes, probabilistic problems 
from quantum physics, and the interaction between 
probability theory and statistical mechanics. TAV 


Stochastic Processes, P. Lecture Notes in 
Mathematics-1444: Stochastic Analysis and Related 
Topics II. Eds: H. Korezlioglu, A.S. Ustunel. 
Springer-Verlag, 1990, 268 pp, $29 (P). (ISBN: 0- 
387-53064-9] Proceedings of a workshop in Silivri, 
Turkey, July 1988. Major papers on asymptotic 
problems in Weiner integration and applications of 
anticipating stochastic calculus to stochastic differ- 
ential equations. TAV 


Stochastic Processes, P. Limit Theorems on Large 
Deviations for Markov Stochastic Processes. A.D. 
Wentzell. Math. & Its Applic., V. 38. Kluwer Aca- 
demic, 1990, xv + 176 pp, $79. (ISBN: 0-7923-0143- 
9} A clean translation of the 1986 Russian origi- 
nal edition. After preliminaries, the author inves- 
tigates the action functional for Markov processes, 
precise asymptotics, and probabilities of large devi- 
ations. TAV 


Elementary Statistics, T(13-14: 2). Statistics 
for Business and Economics. Debra Olson Oltman, 
James R. Lackritz. Brooks/Cole, 1991, xxii + 984 
pp, $45. (ISBN: 0-534-14430-6] Introduction to de- 
scriptive and inferential statistics with some prob- 
ability. Chapters on ANOVA, regression, nonpara- 
metric statistics, time series, index numbers, and de- 
cision making. Includes MINITAB as well as other 
computer output. Real danger students will lose 
sight of important concepts because of the encyclo- 
pedic listing of terminology and formulas. Some 
of the figures and tables are extremely difficult to 
read. RWJ 


Computational Statistics, T(16), S, P. Mod- 
ern Statistical, Systems, and GPSS Simulation: 
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The First Course. Zaven A. Karian, Edward J. 
Dudewicz. WH Freeman, 1991, xvi + 468 pp, $56.95. 
[ISBN: 0-7167-8232-4] Introduces theory and im- 
plementation of discrete-event simulation. Addresses 
theoretical basis for simulation, how long to run a 
simulation, and how to analyze output for experi- 
mental goals and integrates these with GPSS, a sim- 
ulation language. Written at an undergraduate prob- 
ability level and contains a variety of examples from 
business, time-shared computer analysis, and emer- 
gency medical systems. Handy reference for standard 
simulation techniques. MK 


Computational Statistics, $(13-16), C. Statis- 
tics with Stata. Lawrence C. Hamilton. Brooks Cole, 
1990, vii + 171 pp, $31.95 (P). [ISBN: 0-534-12850- 
5] Supplementary text for using STATA in class- 
room setting. Runs on MS-DOS and UNIX plat- 
forms. Graphics capabilities are excellent for in- 
tended level. Although not a do-all package, covers 
a lot of ground: EDA methods, ANOVA, regression, 
logistic and robust regression, diagnostics (includ- 
ing Cook’s distance, DFFITS, VIF, added-variable 
plots, etc.), survival analysis, non-parametrics, time 
series, and more. Easy for students to learn; nice 
for comparison with more elaborate, research-level 
packages. MK 


Statistics, T(18), P. Order Statistics and Infer- 
ence: Estimation Methods. N. Balakrishnan, A. Clif- 
ford Cohen. Stat. Model. & Decision Sci. Aca- 
demic Pr, 1991, xix + 377 pp, $79.95. (ISBN: 0-12- 
076948-4] Consolidates developments in mathemat- 
ical methods based on order statistics. Full discus- 
sion of moment and maximum-likelihood estimation, 
with emphasis on linear unbiased and optimal linear 
estimation based on complete or censored samples. 
Emphasis is on theory although nearly all discussion 
is illustrated with brief examples. Includes an ency- 
clopedic bibliography. MK 

Statistics, P. Monotonicity Properties of Infinitely 
Divisible Distributions. B.G. Hansen. CWI Tract, V. 
69. Mathematisch Centrum, 1990, ii + 101 pp, Df. 
38.50 (P). [ISBN: 90-6196-380-X] “Studies the rela- 
tionship between an infinitely divisible distribution 
(i.e., solution to a generalized central limit problem) 
and its Lévy spectral function.” BH 


Statistics, T(14-17: 1, 2). Statistics for Research, 
Second Edition. Shirley Dowdy, Stanley Wearden. 
Ser. in Prob. & Math. Stat. Wiley, 1991, xvii + 
629 pp, $49.95. [ISBN: 0-471-85703-3] Revision of 
the authors’ 1983 First Edition (TR, January 1984). 
Significant change is the addition of sections on com- 
puter usage in each chapter, with techniques illus- 
trated by SAS commands and output. Another new 
feature is the inclusion of nonparametric procedures 
as analogs to previously discussed parametric tests. 
The chapter on multiple regression has also been 
rewritten to emphasize modern computer output. 


RSK 


Statistics, T(18: 1), P, L. Nonparametric Func- 
tion Estimation, Modeling, and Simulation. James 
R. Thompson, Richard A. Tapia. SIAM, 1990, xvi 
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+ 304 pp, $32.50 (P). [ISBN: 0-89871-261-0] Gives 
a survey of techniques for one-dimensional nonpara- 
metric density estimation. The harder problem 
of multi-dimensional estimation is also addressed. 
Model building and mathematical optimization are 
also each given one chapter. No exercises. SM 


Programming, S(14-16), P, L. A C++ Toolkit. 
Jonathan Shapiro. Ser. in Innovative Technology. 
Prentice-Hall, 1991, xiii + 231 pp, (P). (ISBN: 0-13- 
127663-8] A handbook for developing C++ object 
classes that may be used and reused in applications. 
The author illustrated principles for writing reusable 
classes with his own example classes for standard 
data structures. Introduction, including a reader’s 
guide to C++; presentation of example classes; stor- 
age Management and performance tuning; code list- 
ings. Thought-oriented exercises. RB 


Programming, T(13-14), S, C, P, L. Program- 
ming tn MacScheme. (Macintosh Software.) Michael 
Fisenberg, et al. MIT Pr, 1990, xviii + 388 pp, 
$32.50 (P). [ISBN: 0-262-55018-0]; MacScheme Man- 
ual and Software. MIT Pr, 1990, 194 pp, $37.50 
(P). [ISBN: 0-262-62077-4] A manual, a Macintosh 
student disk, and an introductory text for Scheme, 
a simple, learnable dialect of Lisp that was devel- 
oped in part to provide a basis for teaching introduc- 
tory computer science. Text spirals among projects, 
language structure, and debugging; final few chap- 
ters emphasize procedures as first class objects—the 
“mantra” of Scheme. Appendices discuss the Mac- 
Scheme interface, offer solutions to selected exercises, 
references, and several indices. The Manual begins 
with a brief overview of the language, the compiler, 
and the debugger; the major section is a detailed 
reference manual for the language. Appendices dis- 
cuss “Easy Graphics,” an extension built into Mac- 
Scheme to take advantage of the Macintosh graphics 
capability. Includes a complete list of error mes- 
sages, and other reference lists. The MacScheme 
disk includes several sample programs and special 
procedures required to make MacScheme compatible 
with other versions (e.g., Abelson & Sussman’s well- 
known text Structure and Interpretation of Com- 
puter Programs). Requires a Mac Plus, SE, or II 
with at least 1M memory. LAS 


Programming, S(14), P. Designing Screen Inter- 
faces in C. James L. Pinson. Yourdon Pr (Distr: 
Prentice-Hall), 1991, xvi + 267 pp, (P). [ISBN: 0- 
13-201583-8] For the experienced C programmer. 
Shows how to create windowing, pull-down menus, 
help screens, data input screens, point-and-shoot file 
selection, and other features to enhance software for 
IBM PCs and clones. Modular. Includes much 
highly-commented source code. DFA 


Languages, P, L**. C: A Reference Manual, Third 
Edition. Samuel P. Harbison, Guy L. Steele, Jr. 
Prentice-Hall, 1991, viii + 392 pp, (P). (ISBN: 0- 
13-110933-2] An essential book for serious C pro- 
grammers. This edition incorporates ANSI C into 
the main text and provides detailed advice on writ- 
ing programs that are portable between ANSI C and 
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traditional C compilers. Also new in this edition are 
a collection of study exercises (with solutions). AO 


Languages, P, L. The Definition of Standard ML. 
Robin Milner, Mads Tofte, Robert Harper. MIT Pr, 
1990, xi + 101 pp, $15 (P). [ISBN: 0-262-63132-6] 
The official, formal definition of the programming 
language ML, a relatively new functional program- 
ming language that is strongly typed and also poly- 
morphic. It also provides imperative language con- 
structs, parametric modules, and a sophisticated ex- 
ception mechanism. AO 

Languages, T(15-17: 1), S, L. Programming 
Linguistics. David Gelernter, Suresh Jagannathan. 
MIT Pr, 1990, xx + 411 pp, $37.50. (ISBN: 0-° 
262-07127-4] Designed to be used as a textbook for 
courses in the design and evolution of programming 
languages. Emphasizes the importance of aesthet- 
ics and elegance in the design of programming lan- 
guages. An interesting and innovative text that de- 
serves serious considerations. AO 


Algorithms, P. Proceedings of the Second Annual 
ACM-SIAM Symposium on Discrete Algorithms. 
Chair: Alok Aggarwal, et al. SIAM, 1991, 482 pp, 
$45 (P). [ISBN: 0-89871-271-8] 53 research papers, 
primarily on graph algorithms and computational ge- 
ometry, selected for and presented at a discrete algo- 
rithms conference, January 1991. JPH 


Algorithms, T*(15-16: 1), L. Algorithms and 
Data Structures: Design, Correctness, Analysis. Jef- 
frey H. Kingston. Intern. Comput. Sci. Ser. Addi- 
son-Wesley, 1990, ix + 315 pp. [ISBN: 0-201-41705- 
7] <A textbook for a data structures and algorithm 
analysis course that emphasizes depth over breadth. 
Emphasizes the use of abstract data types, proofs of 
correctness, and appropriate algorithm design tech- 
niques. Unusual features of the book include its 
treatment of amortized complexity analysis, splay 
trees, and Fibonacci heaps. Example programs are 
presented in Modula-2. AO 


Computer Systems, P. An Architecture for Com- 
binator Graph Reduction. Philip John Koopman, 
Jr. Academic Pr, 1990, xv + 155 pp, $34.95. 
(ISBN: 0-12-419240-8] “Traditionally, it has been 
assumed that advanced programming languages [in- 
cluding functional languages] require radically dif- 
ferent, non-von Neumann architectures for efficient 
execution. This book explores mapping functional 
programming languages onto conventional architec- 
tures using a combination of techniques from the 
fields of computer architecture and implementation 
of advanced programming languages [in particular, 
combinator graph reduction].” RB 

Computer Systems, C, P. Leo: A Technical Ed- 
itor. (IBM PC Software.) ABK Software (4495 
Ottawa Place, Boulder, CO 80303). User Manual, 
1990, 136 pp, (P). A text editor for IBM PC sys- 
tems with a graphics display designed as a front end 
for TeX. Menus and special keys are used to provide 
special features (font changes, accents, math-mode) 
which are displayed (or approximated) directly on 
the screen as they will be when printed. The file 
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created by Leo is a TpX file, and can be input di- 
rectly into TX. Or it can be printed by a pseudo-TpX 
processor Leopr which approximates most important 
features of TX. Leo handles only a subset of TeX in 
a way that hides the TeX input code from the user. 
Very thorough User Manual. LAS 


Computer Systems, C, P. X Window System, 
User’s Guide, OSF/Motif Edition, Volume Three. 
Valerie Quercia, Tim O'Reilly. O’Reilly & Assoc, 
1990, xxvi + 709 pp, $30 (P). [ISBN: 0-93175-61-7] 
Complete manual for the X Window System, Version 
11, Release 4. Assumes X is installed and running 
and that you are running UNIX, although the UNIX 
dependencies are slight. Includes introduction and 
how to get started, and progresses to discussions of 
the mwm window manager, font specification, graphics 
utilities, other clients, and customizing X. MK 


Computer Systems, C, P. X Toolkit Intrinsics 
Programming Manual: OSF/Motif 1.1 Edition for 
X11, Release 4, Volume 4. Adrian Nye, Tim 
O'Reilly. O’Reilly & Assoc, 1990, xxxili + 632 pp, 
$30 (P). [ISBN: 0-937175-62-5] Manual for writing 
X Window System programs using the Xt Intrinsics 
Library. Divided into two volumes, this volume (IV) 
provides explanation of the toolkit. Includes tutorial 
and many task-oriented examples. This is a compan- 
ion to Volume V, X Toolkit Reference Manual. In- 
cludes discussion of widget sets, instance hierarchies, 
creating basic menus and dialog boxes, C graphics 
models, resource management, interclient communi- 
cations, and more. MK 


Computer Systems, S(13-16). Introducing Math- 
ematica. Stephen Wolfram. Science Television (Box 
2498, Times Square Station, New York, NY 10108), 
1989, 60 minute videotape, $49. (ISBN: 1-878310- 
01-1] An hour-long videotape of Stephen Wolfram 
talking (casually and informally) into the camera 
while typing Mathematicaat a keyboard. Split screen 
video shows both Wolfram’s lecture and Mathemat- 
ica’s screen display. In three parts: quick overview 
of numerical, symbolic, and graphical features; de- 
tailed examination of each of these features; an in- 
troduction to extensions (e.g., programming, appli- 
cation packages, TgX and Fortran outputs). Includes 
“animation” of surface graphs changing in response 
to a control parameter. Illustrates all key features of 
Mathematica; excellent introduction for those with- 
out sufficient computer experience or resources to use 
the actual package. LAS 


Computer Systems, L*. sed & awk. Dale 
Dougherty. O’Reilly & Assoc, 1990, xvii + 394 pp, 
$24.95 (P). [ISBN: 0-937175-59-5] A detailed in- 
troduction for beginners to two UNIX text process- 
ing tools: the stream editor sed and the pattern- 
recognition language awk (named for its authors 
Aho, Weinberger, Kernighan). Both tools permit 
systematic transformations (permutations, substitu- 
tions, rearrangements) of strings of words or data, 
thus serving as useful pre-processors for other pack- 
ages (e.g., statistical software, text formatters) and 
as programming tools of their own. Includes two 
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large examples—an interactive spell checker and an 
indexing program—plus scripts (i.e., programs) sub- 
mitted by colleagues to illustrate the diverse power 
of these tools. LAS 


Computer Systems, P. SAA: A Guide to Im- 
plementing IBM’s Systems Application Architecture. 
Jerrold M. Grochow. Yourdon Pr (Distr: Prentice- 
Hall), xxvi + 385 pp. [ISBN: 0-13-785759-4] IBM 
introduced SAA in 1987 as a comprehensive frame- 
work for defining “IBM compatibility.” Computing 
applications that adhere to SAA provide standard 
user interfaces, share programming tools and soft- 
ware development strategies, and can communicate 
among diverse IBM computer systems. This guide 
presents the architecture for managerial and techni- 
cal personnel in industry. RB 


Computer Science, P. Realistic Compiler Gener- 
ation. Peter Lee. Found. of Computing Ser. MIT 
Pr, 1989, xv + 246 pp, $32.50. [ISBN: 0-262-12141-7] 
Presents a survey of semantics-based compiler gener- 
ation and a new method (“high-level semantics”) of 
describing programming language semantics. In con- 
trast to previous methods, the new method makes 
the automatic generation of efficient compilers pos- 
sible. Based on the author’s Ph.D. thesis. AO 


Computer Science, P. Object-Oriented Databases 
with Applications to CASE, Networks, and VLSI 
CAD. Eds: Rajiv Gupta, Ellis Horowitz. Prentice- 
Hall, 1991, xiv + 447 pp. [ISBN: 0-13-629833-8] 
Object-oriented database systems (“OODBs”) have 
recently appeared in response to conventional rela- 
tional database system limitations in applications 
such as computer-aided design (CAD), manufacture 
(CAM), software engineering (CASE). This collec- 
tion of twenty-one articles introduces object-oriented 
concepts, surveys existing OODBs, presents real- 
world application examples, and includes a brief 
overview of C++ language and database enhance- 
ments of C++. RB 


Applications, P. Proceedings of the Fourth Euro- 
pean Conference on Mathematics in Industry. Eds: 
Hansjorg Wacker, Walter Zulehner. European Con- 
sort. for Math. in Industry, V. 6. Kluwer Academic, 
1991, x + 433 pp, $148. (ISBN: 0-7923-1036-5] Pro- 
ceedings of the Fourth ECMI Conference on Indus- 
trial Mathematics held at Strobl in Austria, May 29- 
June 2, 1989. Articles in following sections: invited 
lectures (includes models for crystallization of poly- 
mers, information technology, trends in higher edu- 
cation, thermal behavior of thyristors with cooling 
devices), steel processing, chemical engineering, as 
well as a wide variety of contributed lectures. MK 


Applications, P, L*. The Ubiquity of Chaos. Ed: 
Saul Krasner. AAAS, 1990, viii + 247 pp, (P). [ISBN: 
0-87168-350-4] Nineteen papers on diverse applica- 
tions of chaos that have been presented at AAAS 
annual meetings over the past several years. Topics 
include brain waves, quantum chaos, business cycles, 
childhood epidemics, stellar variability, and inter- 
national security—a stunning array of applications, 
united in a helpful common index, that documents 


AT2 


the “ubiquity” of chaos. LAS 


Applications (Engineering), T(18), P. Stochas- 
tic Finite Elements: A Spectral Approach. Roger 
G. Ghanem, Pol D. Spanos. Springer-Verlag, 1991, 
x + 214 pp, $49.50. [ISBN: 0-387-97456-3] With 
applications to mechanical engineering in mind, this 
book extends the well-known method of finite ele- 
ments to problems involving uncertainty. Primarily 
theoretical in nature, the book concludes with three 
examples. SM 

Applications (Physical Science), P. Applied and 
Industrial Mathematics, Venice-1, 1989. Ed: Re- 
nato Spigler. Math. & Its Applic., V. 56. Kluwer 
Academic, 1991, xiii + 374 pp, $136. (ISBN: 0-7923- 
0521-3] Thirty-one papers from a 1989 conference 
on industrial-strength mathematics including fluid- 
flow modelling, nonlinear waves, propagation in ran- 
dom media, and transport phenomena. BC 
Applications (Physical Science), S(16-17), P, 
L*. Crystalline Symmetries: An Informal Math- 
ematical Introduction. Marjorie Senechal. Adam 
Hilger, 1990, xi + 137 pp, $39. [ISBN: 0-7503-0041-8] 
An excellent, up-to-date introduction to the mathe- 
matical basis of crystallography, from Platonic solids 
to Penrose tiles. Ideal undergraduate seminar mate- 
rial. BC 

Applications (Physics), S(17-18), P. Electro- 
Diffusion of Ions. Isaak Rubinstein. Stud. in Appl. 
Math., V. 11. SIAM, 1990, ix + 254 pp, $44.75. 
[ISBN: 0-89871-245-9] An innovative approach that 
allows the unified treatment of such diverse topics 
as electrolyte solutions, ion-exchangers, ion-selective 
membranes, and semiconductors. This book is in- 
tended to attract applied mathematicians to a re- 
markably under-developed area. MU 

Applications (Physics), T(17-18: 1), S, P, L. 
Direct and Inverse Problems: Potentials in Quan- 
tum Scattering. Boris N. Zakhariev, Allina A. Suzko. 
Transl: G. Pontecorvo. Springer-Verlag, 1990, xiii + 
223 pp, $32 (P). [ISBN: 0-387-52484-3] With the in- 
clusion of new material concerning exactly solvable 
models, this fine translation supercedes the Russian 
original. Part I—One-Dimensional, One-Channel 
Systems—provides a brief but careful introduction 
culminating in the Levinson Theorem. Part II is 
devoted to multi-channel problems. I]lustrations, 
exercises, and a lengthy bibliography are also pro- 
vided. MU 


Applications (Physics), P. Random Matrices, Re- 
vised and Enlarged Second Edition. Madan Lal 
Mehta. Academic Pr, 1991, xviii + 562 pp, $99.50. 
[ISBN: 0-12-488051-7] Random matrices are used 
in this volume as a tool for statistical quantum 
mechanics—as a means of computing statistical 
properties of chaotic systems, which are far more 
common than the integrable systems studied in or- 
dinary texts. Major focus is on Gaussian ensembles, 
especially on evidence for a “central limit” conjecture 
that the statistical properties of a few eigenvalues of 
a large random matrix are independent of the proba- 
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bility distribution of the elements of the matrix. This 
conjecture, backed by extensive numerical evidence, 
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Magic Squares and Linear Algebra 


CHRISTOPHER J. HENRICH, Concurrent Computer Corporation, Tinton Falls, NJ 07724 


CHRISTOPHER J. HENRICH: After receiving my doctorate at Harvard in 1968, 
I taught at Columbia and at Fordham. Since 1975, I have been working for 
Concurrent Computer Corporation (which was formerly Interdata, and until 
1985 a subsidiary of the Perkin-Elmer Corporation). My work has been 
concentrated in compilers and mathematical software; I am now a member 
of the Ada compiler development team. My research interests have in- 
cluded partial differential equations and the representation theory of com- 
pact Lie groups. 


1. Introduction. A magic square of order n is an arrangement of the numbers 
{1,...,n’} in an n Xn square, so that each row, column, or diagonal adds up to 
the same sum S. Because nS = 1 + --- +n’, we have 


S = n(n? + 1)/2. (1.1) 


Magic squares are a perennial topic of recreational mathematics. They tantalize us 
with a suggestion of hidden pattern. With a few concepts from linear algebra, we 
can find a surprisingly simple structure that underlies many magic squares, 
including some famous examples. We shall use finite vector spaces; in this paper, 
we only use finite-dimensional vector spaces over the field F, with two elements. 
Familiarity with the first chapter of [3] should be sufficient to enable the student to 
understand what an affine magic square is and how to construct one. 

We shall start by examining the square of order 4 that appears in Durer’s 
engraving Melancolia. (See, for instance, [1, p. 147].) This will lead us to the 
description of the family of affine magic squares of order 4. All the magic squares 
of order 4 are known, and we shall see what portion of them are affine. We briefly 
discuss two squares found by Benjamin Franklin, to show how the theory extends 
beyond squares of order 4. 


2. A close look at Diirer’s square. Consider a four by four square. Number the 
rows and columns, starting with 0 and using base-2 notation. The row and column 
numbers then label the positions in the square uniquely; a label contains four 
digits, which may be 0 or 1. (See Ficure 1.) 

Now let us forget that these 0’s and 1’s came from positional notation for 
integers, and regard them as values in F,. Thus we have identified the cells of the 
square with the elements of F). 


0001 0010 


0101 0110 
1001 1010 
1101 1110 


Fic. 1 


481 
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We shall be interested in subspaces of F>. A notation for the vector subspace 
spanned by several vectors will be that list of vectors, enclosed in angle brackets. 
We will use the common notation {e,, e,, €3, €,} for the natural basis of F;. Our 
motive for looking at subspaces is that they correspond to features of interest in 
the study of magic squares. For instance, the first row is (e3,e,), and the affine 
subspaces parallel to this correspond to the other rows. Similarly, the columns of 
the square are of the form x + (e,, e,) for suitable choices of x in F;. The main 
diagonal is (e, + e3,e, + e,); the affine spaces parallel to this are the other 
diagonal and two broken diagonals, represented in FIGURE 2. Some magic squares, 
called quasi-pandiagonal, are magic for the broken diagonals of FIGURE 2 but not 
for those of FIGURE 3. The latter are another parallel family of affine spaces. The 
reader should look at some of the other two-dimensional subspaces of F;, and 
their parallel families of affine subspaces. 

Now let us consider Durer’s square; it is transcribed here in FiGur_E 4. 


Fic. 4 
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Fic. 5 


Some of the four-dimensional order hidden in this square comes to light if we 
look at the parity of the values in its cells, in Figure 5. 

This, in a humble way, also has the magic property: in every row, column, and 
diagonal the sum is 2. It is also related to the vector space structure of F>. In fact, 
if the 0’s and 1’s are understood as belonging to F,, then Ficure 5 is a linear 
function from F; to F,. We need some more notation; we shall let {f,, f,, f;, f4} be 
the dual basis to {e,, e€,, e3, e4}, so that f(e,) is 1 if i =j and 0 otherwise. Then 
FIGURE 5 displays 

fi tfet fa. 

Because this is a linear function, on any vector subspace of FS it either is 
constant (equal to 0) or takes the values 0 and 1 equally often. If the latter is true 
on a vector subspace, then it is equally true on any parallel affine subspace. This 
function is non-constant on the subspaces corresponding to lines, columns, and 
diagonals of the square; therefore on each of these affine spaces it takes the values 
0 and 1 equally often. Here is a general explanation of why FiGurRE 5 has the magic 
property. It ts worth restating in slightly greater generality: 


Proposition 1. Let wp be a linear function on F; with values in F,; let E be a 
linear subspace of F; which is not contained in the null-space of w. Then, on any 
affine subspace parallel to KE, the function wp takes the values 0 and 1 equally often. 


Can similar reasoning account for all the magic in Diirer’s square? We can look 
beyond the parity of the numbers in the square by representing them in base 2. If 
we try this, we find that it is tidier to subtract 1 from each number first. Then they 
range from 0 to 15, and their base-2 representations are 0000 through 1111. 
FiGuRE 6 shows the result. 


1111 ; 0010 
0100 1001 


1000 ‘0101 
0011 1110 


Fic. 6 


The respective binary digits in each cell are easier to understand if they are written 
in separate squares, as in FiGuRE 7, and regarded as taking values in F,. (We shall 
trust the reader to determine from context when a 0 or 1 is intended to belong in 
F, and when it belongs in Z.) Each digit position determines a function of 
positions in the square. They are labelled V,, j = 1,...,4, so that the correspond- 
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1001 1001 1100 1010 

V;; 0110 Vz: 1001 Vz; 0011 Vz: 0101 

1001 0110 0011 0101 

0110 0110 1100 1010 
Fic. 7 


ing number in FiGureE 4 is 
W(x) = 8V,(x) + 4V,(x) + 2V3,(x) + V(x) + 1. (2.2) 
The functions in FiGurRE 7 turn out to be linear functions, modified by a 
constant term; that is affine functions. Specifically, 


Vi = iD +f +f,+1 
Vi=fi +f3 +f, +1 
V3=fh +f +f; + 1 (2.3) 


Vz=hfi +f, +fy+1 


They are given in tabular form in FIGuRE 8. 


Fia. 8 


Now we can see, very clearly, why Diirer’s square has to have the magic 
properties that it does. Each of the value functions is non-constant on the 
subspaces that correspond to rows, columns, and diagonals. On any affine sub- 
space E of F;, an affine function W that is not constant must, as we have seen, 
take the values 0 and 1 equally often. Thus each of the V,, takes the value 1 on 
exactly two elements in each row, or in each column, or in each diagonal. This fact 
accounts for the uniformity of the sums of W(x) over rows, columns, and 
diagonals. The numbers W(x) include all of {1,...,16} with no duplications, 
because the linear parts of the.value functions are linearly independent. 

It is natural to generalize the properties of Diirer’s square. At the same time, 
we can summarize our discoveries in a series of easy propositions. 


DEFINITION 1. If V;, j = 1,...,4 are affine functions on FS with values in F,, 
then the map W from F; to Z given by equation (2.2) is an affine square. 


In discussion of a particular affine square W, we shall denote the linear part of 
V, by w.. 
J J 


DEFINITION 2. Let V,,...,V, be as in definition 1, and let W be the affine square 
determined by them; let E be a linear subspace of F;. Then E is magic for W if each 
of the linear functions w, is non-zero on E. 
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Proposition 2. Let V,,...,V, be as in Definition 1, and let E be a linear 
subspace of F; which is magic for the affine square W determined by the V;. Then W 
has uniform sums on the affine subspaces parallel to E. 


Proof. Let E have dimension d. By Proposition 1, each of the functions yw, takes 
the values 0 and 1 each 2%~! times on every affine space parallel to E. Clearly the 
same is true of V,. Therefore the sum (in Z) of V; over every such affine subspace 
is 2¢—!; the conclusion follows at once in view of equation (2.2). 


DEFINITION 3. Let V,,...,V, be as in Definition 1, and let W be the affine square 
determined by the V,;. Then W is nonsingular if ,,..., 4 are linearly independent. 


PRoposiTION 3. Let V,,...,V, be as in Definition 1, and suppose that the affine 
square W determined by the V, is non-singular. Then the numbers in W are exactly 
{1,..., 16} with no omissions or duplications. 


Proof. Because 1 < W(x) < 16 for x € FJ, and there are 16 points in F5, it is 
sufficient to show that no two of them have the same value of W. Now suppose x 
and y are members of Fy and V,(x) = V(y), for j = 1,...,4. Then we have 
w(x) = wy) for all j. But by hypothesis the w; are linearly independent, and 
therefore span F}*. It follows that x = y. 


DEFINITION 4. An affine magic square is an affine square which is nonsingular, 
and for which the subspaces (e,, >), (€3,€4), and (e, + €5,e,; + e4) are magic. 


PROPOSITION 4. Every affine magic square is a “‘pandiagonal” or “‘quasi-pandiag- 
Dp 
onal’? magic square. 


Proof. This is just a combination of Propositions 2 and 3. 


3. How many affine magic squares exist? To answer this question, we must 
look closely at affine functions on F;. Their linear parts are the most important 
consideration. 

For a linear function to be non-constant on the prescribed subspaces, it must 
satisfy certain constraints. Because the “row” subspace is magic, it must be 
nonzero on either e, or e,. Likewise, because the “column” subspace is magic, it 
must be nonzero on either e, or e,. Finally, for the “diagonal” subspace to be 
magic, it must be nonzero on either e, + e, or e, + e,. Six linear functions satisfy 
these constraints: 


L,=fatfs Lo=fitftfas La=fi the tha; 
La=fi,t+ht+fas Le=fithis Le=fhith + fs. 

To determine an affine magic square, we must select a set of four functions 
among which there is no linear relation. Accordingly, we look for linear relations 
among the six eligible functions. A relation among three functions, f + f’ + f” = 0, 
is equivalent to the equation f + f’ = f”. There are two such relations: 

L,+L,+L,.=0. 
A relation among four functions can be expressed as f + f’ = g + g’. However, 
one finds that a nonzero function that is not eligible can be expressed as a sum of 


two eligible functions in only one way. Therefore there are no four-term relations 
among the eligible functions. 
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From six eligible functions, a set of four can be selected in 15 ways. Each of the 
three-term relations excludes 3 of these sets. Thus there remain 9 linearly indepen- 
dent sets of eligible linear functions. Now a set of linear functions determines 16 
sets of four affine functions, because each linear function can be given a constant 
term of 0 or 1. And each such set can be arranged in 4! = 24 ways. Thus the 
number of affine magic squares of order 4 is 9- 16 - 24 = 3456. If we take the 8 
symmetries of the geometrical square into account, the number of distinct affine 
magic squares is 3456/8 = 432. 

A complete enumeration of the magic squares of order 4 has been known since 
the seventeenth century, when it was found by Bernard Frénicle de Bessey. Bensen 
and Jacoby [2] checked his work using a computer. A further detailed study, with 
analysis and proofs, is found in [4]. In this census, two squares are identified if they 
differ only by one of the 8 symmetries of a geometrical square. There are 880 
distinct magic squares; 48 are pandiagonal, and another 384 are quasi-pandiagonal. 
Thus we have a converse to Proposition 4. 


PRoposiTIOn 5. Every “pandiagonal” or “quasi-pandiagonal”’ magic square of 
order 4 is an affine magic square. 


4. Benjamin Franklin’s examples. The theory and notation of section 2 are 
easily extended to describe affine magic squares whose order is any power of 2. 
Examples of such squares have long been known. Benjamin Franklin constructed 
magic squares of order 8 and 16. An account of them is given in [1, pp. 89-112]. 
The square of order 8 is transcribed as FiGurE 9. If we subtract 1 from each 
number and convert to base 2, we get FIGURE 10, which is analogous to FIGURE 6. 


Fic. 9 
000 001 010 011 100 101 110 111 
000 110011 111100 000011 001100 010011 011100 100011 101100 
001 001101 000010 111101 110010 101101 100010 011101 010010 
010 110100 111011 000100 001011 010100 011011 100100 101011 
011 001010 000101 111010 110101 101010 100101 011010 010101 
100 110110 111001 000110 001001 010110 011001 100110 101001 
101 001000 000111 111000 110111 101000 100111 011000 010111 
110 110001 111110 000001 001110 010001 011110 100001 101110 
111 001111 000000 111111 110000 101111 100000 011111 010000 


Fic. 10 
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Fic. 11 


We can check that the square is affine; the functions which determine it are 
tabulated in Ficure 11. Several subspaces of dimension 2 and 3 in F? are magic 
for this square. Among the 2-dimensional such subspaces are these: 


(€3, €6)5 

(@ + €3, €g)} 
(€3,€5 + €,); 

(é, + €3,@5 + e). 


One can check that every contiguous 2-by-2 block in the square corresponds to an 
affine space parallel to one of these subspaces. 

Franklin observed that every bent row of 8 numbers ascending and descending 
diagonally totals 260, for example the row from 52 descending to 54 and from 43 
ascending to 45. This is the image of the vector subspace 


(e, + €6,€, + €5,€, +e, + ey), 


which is magic for this square. This fact accounts for only 4 of the 10 bent rows. 
What explains the magic of the row from 14 descending to 10, and from 23 


200 217 232 249 8 25 40 57 72 89 104 121 136 153 168 185 
58 39 26 7 250 31 218 199 186 167 154 135 122 103 90 71 
198 219 230 251 6 27 38 59 70 91 102 123 134 155 166 187 
60 37 28 5 252 229 220 197 188 165 156 133 124 101 92 = 69 
201 216 233 248 9 24 41 56 73 88 105 120 137 152 169 184 
55 42 23 10 247 234 215 202 183 170 151 138 119 106 87 74 
203 214 235 246 11 22 43 54 +75 86 107 #118 139 150 171 = 182 
53. 44 21 12 245 236 213, 204 181 172 149 140 117 108 85 76 
205 212 237 244 13 20 45 52 77 84 #109 116 141 148 #173 180 
51 46 19 14 243 238 211 206 179 174 147 #142 115 110 8&3 78 
207 210 239 242 15 18 47 SO 79 82 111 114 #143 146 175 178 
49 48 17 16 241 240 209 208 177 #176 145 144 113 112 81 —= 80 
196 221 228 253 4 29 36 61 68 93 100 125 132 157 164 189 
62 35 30 3 254 227 222 195 190 163 158 131 126 99 94 67 
194 223 226 255 2 31 34 63 66 95 98 127 130 159 162 191 
64 33 32 1 256 225 224 193 192 161 160 129 128 97 96 65 


Fic. 12 
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0 
0 
0 
0 
1 
1 
1 
1 


CoOrocococ.cd hm 
oF FP Fe Se eS S| 
coo coclchcCOlUCUCcClUcCOUlUcCOlUrR 
Coo coc CcocCcUcrWCcClUCcCUrR he 
ee 
=—= === = = CO OC © 
-=- F&F FP COO CO KH Re 
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ascending to 19? This is not an affine subspace; but it is the union of two affine 
subspaces. Indeed, the cells 14 + 60 + 37 + 19 lie in an affine subspace parallel to 


(e, +e, + &,e, + €5 + &,); 
and the cells 59 + 10 + 23 + 38 lie in an affine space parallel to 
Ce, te, +e; + e,e4 + es + &,). 


Both subspaces are magic for Franklin’s square. 

Franklin’s square of order 16 is reproduced in FiGure 12, and the affine 
functions which generate it are tabulated in FiGurE 13. Franklin was justly proud 
of this square, calling it “the most magically magical of any magic square ever 
made by any magician.” [1 p. 93] If you explore the affine subspaces of F? which 
are magic for this square, you may agree that he was not exaggerating. 
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An Analytical Description of Some Simple Cases 
of Chaotic Behaviour 


JAMES V. WHITTAKER 
Department of Mathematics, University of British Columbia, Vancouver V6T 1W5 Canada 


1. Introduction. Considerable attention has been paid to the sets of points z, in 
the complex plane such that the n-th iterate f"(z,) of the function 


f(z) =2°— ph 

evaluated at the point z = z, should remain bounded as n tends to ~. We have 
only to mention the book [14] by Mandelbrot which has rendered this subject 
accessible to a much wider audience with its fine illustrations and graphical 
descriptions. A recent wave of activity in this area has generated, among others, 
the papers by Barnsley et al. [1, 2, 3, 4] and by Mandelbrot [15, 16]. A problem 
which is closely related to the one just described is to analyze the behaviour of the 
n-th iterate F”"(x,) of the function 


F(x) = Mx(1 -x) 


evaluated at the point x, in the interval 0 <x <1. The survey articles by 
Blanchard [5] and Whitley [18] and the book by Preston [17] provide a comprehen- 
sive overview of the problem for this, as well as for other more general classes of 
functions. The behaviour exhibited by the n-th iterates of f and F as n tends to 
in both of these situations has received the general appellation ‘chaos’. 

In the special cases » = 2 and M-=4 we shall derive explicit analytical 
formulas for the n-th iterates of the functions f and F which have received some 
attention in the book by Devaney [7, pp. 39-53]. As a result, we shall be able to 
describe exactly the sets of points z) such that f(z.) remains bounded as n tends 
to © on the interval —2 < z < 2, a fact that was already pointed out by Julia in his 
original paper [11, p. 186]. We shall then turn our attention to the orbits of the 
function F in the interval 0 < x < 1, that is to say, the sets of points F”(x,) as n 
runs over the positive integers. We shall show that the set of points x, is dense in 
[0, 1] in each of the following cases: the orbit of x, is periodic; the orbit of XQ is 
eventually absorbed into a fixed point; the orbit of x, is dense in [0, 1]. 

These results are then extended to certain polynomial functions of higher 
degree which are related to the Chebyshev polynomials, as well as to certain 
rational functions which are related to the duplication formulas of the circular and 
Jacobian elliptic functions. In this connection, the work of Lattés [12] should also 
be mentioned. The author is much indebted to Prof. Benoit B. Mandelbrot for 
helpful comments and references. 


2. An analytical formula. We shall take as our point of departure the function 
X = F(x) = 4x(1 -x) 


which maps the unit interval 0 < x < 1 onto itself. The close connection between 
this and the function f defined earlier is easily seen from the change of variable 
defined by 


z= ¢(x) =2- 4x, Z=6(X) =2- 4X, 
489 
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for the resulting function is simply 
Z=f(z) =2z*-2. 


Evidently Z = CX) = f(z) implies that ¢F = fd, and ¢ maps the interval [0, 1] 
onto the interval [—2, 2]. Furthermore, the function 


z=h(u) = 2cosu 


maps the interval 0 < u < 7 onto[-—2, 2]. If we now define the function g(u) = 2u, 
then an easy calculation will enable us to verify that 


fn(u) = (2cosu)* — 2 = 2(2cos? u — 1) = 2cos2u = hg(u), 


that is to say, fh(z) = hg(z) throughout their common domain. A simple iteration 
of this relation gives us 


f*h = ffh = fhg = hgg = hg’, 
as well as the general result f"h =hg” for every positive integer n. Since 


g"(u) = 2”u for each positive integer n, this latter result can then be stated 
analytically in the form 


f"(2cosu) = 2cos g”(u) = 2cos(2”u) 
for 0 < u <7. We Shall inscribe this result in the following theorem. 
THEOREM 1. If f(z) =z? — 2 for —2 <z < 2, then 
f"(2cos u) = 2cos(2"u) 
for each positive integer n and 0 <u <7. 
CoroOLuarRy. If F(x) = 4x(1 — x) for 0 <x <1, then 
F"(sin? u) = sin?(2"u) 
for each positive integer n and 0 <u < 7/2. 


Proof. Since the relation ¢F = fd implies F” = 6 'f", we have only to 
observe that if we put x = sin’ u, then we will have (x) = 2cos2u and 


1 — cos(2”2u) 
2 

Since the function f(z) = z? — 2 is analytic throughout the complex plane, it 

follows that the analytical formula of Theorem 1 which was verified on the 

segment 0 < u < 7 must remain valid for all complex values of u according to the 


principle of the permanence of analytical forms. Moreover, if we put u = v + iw 
and recall that 


F"(x) =@ 7 '(f"(2cos2u)) = @7'(2cos2”2u)) = = sin?(2"u). 


2 
lcos ul” = cos? v + sinh? w, 


we then have 
|cos(2”u) |? = cos*(2”v) + sinh?(2”w) 


for each positive integer n, from which it follows that if the imaginary part w of u 
is different from 0, then f”(2 cos u) must tend to ~ as n tends to ». Furthermore, 
the relation 


cos u = cos v cosh w — isin v sinh w 
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reminds us that the mapping U = 2cosu sends the line v = 7 to that part of the 
negative real axis defined by U < —2, and for such values of U it now follows that 
f"(U) will tend to © as n tends to ~, with the exception of the point U = —2. 
Likewise, the cosine mapping sends the line v = 0 to that part of the real axis 
defined by U > 2, and for such values of U it follows that f”(U) will tend to ~ as 
n tends to ©, with the exception of the point U = 2. Thus, the only points z, in the 
complex plane such that f”(z,) remains bounded as n runs over the positive 
integers are those in the interval [—2, 2]. This phenomenon was verified by Julia in 
his original paper [11, p. 186]. 

Most of the previous results can be carried over to certain higher degree 
polynomials f(z). We shall retain the function z = h(u) = 2cosu but now let 
g(u) = ku where k is some positive integer. Evidently 


hg(u) = 2cos ku = 2T,(cos u), 
where T, is the Chebyshev polynomial of degree k defined by 
T,(z) = cos(k arccos z). 
The leading coefficient of 7, is 2*-1\ so if we want a polynomial f, with leading 
coefficient 1, then we must put f,(z) = 27,(z/2). From this it readily follows that 
f,h(u) = 2T,(cos u) = 2cos ku = hg(u). 


The rest of the derivation now follows the path mapped out for the case k = 2 and 
leads to the result stated in the next theorem. A few of these polynomials f,(z) 
are listed below: 

z> — 3z, z+ — 477 +2, z® — 8z° + 20z* — 16z? + 2. 

THEOREM 2. If T, denotes the k-th Chebyshev polynomial and f,(z) = 2T,(z/2), 
then 

fi(2cos u) = 2cos(2”ku) 
for all positive integers n and all complex values of u. 

3. A dynamical model. Many authors such as Collet and Eckmann [8] and 
Whitley [18] have taken a different view of the problems described above and 
regarded the iterates f” of f as the various phases of a dynamic system. It is 
apparent from Theorem 1 that we can embed f” in a continuously evolving system 
f’ for all real values of the time ¢ according to the formula 

f'(2cos z) = 2cos(2'z). 

This relation can be cast into a mtore convenient form by means of the change of 
time scale defined by t = s/log 2, and our system is then governed by the formula 
f*(2cos z) = 2cos(e*z). 

If we put 

x(s) =2cos(ae*), y(s) = 2ae*sin(ae’),  z(s) = 2a*e** cos(ae*), (1) 

where a is some constant, then repeated differentiation yields the equations 


x'= -y, y =y+z, z’=2z-a’e**y. 


If we solve for a*e* in the relation z = a*e7*x and substitute into the expression 
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for z’, then we will arrive at the following autonomous system of nonlinear 
ordinary differential equations: 
yz 
x'= -y, y=yt+z, z= 2z—- —. (2) 
Of course, the autonomous system (2) can be reduced to the single third-order 
equation 


x'x x 
x" = 3x" + — -—- — - 2x’. 
x x 
Let us now examine an arbitrary trajectory (x(s), y(s), z(s)) of the system (2) 
with initial values 


x(0) =X, y(0) = Yo> z(0) — ZQ: 


We shall show that this trajectory lies on a rational surface S and spirals outward 
on S with increasing time s. Since the right hand sides of the equations in (2) are 
all continuously differentiable everywhere except when x = Q, it follows from the 
classical existence and uniqueness theorems for ordinary differential equations 
that the trajectory passing through (%9, yo, Z)) must be unique provided that 
X,) # 0. From 


d 
7G, + y*) =x'z +.x2z' + 2yy’ 
5 


yz 
—yz + x(22 — —| + 2y(y +z) 


= 2(xz + y’) 
we infer that 
2 S 
x(s)z(s) + (y(s))° = (X02 + yo)e. (3) 

Henceforth, we shall assume that x9, Z) > 0 and observe that 

dz xz'—z'  x(2z—-—yz/x) —2z(-y) Zz 

en a a 

ds x x x x 


which implies that 


x(s) Xo 
If we put 
Xo 
po = —(X9Zo + y6) 
Zo 


and eliminate e7° in our latest results, we will then find that 


Zz 1 
7 oa (2 +y*), 


This enables us to solve for z with the result that 


a a a (4) 
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This means that the trajectory must lie on the rational surface (4) in 3-space, and 
so the trajectory must be nowhere dense in 3-space. 
A general analytical solution of the system (2) can easily be obtained by 
substituting the expressions 
z= 70 25y, y= —-x’ 
Xo 


into (3) so as to get the single equation 


7a 7a 

0 0 
—er5x? + x! = —p’e5, 
XQ Xo 


which simplifies to the form 


x 
x2 4+ — 2e@- 25x12 = p. (5) 
<0 


A simple integration or direct substitution will easily verify that the solution to (5) 
with initial condition x(0) = xq is 


+ arccos — 


Z e -1 Xo 
x(s) = pcos ; 
Xo FE ee 


The complete solution of (2) now follows from 


[Zo [Zp e*-1 Xo 
y= -x'=,/— e*sin| ,/ — + arccos — ], 
Xo Xo Fb Le 
Zo [Z e?-1 Xo 
zZ=y'—-y = —e’** cos — + arccos — |. 
UXo Xo FE ee 


Thus, when s assumes such values that 


Ze —il Xo 
— + arccos — = 2n7T 
Xo -b la 


for some integer n, we will then have 


Zo 
x(s)=p,  y(s)=0, 2z(s) = ——e”’. 
XQ 
This means that if S denotes that part of the surface defined by (4) whose domain 
lies in the infinite strip x” < pw’, then trajectories on S can cross over the x-axis 
only when they lie either directly above the point (w, 0,0) or else directly below the 
point (—y,0,0). Since y, = 0 implies z) = 0 in (4), we must exclude all other 
initial values with y, = 0 besides the ones just mentioned, so the surface S$ ought 
to have an open slit above and below the segment joining the points (yw, 0,0) and 
(—p,0,0). At the same time we must adjoin the vertical halfline above (yu, 0,0) as 
well as the vertical halfline below (—, 0,0), so S will remain a connected surface 
but no longer simply connected. 
For any prescribed values of x, and pw, we shall choose y, and Z, so that 


Zo Xo XQ 
 =parccos—, yo = yu? — x2 arccos —, (6) 
UL bu 


Xo 
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where X,,Z, are required to have like signs and y, > 0. Our trajectory will 
accordingly assume the simpler form 


x 
x(s)=p cose arccos =), 
ph 


Xo , Xo 
y(s) = | arccos — Je* sin| e* arccos — }, (7) 
pe pe 


2 
Xo Xo 
_ 28 S 
Z\S)=p arccos —— e COs}| e arccos — |]. 


The presence of chaotic behaviour in these trajectories can be readily exhibited by 
introducing a time delay of log 2, where we see that 


x 
x(s + log2) =p cos(2e" arccos =| 
ph 


_ 2| >s *o\ _ _ 2 2 
u| 2 cos*| e* arccos 1 (x(s))" —p, 
p. p. 


Xo , Xo 
y(s + log2) = 2u|arccos — ]Je* sin{| 2e* arccos — 
pe pe 


Xo . Xo Xo 
4u| arccos — Je’ sin| e* arccos — | cos| e* arccos — 
fey pe fe 


4 
—x(s)y(s), 
pe 


Xo \* 5 Xo 
z(s + log2) = 4u| arccos — | e** cos|2e* arccos — 
fey fey 


2 
Xo Xo Xo 
2S 2 S - 2 S 
Le arccos —— é COS e€° arccos — — Sin’|e arccos — 


4 
7 (x(8)2(5) — (y(s))’). 


This means, in particular, that if we put f(x) = (2/w)x* — w and x(s) = x, then 
we will have x(s + log 2) = f(x), and further iteration of this relation clearly leads 
to 
x(s +nlog2) =f"(x) 
for every positive integer m and s > 0. Therefore, if time s is measured in units of 
log2, then the x-coordinates of our trajectory will exhibit the sort of chaotic 
behaviour that was discussed in earlier sections. 
Furthermore, we see that the mapping G from 3-space into itself defined by 


2 4 
G(x,y,z) = —x* — pw, —xy, —(xz — y’) 
ye wo? 


can be embedded in the continuously evolving system of trajectories (7) so long as 
the initial point (Xo, Yo, Zp) Satisfies the conditions of (6). As mw runs over the 
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positive real numbers, the nature of the mapping G will remain essentially 

unchanged, for we have 

2 (6 mw bo 2/{2u 4 4 pw 

—6(52.59.52] — —-x* — w, ——xy, — —(xz — y”) 
fey wp 4 p 4 


w-\2~2"°2 
(x? — 2,2xy, 2(xz — y?)). 


Evidently the introduction into (7) of a time delay equal to log k for some positive 
integer kA would result in a mapping G from 3-space into itself that was defined in 
terms of the Chebyshev polynomials. 


4. Orbits. The behaviour of the iterates f"(2cos u) for 0 <u <7 of f(z) = 
z* — 2 as n runs over the positive integers can be analyzed most conveniently by 
putting r = u/7 so that 0 <r < 1. The fixed points of f evidently correspond to 
r=0, u =O and r = 273, u = 27/3. From the general formula 


f"(2cos rr) = 2cos(2”rr) 


we conclude that the behaviour of 2”r can be seen to best advantage when we 
represent r by means of its dyadic expansion 


a 
tay bth bay tt = AA, ... Ap..., 


where each a, is either 0 or 1. Our aim is to determine necessary and sufficient 
conditions in terms of r for the iterates of f”(2 cos wr) to be absorbed into one or 
other of the fixed points. From the elementary formula 


cos 2arr = cos27r(1 — r) 


we infer that if r < 1/2, then we can compute the next iterate cos 27r according 
to the formula 
cos 27r = cos(mT(r)). 


where 7(r) is the shift operator defined to be the fractional part of 2r, that is to 
say, 
T(r) = T(.a,a,...) = -A,QA3... . 


But if r > 1/2, then we must define T(r) to be 1 — fractional part of 2r according 
to the formula 


T(r) = T(.a,a,...) = .aba4..., 
where a, = 1 — a, for each index k and compute 
cos 2arr = cos(7T(r)). 


These definitions of T(r) can be compacted into the single formula 


U,a,... incasea, = 0, 
T(.a,a2...) = aa’ in case a, = 1 
° 2? 3° ee 1 ry 
These two definitions clearly agree in the ambiguous cases 
T(.0111...) = .111..., T(.111...) = .000.... 


The fixed points 
r=4= .101010..., r=0= .000... 
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evidently satisfy the relation T(r) = r. The relation 
T(.010101...) = .101010... 
reveals that the iterates of 
r= 3 = .010101... 


are quickly absorbed into the fixed point r = 2/3. From these observations we can 
easily deduce the following results. 

A necessary and sufficient condition for the iterates T"(r) of r to be absorbed 
eventually into the fixed point r = 2/3 is that the tail of the dyadic expansion 
r= .a,a,... should be ...010101... . Likewise, a necessary and sufficient condition 
for the iterates T"(r) of r to be absorbed eventually into the fixed point r = 0 is that 
the tail of r should be ...000..., that is to say, r should have a finite dyadic 
expansion. 

As another example let us take r = .100100... and calculate T(r) = .110110..., 
T(T(r)) = T(r) = .010010..., T(r) = .100100... . The iterates of such an r are 
evidently periodic, and our aim will be to show that a necessary and sufficient 
condition for the iterates T"(r) of r to be eventually periodic is that r should have a 
dyadic expansion with periodic tail having period p. 

We have already dealt with the cases p < 2 where the iterates are eventually 
absorbed into either r = 0 or r = 2/3. The set of iterates T”(r) as n runs over the 
positive integers we shall call the orbit of r and denote it by O(r). The example 
r= .011011... shows us that the orbit O(r) need not include r in its periodic 
oscillations, for T(r) = .110110..., and the orbit of T(r) has already been 
charted. Furthermore, the period of T(r) as a function of n need not coincide 
with the period of the dyadic expansion, as shown by the example r= 
.11001100..., 7(r) = .01100110..., T77(r) = .11001100... =r. Nevertheless, we 
shall verify that the period of T”(r) as a function of n can not exceed the period of 
the dyadic expansion. Indeed, the action of the mapping T consists of a truncation 
of the initial digit and, in some cases, a reversal of all remaining digits. Now the 
reversals will occur at the first appearance of a 1 in the dyadic expansion, and 
thereafter at each digit a, such that a,_, #a,. If p is the period of the dyadic 
expansion, then we have a,,, = a, for all sufficiently large indices k. If a reversal 
occurs at a,, then another must occur at a,,,, provided that a, is not the first 1 
in the expansion. If a, is the first 1 in the dyadic expansion, then another reversal 
need not occur at a,;,,, as shown by the orbit 


O(.101101...) = {.100100...,.110110...,.010010...}. 


Evidently an even number of shifts and reversals is equivalent to an even number 
of shifts with no reversals, and if there were an even number of reversals within 
one complete period of the dyadic expansion, then the values of T”(r) would 
clearly repeat with the same period p. Moreover, if a reversal occurs at a, and at 
A,+ , then the digits between them must change at least once, for otherwise the 
period of the expansion would be 1. Since a, = a,,,, there must be an even 
number of changes of digit between a, and a,,,, hence, an even number of 
reversals, so the resulting action from a, to a,,, is simply an extended shift, and 
T”(r) will eventually be periodic with period p. A well-known elementary argu- 
ment shows us that the smallest period of T(r) as a function of n has to be a 
divisor of p. The converse of this result, that if T”(r) is eventually periodic, then 
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the dyadic expansion of r must be periodic as well, is easily verified. We shall 
announce this result in the following theorem. 


THEOREM 3. If the dyadic expansion of r is eventually periodic with period p, then 
T”(r) will eventually be a periodic function of n, and its period will be a divisor of p. 
Conversely, if T(r) is eventually periodic, then the dyadic expansion of r will also be 
eventually periodic. 


We shall now turn our attention to orbits O(r) which are infinite. Consider, for 
example, the orbit O(r) where r = .1010010001... . Evidently O(,) is infinite, and 
its limit points are 1,1/2,...,1/2”,...,0, so O(r) is nowhere dense in the closed 
interval [0, 1]. 


THEOREM 4. Given any point ro in the interval [0,1], we can find a point r 
arbitrarily close to r, such that O(r) is dense in [0, 1]. 


Proof. Let s, have a finite dyadic expansion which agrees with sufficiently many 
of the initial digits of ry so that sy will lie as close to rg as desired. We shall then 
define a sequence S of finite dyadic expansions as follows: 


s, = 0, s, = .1, s, = .00, s, = .01, ss = .10, so = .11,..., 


according to the obvious pattern wherein all dyadic expansions of length n — 1 will 
have appeared by the time we reach s,,_,. We now proceed to define the dyadic 
expansion of r inductively. The initial segment of r will be s), and the subsequent 
segments of r will be either the members s, of the sequence S in their proper 
order, or else their reversals s’, according to the following criterion. Suppose that 
we have already defined the first n segments which form the first L digits of r. We 
shall then define the next segment of r to be s, in case there was an even number 
of reversals in the first L digits; otherwise, we define the next segment of r to be 
s’, where the prime denotes as usual the result of reversing all the dyadic digits of 
s,- The infinite dyadic expansion which has thus been formed inductively will be 
denoted by r, and it is apparent that r lies as close to rg as desired. Furthermore, 
if r, is any point in the interval [0,1] and e is any small positive number, then we 
can find a member s, of S with length M such that 1/2” < «/2 and |s, — r,| < 
e/2. From this it follows that some member s of O(r) will have s, as its initial 
segment, so 
E E 
5 + 5 E. 
Thus, 7, must be a limit point of the orbit O(r), and O(r) must be dense in [0, 1]. 
It is quite clear from the method of proof of Theorem 4 that we can insert an 
arbitrary initial segment into the dyadic expansion of those points r which we 
constructed there so as to produce an expansion of r with the property that the 
resulting orbit O(r) will eventually be periodic. From this it follows that the set of 
points r in [0,1] such that T”(r) is eventually periodic must be dense in [0, 1]. 
Indeed, the set of points r such that T”(r) is eventually periodic with period 
p => 3 must be dense in [0,1]. Finally, it was previously observed that the set of 
points r such that T”(r) is eventually absorbed into r = 0 is a dense subset of 
[0, 1], a phenomenon that was noticed by Julia [11, p: 186]. The same observation 
applies to the set of points r such that T”(r) is eventually absorbed into r = 2/3. 
We shall summarize these observations in the following theorem. 


ls —r,]<ls —syl+ sy —7r,| < 
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THEOREM 5. The following sets of points r are dense in the interval [0, 1]: 

(Gi) all points r such that T"(r) is eventually absorbed into r = 0 or r = 2/3; 
(ii) all points r such that T"(r) is eventually periodic with period p; 

(iii) all points r such that O(r) is dense in [0, 1]. 


Corotiary. If f(z) = z* — 2, then the following sets of points z on the segment 
[—2, +2] of the real axis are dense in that segment: 

(i) all points z such that f"(z) is eventually absorbed into z = 2 or z = —1; 

(ii) all points z such that f"(z) is eventually a periodic function of n with period p; 

(iii) all points z such that f"(z) is dense in [—2, +2]. 


Results for the functions f, of Theorem 2 that are entirely similar to those in 
the above Corollary can easily be formulated. 


5. Further duplication formulas. The initial source of our analytical formulas 
for the description of chaotic behaviour was the duplication formula for the cosine, 
which was then extended to higher replications by means of the Chebyshev 
polynomials. In the present section we shall sample two further duplication 
formulas and draw conclusions similar to those in earlier sections. A historical 
connection between the duplication formula for cot z and Cayley’s use of Newton’s 
approximation method to solve z* + 1 = 0 has been pointed out by Mandelbrot. 


We shall consider first the elementary formula 
2 2 tanh u 
t = a. 
ane T+ tanh? u 


If we put 


f(z) ==, A(u) = tanhu, gu) = 2u, 
then clearly fh(u) = hg(u). Iteration of this relation will then yield f"h = hg” for 
each positive integer n, and we have thus arrived at the formula 


f”(tanh uv) = tanh 2”u 


for each complex number u = v + wi. For real values of u we have —1<z= 
tanh u < +1, so the limit as n tends to » of f(z) = tanh 2”u will be +1 in case 
u > 0, 0 in case u = 0, and —1 in case u < 0. From the relation 

tanh 2”u|7 (2p + 2" wi) (2 sinh? 2”v + sin* 2”w 

tanh 2”u|“ =|ta v+2"°wi)| = ———— 

[tanh )| cosh? 2”v — cos* 2”w 
we infer that f”(z) will tend to a point on the unit circle as n tends to © in case 
v # (0. The relation 
sin 2” tly 


argtanh 2”u = arctan —————— 
S sinh 2”*!v 


implies that arg f"(z) will tend to either 0 or w as n tends to », and from 
considerations of continuity we infer that f”(z) will tend to +1 in case v > 0 and 
to —1 in case v < 0. A cursory examination of the mapping 

e*4# — | 


z= tanhu = ——— 
e744 4 
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will serve to remind us that the region Re u = v > 0 is mapped to Im z > 0, while 
Re u = v < 0 is mapped to Im z < 0. In case v = 0 we have 


sin? 2” w 


tanh 2”u|? = tan* 2”w, 


1 — sin? 2”w 
TT 
arg tanh2”u = + 5 


provided that w # 0, and we infer that f”(z) must lie on the imaginary axis. Now 
the behaviour of tan” 2”w can be analyzed along the same lines that were followed 
in deriving Theorem 5 as soon as we put w = (7 /2)r, for then we can write 


us us 
tan” 25 (1 —r) = tan’ 25r 


for all real values of r. Accordingly, we. can carry over the results of Theorem 5 
mutatis mutandis to the present situation and conclude that the following sets of 
points will be dense on the imaginary axis in the u-plane: the set of points wi such 
that tan? 2”w is eventually periodic; (ii) the set of points wi such that tan? 2”w is 
eventually infinite, that is to say, tan? 2”w = © for some positive integer n, namely, 
those points wi = (2m — 1)2~”~!ai for any choice of integers m,n with n > 1; 
(iii) the set of points wi such that tan* 2”w is dense on the real axis. We shall 
summarize these results in the following theorem. 


THEOREM 6. If 


2Z 
f(z) = 14+ 2? 


for all points in the complex z-plane, then f"(tanh u) = tanh 2”u for all points in the 
complex u-plane and all positive integers n. As n tends to ~, f"(z) will converge to 
+1 in case Rez>0O, to —1 in case Rez <0, and to 0 in case z=0. If 
Re z = 0, then the following sets will be dense in the imaginary axis of the z-plane: 
(i) the set of points z such that f"(z) is eventually a periodic function of n; (ii) the set 
of points z such that f"(z) is eventually infinite; Giii) the set of points z such that 
f(z) is dense in the imaginary axis. 


CoROLLARY. With the same hypotheses as in Theorem 6, the set of points z such 
that f"(z) remains bounded as n tends to ~ is the complement in the complex z-plane 
of a dense subset of the imaginary axis. 


6. Higher rational functions. The simple cases of chaotic behaviour which have 
been discussed in previous sections have barely scratched the surface of the 
general problems of a topological nature that are to be encountered here. We have 
already seen examples for certain functions f of the sets of points z in the 
complex plane where the iterates f”(z) as n runs over the positive integers are (i) 
absorbed into the fixed points of f (Theorem 5); (ii) periodic (Theorems 5 and 6); 
(iii) dense (Theorems 5 and 6); (iv) infinite (Theorem 6). In all of the examples 
presented so far, these points z were dense in one-dimensional subsets of the 
complex plane, namely, the real and imaginary axes. In order to illustrate the 
range of topological possibilities that these sets of points z may have for other 
rational functions f, we have chosen for our final example a duplication formula 
from the theory of Jacobian elliptic functions (see [19, pp. 491-535]). The duplica- 
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tion formula for the elliptic function cn u asserts that 
cn2?u—sn2udn2u —-k2(1 — en?2u) — 2(1 — cn2u) +1 
TR sntu ent) 
where & is a positive real number. If we put z = cnu and 
k2(1 —2?)° — 2(1 —z?) +1 

1— k2(1 — 22)” 


for all points in the complex z-plane, then it follows that F(cn u) = cn2u and 
generally that F”(cnu) = cn2”u for each positive integer n. Although the be- 
haviour of the iterates F”(z) of this particular function has not yet excited much 
attention, it can evidently be deduced in some detail from the poles, zeros, and 
doubly periodic nature of cn wu. 

We shall pursue a slightly different course and remark that F(z) can be 
simplified to some extent by means of elementary substitutions. If we put 


7 k2(1 — z?)° — 2(1 —z?) +1 
1 — k2(1 — 2?) 


and t = k(1 — z”), then we will have 


cn2u = 


3 


F(z) = 


= F(z) 


In order to complete our substitutions, we have to put 


T=k(1-Z*) =k(1-Z)(1+Z) 


_leeie 


(1 - #7)" 
4t(1 - k(1 — -} 
(1 — #?)° 
f(t). 


Our duplication formula can now be transferred to the function f(t) defined at all 
points in the complex t-plane. If we recall that 


t=k(1—-cn*u) =ksn’*u, 


then our previous relation 
Z = F(z) =F(cnu) = cn2u 
now assumes the form 
T = f(t) =f(ksn*u) 
=k(1—- Z*) =k(1 — cn’ 2u) 


— k sn? 2u. 
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Iteration of our new formula f(k sn? u) = k sn? 2u will evidently lead to the result 
quoted in the following theorem. 


THEOREM 7. If 
1 
4t(1 —kt){1—-— —-t 
( | - 
(1-1?) 


for all points in the complex t-plane, then 
f"(k sn? u) =k sn*(2"u) 


f(t) = 


for each positive integer n. 


THEOREM 8. With the same hypotheses as in Theorem 7, if 0 < k <1, then the 
following sets of points are dense in the complex t-plane: (i) the set of points t such 
that f"(t) is eventually a periodic function of n; (ii) the set of points t such that f(t) 
is eventually infinite; (iii) the set of points t such that f"(t) is dense in the complex 
plane. 


Proof. From [19] we know that sn u is doubly periodic with periods 4K and 
2K'i, where K,K' can be expressed analytically in terms of k, and that if 
0<k <1, then K, K’' will be real. Furthermore, sn u will be real valued in case u 
is real and pure imaginary in case u is pure imaginary, while snu is an odd 
function. If we put 


u=v+w=2Kr+ K'si, 


then it follows immediately that 


sn? 2(2Kr + K'si) = sn? 2(2K(1 —r) + K’(1 -s)i) 


sn? 2(2K(1 —r) — K'si) = sn?2(2K(1 —r) + K'si) 
sn? WOK 4 RUT). 
From Theorem 7 we have 

f"(k sn*(2Kr + K'si)) = k sn? 2”(2Kr + K'si), 


and if we restrict r,s so as to lie in the interval [0, 1] and represent them by means 
of their dyadic expansions, then we can write, as in the proof of Theorem 3, 


sn* 2(2Kr + K'si) = sn? T,(2KT,(r) + K'T,(s)i), (8) 


where 7,, 7, are composed of shifts and reversals of the respective dyadic expan- 
sions, just as in the proof of Theorem 3, and T>) is the identity mapping in case r, s 
both lie in either of the intervals [0,1/2] or [1/2,1], but Tp is the conjugation 
mapping in all other cases. 

From these definitions of T),7,, 7, it follows that f(t) has two fixed points tf 
corresponding to (r,s) = (0,0),(2/3,2/3) and two points corresponding to 
(r,s) = (0,2/3), (2/3, 0). We can describe the relation between 7), 7,, T, simply 
by saying that 7, will be the identity mapping in case both or neither of T,, T, 
execute a reversal, but 7, will be the conjugation mapping in case just one of 
T,, T, executes a reversal. From the relation 


sn?(v + K'i) = sn?(v — K'i) = sn?u + K'i 
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we infer that the alternative definitions in (8) are consistent in the ambiguous cases 
r=1/2,1 ands =1/2,1. Let 


Yr = .€,45Q,..., s = .b,b,b,... 


denote the dyadic expansions of r,s and suppose now that these expansions are 
eventually periodic with periods p,q respectively. We have already noticed in the 
proof of Theorem 3 that there will be an even number of reversals between a, and 
Ax +p, aS well as between b, and b,,,. If m denotes the least common multiple of 
p,q and A is the set of indices in a segment of length m where T, executes a 
reversal, while B is the corresponding set of indices in the same segment where 7, 
executes a reversal, then we know from the proof of Theorem 3 that A, B will 
both contain an even number of indices. Furthermore, 7, will execute a conjuga- 
tion at all indices in the symmetric difference D = (A UB)-—(A/MB), and D 
must clearly contain an even number of indices regardless of whether A MB 
contains an odd or an even number of indices. Thus, the function T(n) defined by 


T(n) = T3(2KT/(r) + K'T3(s)i) 

must eventually be periodic with period m or less. As in the proof of Theorem 3, 
we readily infer that the period p of T(n) must be a divisor of m. The converse, 
that if T(m) is eventually periodic, then the dyadic expansions must also be 
periodic, is easily verified. 

Now sn u has simple poles at the points K'i and 2K + K’i, so it will also have 
poles at the points 

4MK + (2N+1)K'i, 2(2M+1)K+(2N+4+1)K’' 
for all pairs of integers M, N. By making suitable choices for M, N, and n > 1 we 
can fill the square 0 < r < 1, 0 < s < 1 with a dense set of points having one of 
the forms 
4MK + (2N + 1)K’' 2(2M + 1)K+ (2N+4+1)K' 
2” 2” 

If u = 2Kr+K'si is any one of the corresponding points in the u-plane, then 
sn*(2”u) will evidently be infinite, and we only have now to verify that the mapping 
t = ksn*u sends the rectangle with corners 0,2K,2K + K'i, K'i in the u-plane 
onto a dense subset of the t-plane. To this end we shall use the addition formula 
for sn u and write 


snudn'w +isn’wen’wenvu dnv 
snu = sn(v + wi) = 


3 


cn’ w+ k* sn? vu sn? w 
where we have written sn’ w for sn(w, k') and likewise for cn’ w and dn’ w, while k’ 
denotes the complementary modulus defined by k? + k'? = 1. From the table of 
values 


v 0 K 2K w 0 K' 
sn VU 0 1 0 sn’ w 0 1 
cn Uv 1 0 —] cn’ w 1 0 
dnv 1 k’ 1 dn’ w 1 k 


we can trace out the image in the ¢-plane of the boundary of our rectangle in the 
u-plane when the latter is traversed in the counterclockwise direction beginning at 
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u = 0. Along the edge w = 0 we have t = k sn’ v, and the image point will trace 
out the slit from ¢ =0 to t=k along the positive real axis. Along the edge 
v = 2K we have 


A sn’ w cn’ w \? sn’? w 
t = — |—_—_—_——_ = — _ 
cn’? w cn’? w 
and the image point will trace out the negative real axis from t = 0 to t = —o, 


Along the edge w = K’ we have 
k ksnv \? 1 
t= _— = 
ars} k sn? v’ 


and the image point will trace out the slit on the real axis from ¢t = 1/k to 
t = +o, Finally, along the edge v = 0 we have 


isn’ w cn’ w \” sn’? w 
of Sate) «et 
cn’“ w cn’“ w 

and the image point will again trace out the negative real axis but in the opposite 
direction from t = — to t = 0. Consequently, the image in the ¢t-plane of the 
interior of our rectangle in the u-plane will be the complement in the t-plane of 
the segments on the real axis from t=1/k to t= + and from t=k to 
t = —o, and we will thus have achieved our construction of a dense set of points 
in the ¢-plane such that f”(t) is infinite for some positive integer n. 

We shall employ the construction used in the proof of Theorem 4, suitably 
modified so as to fit the present situation, for constructing points (7, s) such that 
the values of T(n) will form a dense subset of the fundamental rectangle with 
corners at u = 0,2K,2K + K'l, K'i as n runs over the positive integers. If the 
action of the mapping 7, is ignored for the moment, then the procedure for 
generating a dense subset of lattice points in our fundamental rectangle could be 
accomplished in the following manner. At each stage of the process described in 
the proof of Theorem 4 where we have to assign the list of all dyadic segments of a 
given length, we should require r to repeat a given segment on that list while s 
runs over the entire list, then make r repeat the next segment on the list while s 
runs over the entire list again, and so on in lexicographical order. However, the 
action of 7, will intervene in this process whenever one of r,s executes a reversal 
while the other does not, and the result of this intervention is the conjugate of the 
value of T(n) that we anticipated. It is an easy matter to identity those regions in 
the fundamental rectangle where no conjugations will occur during the first n 
iterations of TJ. Indeed, we have already observed that 7, is the identity mapping 
in case (r,s) lies in 


172 1 2 
R, = [0,3]° v [5,1], 
where we have adopted the convention of writing /* for the cartesian product 


I XI of any subset J of the real axis with itself. Evidently 72 will be the identity 
mapping in case (r,s) lies in 


2 2 

R,= ((0,4] ¥ [3,2] ¥ (la. a] ¥ [2.17 
for if g denotes the mapping defined by g(u) = 2u, then g(R,) clearly lies in the 
region R obtained by translating R, through 2MK and NK'i as M and WN run 
over the integers. Generally Tj will be the identity mapping in case (r,s) lies in 
the checkerboard region R, with squares of side 2°” which comprises one half the 
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area of [0, 1]’, for g”(R,,) clearly lies in R. If we observe that all the lattice points 
represented by the dyadic segments used above to generate a dense subset of the 
fundamental rectangle will themselves lie in the intersection of all the checker- 
boards R, as n runs over the positive integers, it then follows that Tj will be the 
identity mapping for all values of n at each of these lattice points, and our 
construction of a dense subset is complete. The result that ¢ = sn?u maps the 
fundamental rectangle onto a dense subset of the complex t-plane, which was 
derived earlier in this proof, is now required to verify that the set of points t such 
that f”(t) forms a dense subset of the complex plane as n runs over the positive 
integers is itself dense in the complex t-plane. At the same time, this property of 

= sn’ u implies that the set of points ¢ such that f"(t) becomes eventually a 
periodic function of n is itself dense in the complex t-plane. 


CoROLLARY. With the same hypotheses as in Theorem 8, f"(t) will eventually be 
a periodic function of n with period p if, and only if, the dyadic expansions of r,s are 
both periodic with periods p,q, respectively, and p will divide the least common 
multiple of p, q. 
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SUMMARY OF THIS ARTICLE. We focus on the following questions: How many Pythagorean number 
triples are there, whose triangles have a common hypotenuse, leg-sum, leg-difference, area, perimeter, 
inradius or leg, and how can they be calculated? How are such triples distributed for small parameters 
as well as asymptotically? Statistical results generated by computer will be given. Some of the 
asymptotic statements may appear surprising to readers who are relatively unfamiliar with number 
theory. A survey of published results and historical remarks are included in this article to make it more 
enjoyable. 


1. Introduction. A triple of strictly positive integers x, y, h satisfying 
x*°+y%=h? (1.1) 


is called a Pythagorean number triple, or more briefly a Pythagorean triple. 

A Pythagorean triple, denoted by [x, y,h], is called primitive if and only if 
x, y,h are relatively prime, i.e. their gcd is 1; it is sufficient that gcd(x, y) = 1 
holds. 

If x and y were both odd then (1.1) would imply that h? = 2 (mod 4). But this 
is impossible, because if h is even this implies h? = 0 (mod 4) and if h is odd this 
implies h? = 1 (mod 4). Consequently, one leg has to be even and the other has to 
be odd. The following way of describing primitive Pythagorean triples (ppt) has 
been known at least since the time of Euclid. 

The pairs of integers [a, b] with the properties 


a>b>0 gcd(a,b) = 1 a+b=1 (mod2) (1.2) 
generate the set of all ppt [x, y,] with x even and y odd by taking 
x=2ab y=a’—-b*? h=a’*+b?’. (1.3) 


The above description is a bijection between the pairs [a, b] with properties (1.2) 
and all the ppt [x, y, h] with even x. 

If we use any pair of integers a,b with a > b> 0 in (1.3), we would get a 
Pythagorean triple, but it would not be primitive in general. We call a and b the 
generators of the primitive triple [ x, y, h]. 


*Institute for Mathematics and its Applications, University of Minnesota, Minneapolis, and 
Ingenieurschule Biel, Postfach 1180, CH-2500 Biel/Bienne Switzerland. This work was mainly done 
when the author was spending his sabbatical in Minneapolis. He wishes to thank the State of Bern, in 
Switzerland, for its support and the IMA, at the University of Minnesota, for its provision of excellent 
working conditions and computer facilities. 
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Examples: 
[2,1] - [4,3,5] [3,2] — [12,5, 13] [21,4] — [168, 425, 457]. 
GEOMETRIZATION. (1.3) can be considered as the conformal mapping z > z? with 
z=artib: 
(a + ib)” = (a2 —b?) +i + 2ab. (1.4) 


Indeed, y = Re(z?) and x = Im(z’). Introducing coordinate axes as shown in 
Ficure 1, the mapping z > z” has the following properties when discussed for the 
entire (a, b)-plane: 


y= a? _ b? 
80 / (36,77,85) 
70 ” 
a= 8 (16, 63, 65) (72, 65,97) 
60 ~~ 
50 as 7 
40 Heo, 39,89) 
Q=6 (12, 35,37) > 
30 
a=§ 
20 
a:4 
104.953 
2 (12, 5, 13) 
(43,5) x = 2ab 
0 10 20 30 40 50 60 70 80 


For a = a, = const, we get y = ag — b? and x = 2a)b. For a, # 0 this implies 
b =x/2a, so that 


y=-—> +4}. (1.5) 


As a result, the images of horizontal lines are parabolas which are symmetric about 
the y-axis, and which open downward, with their common focus at the origin O. 
The exceptional case of a, = 0 gives us the negative y-axis as the image. 
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Similarly, the images of vertical lines b = by = const are described by 
2 


—.-b?2, b, #0 (1.6) 


* 4b 


and they are parabolas that open upwards. 

Furthermore, y = 0 implies x? = 4a$ or x* = 4b, respectively. Consequently, 
parabolas of the two orthogonal families have a point on the positive x-axis in 
common if and only if ay = Do. 

FIGURE 2 shows the images (x, y) of all points (a,b) with a>b>0, a= 
1,2,3,...,42 and b = 1,2,3,...,30. In particular, the computer generated points 
mark all ppt in the range x and y < 1800, where the distance from the origin to 
such a point gives the length of the hypotenuse of the corresponding right triangle. 
The points marked with an h correspond to primitive multiple triples with a 
common hypotenuse. Similarly, s stands for a common leg-sum and f for a 
common area. See the following for details. 


LOE 
ES : 
x 


oe ~ RRS 


NX ~, “MOF COM, IAN 
e ~ sé 7s ~ iN SY x » 
Me PBEASS Se ere S ee SN ¢ A — ab 
0 250 500 £750 1000. 1250 1500 1750 2000 
Fic, 2 


Note that 


p(a) if a is even 


p(a)/2_ if a is odd, (1-7) 


Number of points on a curve with const a-value = | 


where g(a) = number of positive integers less than a and relatively prime to a. 
That is, g is the Euler-phi function. If a is even, (1.7) holds by definition. If a is 
odd, we use the fact that gcd(a,k) = gcd(a, a — k) holds for every k. But k and 
a — k have different parity. Therefore, exactly one of a pair k, a — k is even. 
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If by reflection with respect to the line x = y in FiGuRE 2, we map all the points 
with y >x into the domain x > y > 0, we get Figure 3. The density of ppt 
doubles if we drop the condition that x must be even, however we lose the nice 
geometrical pattern. 

Please note: We always identify triangles that are mirror images of each other. 


F a een a 
Fic. 3 


2. Triples with Common Hypotenuse h. If gcd(a, b) = 1, then any odd prime 
divisor of h = a? + b? is of the form 4m + 1. See [4, Theorem 13]. 
Because h is odd, it is necessary that its prime factorization have the form 


h = pBi + p§2 +++ pPBn (2.1) 
with pairwise different primes p; = 1 (mod 4) and B; = 1. ) 
Fermat established that each prime number p of the form 4m + 1 can be 
written uniquely as a sum of two squares: p = a* + b’, up to a permutation of the 
squares. We know from the above discussion that a, b have different parity. 
When discussing representations of a positive integer as a sum of two squares, 
we will need the following two identities: 


re +s?) + (u? + v7) = (ru — sv)” + (rv + su)’ (2.2) 


(r2 +57) + (uw? +02) = (m+sv)° + (wv —- su)’. 


Consequently, if we can represent each factor as a sum of two squares, (2.2) gives 
us their product as a sum of two squares. 

Furthermore, let us call the representation of a number as a sum of two squares 
a* + b* proper, if gcd(a, b) = 1. With the knowledge of the representation for the 
primes p, in (2.1), the unique proper representations for the prime powers p?: as 
well as the total set of 2”~! different proper representations for h can be 
generated by the composition (2.2): Each additional prime power p? doubles the 
number of proper representations. Fermat (1632-1690), Euler (1707-1783), 
Lagrange (1736-1813) and Legendre (1752-1833) played important roles in the 
development of these ideas (see [18, Chap. II, §VIII and §IX] and [3, Chap. VI]). 
The parity condition is respected by the two compositions (2.2). 
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In summary, we have the following: 


THEOREM 1. If h > 1 has the form (2.1), then there are exactly 2"~' different 
primitive Pythagorean triples with common hypotenuse h, which can be generated by 
applying (2.2). If h is not of the form (2.1), then there is no primitive Pythagorean 
triple. 


Examples: The smallest h with exactly 

(a) 1 ppt is 5 = 27 + 1? and the triple is [4, 3, 5] 

(b) 2 ppt is 5-13 = 65 = (2? + 17X37 + 27) = (64 2)? + (4 F 3)? = 87 + 
1? = 4* + 7% and the triples are [16, 63, 65] and [56, 33, 65]. 

(c) 4 ppt is 65 - 17 = 1105. From [8, 1],[7, 4] for 65, and using [4,1] for 17, we 
get: 


[33,4] < [264, 1073,1105] [31,12] © [744, 817, 1105] 
[32,9] — [576, 943, 1105] [24, 23] — [1104, 47, 1105] 


(d) 128 ppt is 5-13 - 17-29-37: 41-53 - 61 = 138, 863, 452, 745. 

The initial elements of the ordered sequence of h-values with 

(1) 4 ppt are: 1105 1885 2405 2465 2665 3145 3445 3485 3965 4505 4745 5185... . 
Only the first two, marked with h, are shown in FiGurE 2. 

(2) 8 ppt are: 32,045 40, 885 45,305 58,565 67,405 69,745 77,285 80,665... . 


DISTRIBUTION OF TRIPLES WITH COMMON HYPOTENUSE. 


The asymptotic behavior of the number 7(x,m,r) of all primes p <x with 
p =r (mod™m) and gcd(r, m) = 1, as x > ~, the so called primes in arithmetic 
progression, (see [10, §4, Ch. III, Th. 3.17]) is given by: 


Xx 


m(x,m,r) ~ (2.3) 


g(m) “log x’ 


In our case, with the notation (x) for the number of primes < x, we get 
1 
w(x,4,1) ~ -:-——~ =:(x). (2.4) 


To generate h-values with several ppt, one has to deal with primes of the form 
4m + 1. TABLE 1 shows some values of w(x) and 7r(x, 4, 1). 


TABLE 1. Distribution of primes. 
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We let H,(x) be the number of A <x such that there exist exactly 2”~' 
different ppt [x, y,h] for each h. Computer calculations resulted in TABLE 2 and 
FIGURE 4. 


TABLE 2. Distribution of multiple ppt with common hypothenuse. 


H(x): 2 triples H,(x): 4 triples H,(x):8 triples 
10 


10,000 100 
20,000 183 
40,000 338 
100,000 664 
200,000 1,183 
500,000 2,429 
1,000,000 4,184 
2,000,000 7,608 
4,000,000 13,236 
10,000,000 28,624 
20,000,000 51,574 
50,000,000 111,773 
100,000,000 204,891 
200,000,000 369,064 
400,000,000 680,850 
1,000,000,000 1,519,597 
2,000,000,000 2,803,231 301,286 


10° H,(x) 
10° Hx) 
104 Hx) 
10° 
102 
10 
1 x 
104 10° 10° 107 10° 10° 


To discuss the asymptotic behaviour of H,(x), we start with a classical result: 
If 77,,(x) is the number of all positive integers <x of the form (2.1), but without 
the condition p; = 1 (mod 4), then for each n = 1,2,3,... we have 


1 x(loglog x)"~' 


T(x) ~ (x > ). (2.5) 


(n — 1)! log x 


See [9, Ch. III, §4, Th. 3.12] or [4, Sec. 22.18, Th. 437] (the case n = 1 is the prime 
number theorem). 
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Because (2.4) holds, we have to introduce a factor 2 for each prime factor of the 
form 4m + 1. From (2.5) it follows that 
1 1 x(loglog x)" ’ 
H ~~ , 2.6 
AX) ~ 55 Tay log x (¥ > #) (2.6) 
Furthermore, for each n = 2,3,4,..., 


H,-\(x) — 2(n — 1) 


; 2.7 
H,(x) log log x (4 > *) (2-7) 
and, therefore, 
H,,_ (x 
lim Ay) = 0. (2.8) 
roe H,(x) 


The asymptotic behaviour is completely different from what we have seen in 
TABLE 2 and FiGurRE 4. However, x has to be “huge.” The monotonic function on 
the right-hand side of (2.7) decreases extremely slowly. Its value is 1 at x =e® 


Remark. In [8] and [15], the asymptotic behaviour of the number of all ppt 
with h < x is given by 


a x 
2"-!.H ~ ——.. 
D2 w(x) ~ 5 


FORMULA OF LEGENDRE. For given h' = yh > 0, where h is of the form (2.1) and 
y does not contain divisors of the form 4m + 1, the total number M of Pythagorean 
triples that are corresponding to the hypotenuse h' is given by 


2M +1= (26, + 1): (2B, + 1)--: (26, + 1). (2.9) 


This is a straightforward result from Theorem 1. We count all possible ppt, 
whose hypotenuses are divisors d of h'. The ppt can then be multiplied by h'/d to 
get the desired triples: 

For n = 1, we have B, triples: all possibilities are d = p{ with 1 <k < B,. 

For n = 2, we have 28,8, + B, + B> triples, because d = p/ p< with j,k > 1 
has two common ppt. 

By induction, we arrive at the general case. 


SOME CONSEQUENCES. 


(1) A Pythagorean triple exists if and only if the hypotenuse has at least one 
prime factor of the form 4m + 1. Example: The representation for 10 = 
37 + 1 is unique. 

(2) If y > 1, every triple corresponding to the hypotenuse h’ is not primitive. 

(3) Every triple with the corresponding hypotenuse h’ is primitive if and only if 
h'=p,°p>, °°: p, With pairwise different primes p; = 1 (mod 4). 

(4) The special case n = 1 in (2.9) implies that M = B,: For an arbitrarily 
chosen integer M > 0 there is an infinity of hypotenuses h’, each of them 
corresponding to M different Pythagorean triples. Example: The minimal 
value for A with three triples is 5°. The triples are 


52+ [4,3,5] 5-[24,7,52] [336,527,53]. 
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3. Triples with Common Leg-Sum s. From (1.2), for a given leg-sum we get 
s=x+y=(a*—b?) + 2ab =(a +b) — 2b?. 

With u =a +b and vu = BD, this leads to Pell’s equation 


u*—2v*=s with gcd(u,v) = 1. (3.1) 
Because a > b, we have a + b > 2b and 
u>22vu>0 (3.2) 


must also hold. 

Therefore, the solutions of (3.1) with the property (3.2) give the solutions of all 
ppt with leg-sum s, where a =u —v, b=v. 

To deal with Pell’s equation, we write 


u? — 2v? = (u + vy2 )(u — vy2) (3.3) 


and we work in the integer ring Z[V2] consisting of elements a = u + vy2 with 
norm N(a) = u2 — 2v2. With the conjugate element a' = u — vy2, we obtain 


N(a) =a:a' =u? — 2v’, (3.4) 
which is again (3.3). Indeed, solving Pell’s equation is the same as finding elements 


with a given norm s. 
Let us multiply the right-hand side of 


(r? — 2s)(u? — 2v”) = (r + s¥2)(r — s¥2)(u + vv¥2)(u — vv2) 
in two different ways: 
(1) first with third, and second with fourth factor; 


(2) first with fourth, and second with third factor. 
Then we obtain 


nen = (ru + 2sv)* — 2+ (rw + su)’ G3 5) 


(r? — 2s?)(u? — 2v?) = (ru — 2sv)* — 2+ (rv — su)’. 


Elements with norm +1 or —1, so called units, play a significant role. The 
complete set of units in Z[V2 ] is the following: +(1 + V2)”, m & Z. Their norms 
are equal to (— 1)”, because N(a - B) = N(a) : N(B) always holds. If we know one 
solution (uv, U,) of Pell’s equation, we can generate recursively an infinite set of 
solutions by introducing the two units (1 + y2)* = 3 + 2¥2 and 3 + 2v¥2)7! = 
3 — 2V2 with norm +1 in (3.5): 


Ups, + Ups, V2 = (3 + 2V2)(u, + vv2) = (3u, + 4v,) + (+2u, + 30,)V2 
U,-1 + U,_, V2 = (3 — 2V2)(u, + vv2) = (3u, — 4v,) + (—2u, + 30,)V2 
(3.6) 


k =0,1,2,.... 

Note that the indices in the first case are increasing and positive, in the second 
case decreasing and negative. 

Two elements a and B in Z[y2 ] are called associates if and only if a = B « « for 
some unit ¢. This is an equivalence relation. All integers u; + ; v2 generated in 
(3.6) form a subset of an equivalence class of Z[V2 ]: See [5, Ch. 16, Section 16.5]. 
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The generalized identities 


(r? + Rs*)(u? + Rv?) =(m - Rsv)? +R-(r+ su)? 


(3.7) 
(r2 + Rs?)(u2 + Rv?) = (ru + Rsv)’ + R* (rv — su)’ 

contain the two special cases (2.2) and (3.5) if we choose R= 1 and R= —2 
respectively. In the previous section, we had the ring Z[i] of the Gaussian integers 
with norm N(u + iv) = ju + iv| =u? + v2, where +1, +i were the only units. 
The fact that we have an infinity of units makes the leg-sum problem tougher than 
the hypotenuse problem. 

Now we use results from the Gaussian Law of Quadratic Reciprocity and 
Quadratic Forms. See [9, Ch. VI, Th. 111], [11, Ch. 2], [1, Ch. 4]. 


LEMMA 1. 

(A) There is a solution of problem (3.1) if and only if all prime factors of s are of 
the form 8m + 1. 

(B) If s > 1 is a product of n different prime powers, each prime being of the form 
8m +1, then among the infinity of solutions, there are exactly 2"~' pairwise 
nonassociated solutions of problem (3.1) with u > 0 and v > 0. 

Note that 2"~' is half the number of solutions of the equation u* —2= 
0 (mod s). 


The question now is: How many of the infinity of solutions of (3.1) satisfy 
u>2v > 0? 
From 
u*—4v*) 2s —Uu? ; 0 38 
——_— = ——— =1u-2> . 
u+ 2v u+2v . . (3.8) 
it follows that there are only a finite number of such pairs (u, v), if any. Trivially, if 
(u,v) is a solution of (.1), then all four combinations (+u, + v) are solutions. 
More is said in 


LEMMA 2. Assumption: s has the form mentioned in (B). Among the infinity 
(u,v) = (+u,, + v,) of solutions of (3.1), where the set (u,,v,), k € Z is described 
in (3.6), there is exactly one which satisfies the condition u > 2v > 0. 


Lemma 2 is proved in [12]. This article deals with the “‘Problémes de Fermat,”’ 
in which Frenicle was involved too. See also [18, Ch. II, §XII]. 


GENERATING PRIMITIVE PYTHAGOREAN TRIPLES WITH COMMON LEG-SuM. Us- 
ing (3.5) for the n different prime powers of s > 1 gives 2”~' solutions of @.1), if all 
prime factors of s are of the form 8m + 1. Iterating, in the sense of (3.6) until 
lu| > 2|v| holds, generates the desired ppt with a = |u| — |v| and b = |v|. 


Examples: The smallest leg-sum with two different ppt is s = 7-17 = 119. 
u? — 2v* = 7 has the solution u = 3, v = 1, and u* — 2v* = 17 has the solution 
u=5, v = 2. Therefore, using (3.5), u* — 2v? = 119 has the solutions (u,v) = 
(19,11) and (11,1). The first does not satisfy the condition u > 2v. From the 
second line of (3.6) we have 


=3:19-4-11=13 v= —2:19+3-11 = —15. 
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One step is sufficient here. With (11, 1) and (13, 5), we obtain the following result: 
[10, 1] generates [20, 99, 101] and [8,5] generates [80, 39, 89]. 

The smallest leg-sum with 4 different ppt is s = 7-17: 23 = 2737. Using 
the two results for u,v above and the fact, that u2 — 2v* = 23 has the solution 
(11,7) in each case after at most one iteration we get the generators: 
[37, 36][41, 16] [43, 12] [47, 6] with the ppt 


[2664, 73, 2737][ 1312, 1425, 2737][1032, 1705, 2737][564, 2173, 2737]. 


See also FiGurE 2, where the particular points are marked with s, assuming they 
appear in the diagram. There are more correspondences to the section dealing 
with common hypotenuses: 


(1) If on the right-hand side of (3.1) the leg-sum s > 1 and the prime factors 
are of the form 8m + 1, then Theorem 1 holds for common s instead of 
common h, and so does the Legendre formula (2.9). 

(2) Asymptotic behavior: Using 7(x, 8, + 1) in (2.3), the corresponding defini- 
tion of H,(x) (see definition following TaBLe 1), denoted by S,(x) and 
using the argument following (2.5), we get: 


Snot AnD) 2.3.4 3.9 
S,, log log x PS Aa 29a oe (3.9) 
Sn—-1 
lim =0 (3.10) 
x0 


similar to (2.7) and (2.8). 
4. Triples with Common Leg-difference, Area, Perimeter, Inradius or Leg. 


1. Common Leg-difference d. With a common leg-difference d = |x — y| and 
(1.3) we have 


d = |2ab — (a* — b’)| 
= |2a2—(a+b)| witha>b>0O, 
gcd(a,b) = 1, a + b = odd. (4.1) 


Also in this case, we have to deal with problem (3.1), however without the 
restrictive condition (3.2). So it is not astonishing to get an infinity of solutions, if 
all prime factors of d are of the form 8 + 1 and no solution otherwise. In [20] we 
can find the easily verifiable statement: 


If [a,b] generates a ppt with leg-diffeence d then so does [a + 2b, b]. 
Therefore, if we know one of the ppt, then an infinity of ppt can be generated. 
2. Common Area f. Searching for ppt with a common area f leads to the 


problem of finding f such that the equation 


1 
f= 5 xy = ab(a— b)(a +b) with a>b> 0, 


gcd(a,b) = 1, a+ b= odd (4.2) 


has several solutions. This problem is hard to solve. In [16], we learn that 
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The smallest area with a total of 
(a) 2 ppt is 2: 3-5-7 = 210, generated by [6, 1][5, 2]. 
(b) 3 ppt is 210 - 11 - 13 - 19 - 23 = 13123110, generated by 
[77, 38][78, 55 ][138, 5]. 
In [19], an infinity of pairs of ppt with common area, but no information on 
more than 2 ppt with common area is given. Despite Euler’s efforts on this 


problem, not much more can be said today. The asymptotic behaviour for the 
number 7,,(x) of all ppt with area <x, denoted by T(x), is given by (see [19]): 


r(1/4)° 


We denote by A,(x) the number of those areas <x which are associated with 
exactly k ppt. With the aid of the computer, we found for x = 1,588,020: A, = 
1976, A, = 22, A, =1and A,=A,;=::: =0. 

In [6] and [14], an efficient method for generating rational right triangles whose 
areas are integers is given. 


T,(x) ~c- vx withec = = 0.531340. (4.3) 


3. Common Perimeter p 
p=x+y+t+h=2ab+(a*-—b’)+(a*+b*)=2%(at+b)a (44) 
with the conditions mentioned in (4.2). 
Therefore p = even is the perimeter of a ppt if and only if 
P 
(2 
The generators are a =u —v, b=v. u>2v and (4.5) imply the number of 
isoperimetrical ppt with even perimeter p is equal to the number of odd divisors u 
of p/2 with Yp/2 <u < yp and gcd(u, p/2v) = 1. 
Using Bertrand’s postulate, it is proven in [7], that for each m > 0, there is a 
perimeter p with at least m isoperimetrical ppt. 
In [2], a list of 5 perimeters between 14,280 and 103,740 is given (60 < a < 210), 
each one associated with exactly 3 ppt. The asymptotic behaviour for the number 
T, of all ppt with perimeter < x is given by (see [8]): 


=u:‘v with u>vu>QO and gced(u,v) =1. (4.5) 


log 2 
Tp(x) ~ —z x. (4.6) 
T 
4. Common Inradius 1 
2 ab(a+b)(a—b 
i= 2s = abla + b)(a ~ 9) =b(a—b) with a—b=odd. (4.7) 


As a result of (4.7), i is always an integer, and the problem is simpler than that of 
the perimeter: If i has the prime factorization 


i = 28p81- p82 --- pP» with 2<p,<p,< -:: <p,, 


then 
the number of ppt with common inradius j is 2”, 
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because it is equal to the number of ways to express i as a product of two relatively 
prime factors. An extensive study of this case can be found in [17, Sec. 23.2]. 


5. Common Leg. This problem is left to the reader. 


5. Triples with Two Common Items. 


THEOREM 3. There exists no two noncongruent right triangles with two of the 
following items in common: hypotenuse, leg-sum, leg-difference, area, perimeter, 
inradius, one leg. 


Note that the theorem is not restricted to integers. Its result is quite obvious: 3 
elements of a triangle essentially (with exceptions) determine it. 


Proof. Some of the (7) possibilities are obvious, e.g., the ones with the common 
leg. In any case, the proof is of the kind shown in the given representative 
examples below. 


(a) Common hypotenuse h and inradius i. We use f for the area, p for the 
perimeter, and a for the angle where 0 < a < 7/4. In what follows, s stands for 
sin a and c for cosa. 

We discuss the function 


2f 2h*sc 
i(a) = — = ————_ 
D h+hs + he 
with a fixed value h. 
1 di d SC ce +0 —s5%—3? 
h da dalt+st+c — (1+s+c) 


(c—s)(c?+est+s*+c+5) 
(1+s+ c) 
is clearly > 0 Va € (0, 7/4). 


(b) Common leg-difference d = |x — y| and inradius i. We consider 6 := x — y 
to be fixed, then 


2x(x + 6) 


i( x) =e eee eo, 
2x+6+ Vx24+(x+8) 


With h = x2 +(x +6)’ and D=2x+6+h, 


di 2x +6 
D*: — = [2x +8) + 2x] [2x +8 +h] - 2+ ; -2x(x + 6). 
With y =x + 6 we have 
di x+y 
D?. — = (2x + 2y)(x ty +h) - (24 ; } = 239 


= 2h? + 2(x + y)(h? — xy) > 0 
because h? — xy > 0 for each right triangle. 
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The aim of this note is to present an easy, self-contained, and totally elementary 
proof of the fundamental theorem of Malgrange and Ehrenpreis ({1], [6]), on the 
existence of fundamental solutions for linear partial differential operators with 
constant coefficients. 

By totally elementary the following is meant: 

e we will not use any Fourier transform argument 

e only few integrations by parts will be used, together with the Hilbert space 
structure of L?, and the Riesz representation theorem. Convolutions are used, 
twice, in a very simple way. 

e Basically no knowledge of distribution theory is needed. We first show that for 
every g © L*,(R”) one can solve the equation P(D)u = g, with u € L7,(R”), if 
P(D) is a (non-zero) linear differential operator on R” with constant coeffi- 
cients. 

This is only to say (“by integration by parts”) that for all o € @(R”): 
(u, P*(D)p) = (g, ¢), where, 


aly! 
P(D) = Lays J? 


P*(D) = L(-1)""a— _ 


J = (dpe eodn)s Vl =f, tt thy 
all \ gait +S y 


J oxi oe. Ayin’ 
Ox Ox}! Oxi 


Also < , ) denotes the usual scalar product in L7(R”), (u,v) = fan ud. 
e Only the last step, which uses a trivial and well known trick (whose origin, I do 
not know) requires one to know the definition of a distribution. 

I wish to make clear that there is but little novelty in this proof. The basic and 
almost unique ingredient is H6rmander’s inequality ((3] 2.6) (whose proof is so 
elementary and such a masterpiece). There is an allusion in [8] to an “immediate” 
proof of the existence of fundamental solutions, using H6rmander’s inequality. 
Here we give the full detail in a way which seems somewhat different. Sections 1 
and 2 are taken from Malgrange [7]. The only possibly new point is that we prove 
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the approximation property in §3, using the same tools, whereas it has usually been 
done using Fourier transform arguments and using polynomial-exponential solu- 
tions ([4] 3.4, [5]). It may be argued that all this was implicit in the literature. Let us 
then make it explicit! 

We do not assume any knowledge of the theory of PDE. But, for instance, it 
should be noticed that the results on supports in §2 follow immediately from 
Hédlmgren’s uniqueness theorem (cf, e.g. [4] 5.3). In fact, they could also be 
obtained immediately from the uniqueness of solutions of O.D.E., by partial 
Fourier transform, since only the case of functions with compact support is under 
consideration. A relatively simple proof of the Malgrange Ehrenpreis theorem, 
using Fourier transforms, can be found in [2]. 


Notations. If © is an open set in R”, @(Q) will denote the space of ¢” 
functions with compact support in 0. The norm and the scalar product in L7(Q) 
will be respectively denoted by || || and <¢ , >, or |l Ilo and < , Do if some 
ambiguity is possible. 

The adjoint of an operator P(D) is defined as above and denoted by P*(D). 


1. L’ solvability on bounded open sets in R”. 


1.1. Hérmander’s inequality. Let PCD) be a (non-zero) linear differential oper- 
ator with constant coefficients, of order (or degree) m. 


THEOREM. For every bounded open set in R”, there exists a constant C > 0, 
such that for every p © @5(Q): 


|P(D)ell > Cllell. 


As constant C, one can take C = |P|mK,, 9, where 


all 
P(D) = Dar 


P\m = max}|a,|, |J| = 
|P| {la;|, J| = m} ax, 


and K,,, 9 depends only on m (the order of P(D)) and the diameter of (. 


Remark. The proot will use a trick that it is nice to look at first in the case 
n= 1, OQ =(0,1) and P(D) = d/dx! We have to show that |l¢’|| > C|l@|l for all 
gy € 63 ((0, 1)), for some positive constant C. 

One has ((xo@)’, g) = (xo’, @) + (g, o>. Further, ((x@)’, gp) = —<(x@, ¢'), by 
integration by parts. Hence: (¢,@) = —<(x@',@) — (xg, ¢'). Since |x| < 1, one 
gets llell” < 2lle'llllell, by the Schwarz inequality. So ||¢'ll > $lloll. 

The general proof goes by induction on the degree of P. 


Proof. If P(D) is a differential operator of order m >0 on R”, then for 
j=1,...,n define P(D) by 
P(D)(x;9) =x;P(D)e + P(D)¢. 


The operator P,(D) is zero if and only if P(D) does not involve any differentiation 
with respect to x,, and if non-zero, P(D) is of order < m. Let A = supg|x]|. By 
induction on m, we will show that for every g © @(Q) 


IPD) ¢ll < 2mAllP(D) ell. (+ *) 
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Let us first notice that the inequality (**) and the definition of P, yield: 
|P(DM x, || < am + 1)A||P(D)o]||. Since differential operators with constant 
coefficients commute, we have, for all @ € @(Q) 
IP(D) ell’ = (P(D)¢, P(D)¢) = (g, P*(D) P(D)@) = <g, P(D) P*(D)¢) 
= (P*(D)¢, P*(D)¢ = ||P*(D)ell’. 

This is of course just folklore from the theory of normal operators! 

The inequality (* *) is trivial for m = 0, since then P(D) = 0. Let us assume 
the inequality (* *) has been verified for operators of order at most m — 1, and let 
P(D) be an operator of order m. Compute in two different ways 
(P(D\x;¢), P(D)g>. From the definition of PD) 

(P(D)(x;9), P(D)e) = (x;P(D)e, P(D)¢) + IIP(D)ell’. 
By integration by parts (i.e., the definition of the adjoint) and using the commuta- 
tion of P*(D) and PD) 
(P(D)(x;¢), P(D)e) = (P*(D)(x;9), P*(D)¢@). 
Therefore, 
IPD) ell’ = (P*(D)(x,), P*(D)@) — (x)P(D) 9, P(D)¢@). 


As noticed above, the induction hypothesis yields 


IP*(D)(x,9)ll < (2m — 1) AIlP(D) eI. 
Since 

|<x,P(D), P(D)9)| < AllP(D)glllP,(D)¢ll 
one gets 


IPD) ell’ < 2mAl|P,(D) gIlllP(D) ¢ll 


and this establishes (* * ). 

If P(D) is an operator of order m > 1, there exists j € {1,...,} so that P(D) 
is of order m — 1, and |P|m-—1 > |Plm. The theorem follows then immediately by 
induction on m. 


1.2. L* solvability 


Coro.iary. If QO is a bounded set in R", then for every g = L?(Q) there exists 
u € L*(Q) so that P(D)u = g. 


Proof. This follows immediately from the inequality on the adjoint || P*(D)¢ll > 
Cllell, ¢ € &(Q). Indeed P(D)u = g means that for all g € @(Q) 


(g,9) = {u, P*(D)o). (*) 


Let E={V © G(Q), V = P*(D)¢@ for some g € 6 (0)}. H6rmander’s inequal- 
ity shows that the antilinear form VY — (g,@) is well defined and continuous in 
the L? norm. It can therefore be extended to E, the closure of E in L7(Q). Then 
the Riesz representation theorem gives the existence of u € E so that (*) holds. 
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2. Hormander’s inequality with weight and support. 


THEOREM. P(D) and © (bounded) are given as previously. There exists C' > 0 
so that for all n © Rand 9 € @(Q) 


fe™|P(D) el’ > C'f em lp)”. 
.@) .@) 


Note that C’ does not depend on n. 


Proof. Apply Hérmander’s inequality to V = e“"/”*1¢ and the operator Q(D) 
defined by 


O( D)(W) = &/"Q(D)[e-/W], 


which is indeed a constant coefficient operator with the same terms of higher 
degree as P(D). 


Coro.iary 1. If g € G(R") and P(D)g = 0 in the half space {x, > 0}, then 
gy = 0 in this half space. 


Proof. Take Q an open set containing the support of g, and let 7 tend to +x 
in the inequality. 


Coro.iary 2. Let g © €5(R"), or more generally ¢ € L?(R") and assume that 
gy has compact support. If P(D)@ is supported by the ball of radius r around 0, then 
SO IS Q. 

In the application P(D)¢ is given by a function in L?(R”). 


Proof. By using translations and rotations, the smooth case follows immediately 
from Corollary 1, by writing the ball as an intersection of half spaces. To treat the 
non-smooth case, consider y € @@(R"), x supported by the ball of radius 1 around 
the origin, and satisfying {x = 1. For ¢ > 0 set 


1 t 
. = P4Xe= af Ole - Dx(=] dt. 
E R” E 
Then oy, € @5(R”), P(D)e,(= [P(D¢],.x,) is supported by the ball of radius 
r+e,and 9, > in L’ as e > 0. This reduces the problem to the smooth case. 
Now that we are at the end of section 2, let us remind the reader that sections 1 
and 2 are taken from Malgrange [7]. 


3. Approximation Theorem. For p > 0, let B, be the ball of radius p around 0 
in R”. 

THEeoreM. Let 0<r<r'<R. Ifv € L*(B..) and satisfies P(D)v = 0 on B.., 
there exists (v,;), a sequence in L’( Bp) so that P(D)u, = 0 on Bp and (v,) tends to v 
in L?(B,) as j > &. 

Proof. By smoothing by convolution we can assume v to be smooth, possibly 
shrinking r’ slightly (to preserve P(D)v = 0). We have to show that if g€ 
L*(Bz) and satisfies (a, g)s, = 0 for all a € L’(B,) such that P(D)a = 0, then 
Cv, gp. = 0. 


Claim. There exists w © L*(Bz) so that for all o € G(R", g)z = 
(P(D)¢, W) Bp 
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Proof of the claim. This is just to say that for some constant C 
Ie, ge! < CI|\P(D)ellz,. 


Notice that if P(D)g = 0, we indeed have (g, g>) = 0. If P(D)o # 0, by §1 we 
can find ¥ € L*(Bp) so that P(D)V = P(D)¢ and ||Pllz, < C,\|P(Della, (C, a 
constant). Then (9, g)p = (¢ — V, ade + (VY, ge = (V, ge. Hence 
IXe, g>a,| < CI|P(D)gllz, with C = C,llg||. And the claim is proved. 

Pick w as given by the claim. Extend g and w on R” to g and WwW, by setting 
g=0 on R” —B. and W=0 on R” — B,. We then have (by definition) g = 
P*(D)w. Since w has compact support, and P*(D)w is supported by the ball B_, 
we conclude from §2 that w = 0 on Bp, — B,,. 

Then, take v as in the beginning of the proof, and extend it to be a smooth 
function on R” (but no longer satisfying P(D)v = 0 off B.). One has 


(U, &)p, = (P(D)v,w)p, = (P(D)vu,w)p = 0. O.E.D 


4. Solvability of P(D)u = g in L?,.(R"). The following theorem results imme- 


loc 


diately from §1 and §3 by a standard procedure (Mittag-Leffler): 


THEOREM. Let P(D) be a (nonzero) constant coefficient linear differential opera- 
tor on R". Then for every g © L*, (R") there exists u © L?,(R") such that P(D)u = g. 


loc loc 


Proof. As previously, B, denotes the ball of radius p around 0 in R”. By §1 
there exists u, € L?(B,) so that P(D)u, = g on B,. Then, inductively, assuming 
that u, has been chosen in L*(B,,,) so that P(D)u, = g, one chooses u,,, in 
L*(B, 3) in the following way. Let w be an arbitrary solution of P(D)w = g, in 
L*(B, 42). On B,,,; one has P(D)u, — w) = 0. By §3 there exists v € L’(B, , ,) 
such that: P(D)v = 0, and |lv — (u, — w)||B, < 1/2”. Set u,,; =v + w. Then 
P(D)u,., =g on B,,,, and |lu,,, — u,lla, < 1/2”. The sequence (u,) is obvi- 
ously convergent in L?,(R”), and its limit satisfies P(D)u = g. 


loc 


5. Fundamental solution. 


THEOREM (Malgrange-Ehrenpreis [1], [6]). Every nonzero linear differential oper- 
ator with constant coefficients on R” has a fundamental solution (i.e., a distribution 
E such that P(D)E = 65, 69 the Dirac mass at 0). 


At last, we have to assume that the reader knows the definition of a distribu- 
tion, and the definition of the derivatives of a distribution. 


Proof. Let H be the function (product of Heaviside functions) defined on R” 
by: H(x,,...,x,) = 1 if all the x, are positive and 0 otherwise. Then (0”/dx, --- 
dx,)H = 5). By §4 there exists u € L?,(R”) so that P(D)u =H. Set E = 


d"u/(dx, -++ Ox,) (a distribution). Then 


n n 


u 
P(D)E = P(D) ———_ _ = ———— (P(D = 6p. 
(D) E = P(D) 5 = (PD) a) = 8 
This ends the proof! 
Remark. In the proof one can replace the operator 0”/(0x, -:: 0x, by any 


operator which has a fundamental solution that belongs to L7,.(R”), for example: 


The Laplacian A, if n = 1,2,3, or (J — A)? for general n, if p > n/4. 
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NOTES 


EDITED BY DENNIS DETuRCK, DAviD J. HALLENBECK, AND RODICA SIMION 


On the Zeros of tan” z 


Li-CHIEN SHEN 
Department of Mathematics, University of Florida, Gainesville, FL 32611 


The zeros of the successive derivatives of a given meromorphic function have 
been investigated by many, notably by G. Polya, in the past. In particular, from a 
well-known result of Pélya [1], one can deduce that the set of all the accumulation 
points of the zeros of tan z, k = 1,2,3,..., are precisely the vertical lines 


{nr +iy:-e<y<w,n=0,+1,+ 2,4 3,....}. 


However, very little is known about the precise nature of those zeros. The primary 
purpose of this paper is to study the “large” zeros of tan“ z. Combining the above 
mentioned result of Pélya and the main theorem of this note one sees that, tan‘*? z 
has two kinds of zeros; that is, one type accumulates to form a dense set on those 
vertical lines and the other type diverges at a certain rate (see Theorem 1) to 
infinity. 

It would be interesting to see whether the behavior of the zeros of the 
successive derivatives of tan z can be generalized to a larger class of meromorphic 
functions which have only real poles. Finally, it is worthwhile to point out that 
tan“) z can be written as a special case of the generalized Riemann zeta function 
(see (6)) which has been studied extensively by many mathematicians. 

Since tan z is a periodic function of period 7, we will restrict our study to the 
vertical strip S = {z:— 7/2 < Rez < 7/2}. We will use I to denote the imagi- 
nary axis and [, to denote the line segment {z: z = 7y,— 1 < y < 1}, and S* and 
I* will be the portions of S and J belonging to the upper half plane, respectively. 

It is known that all the zeros of the kth derivative, tan™ z, k = 0,1, 2,3,..., 
z € S, lie on I. To see this, we first observe that, for z in S, 


tan z € I, if and only if z € J, (1) 
and 
tan z maps J one to one onto Ip. (2) 
We now define a sequence of polynomials as follows: 
P(x) =x; 
P(x) =(14+x7)Pi(x), k=1,2,3,.... 
Then 
tan z = P,, (tan z). (3) 


It is easy to verify that all the zeros of P,(x) are simple and lie on the interval 
{z:z=iy,—1<y < 1}. From (), (2) and (3), we see that for k > 1 all the zeros 
of tan“ z are simple and belong to J. We further conclude, from (3), that it has 
k — 1 zeros and they are symmetric with respect to the real axis; if k is even, one 
of its zeros is z = 0. So there are exactly [(k — 1)/2] zeros on I. 
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We now order the zeros 


k-1 
{eek = Wem m= 1,2,..., a 


of tan’ z on J* so that 
Vn,i > Vea 7 VK3 7 
We will prove 


THEOREM 1. z, 1 — tk Inv2 > 0 ask > », 


We will first establish some crucial properties of tan z. 
Let s = e”'7, Then 


sin z e'? — e? | 2 
tan Zz = = -|1——— = -ift — | (4) 
COS Z e*7 +e"? 1+s 
Using the chain rule, d/dz = 2isd/ds, we see that 
tan) z = (21)""'O,(s)/(1 +5)", (5) 


where Q, is a polynomial of degree k. Since tan“*? z has only imaginary zeros and 
2iz 


s = e*'?, we deduce that, with the exception of the zero at s = 0, the zeros of Q, 
are simple and positive and they are symmetric with respect to the circle |s| = 1. 
That is, if sy is a zero of Q,, then 1/s, is also a zero. Thus it is sufficient to study 
the zeros of Q, in the disc |s| < 1. From (4), 


tan z = {1 ~2¥(-9"s'), Ils|} <1. 
0 
Using the chain rule, we obtain 


tan z= (2i)"""f,(s) (k= 1), (6) 


where 
f(s) = E(-1)"nks" = O,(s) 0 +5) 1, 


We now order the zeros {s, ,,} of f,(s) in the unit disc so that 
0= 5,9 <Sp1 <Sp9< 07° 
Then the relation between the zeros of tan z and f,(s) is 
Skom = €XP(2iz, ) m=1,2,.... (7) 
To study the zeros of f,(s), we first establish the following 
Lemma. If |s| < B~*,6'/3 < B < 2, then 
f(s) — (—s + 2*s?)| < 3*Kalsl’, 


where K, is a constant independent of k. 
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we conclude from (10) that 
lg,(u) — (—-1)"(u” —u”"~!)| > O0ask > (11) 
uniformly on any compact set. If we order the zeros of g, so that 
O = Ung < Ug <Upa < '; 


we deduce from (11) that 


UK m1 - 1, 


U,,; > 0 1<j<m-2. (12) 
The following generalization of Theorem 1 follows immediately. 


THEOREM 2. Zz ,,-1 — ik /2)In(m/(m — 1)) > 0 as k > & for any fixed m. 
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Two Equal Sums of Three Squares with Equal Products 


JOHN B. KELLY 
Department of Mathematics, Arizona State University, Tempe, AZ 85287 


1. Introduction. In this note we produce a general solution for the Diophantine 
system 


xt tei ty ty (1) 
Xi X9X3 = Vi V23 ) 
This system is mentioned as an unsolved problem in [3] where it is attributed to 
John Leech who gave the particular solution {3, 8, 26; 2, 13, 24}. 

Actually, the problem goes back to 1909, at least. It was posed as a problem in 
L’ Intermédiare des Mathématiciens by U. Bini [2]. Partial solutions were offered by 
Dubouis and Mathieu. 


2. A General Solution. E. T.‘Bell [1] showed that if x,, x, x3, ¥1, ¥2, y3 are 
integers satisfying x,x,x; = y,y>y3, then there exists an integer matrix 


STL LSE) 
T= |bloy by by 


fz, bs. 33 


such that 
3 
x,= ILt,,, 1<i<3 
13! . (2) 
y= Ilt,, lsjs3 
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1. Introduction. In this note we produce a general solution for the Diophantine 
system 


xe +xe4+x2=yi + ye + y? | (1) 
X1XX3 = ViV2Y3 


This system is mentioned as an unsolved problem in [3] where it is attributed to 
John Leech who gave the particular solution {3, 8, 26; 2, 13, 24}. 

Actually, the problem goes back to 1909, at least. It was posed as a problem in 
L’ Intermédiare des Mathématiciens by U. Bini [2]. Partial solutions were offered by 
Dubouis and Mathieu. 


2. A General Solution. E. T.‘Bell [1] showed that if x,,x,, x3, 1,2, y3 are 
integers satisfying x,x,x3; = y,y,y3, then there exists an integer matrix 


TELS LSE 
T= {boy by by 


such that 
3 
x,= ILt;,, 1<i<3 
13! . (2) 
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Insertion of (2) in the first equation in (1) yields an equation in ¢,,, t,5,t33 of the 
form 


Ajty + Agty + Azts; = 0, (3) 
where 
A, = tints — thts 
A, = tyyt3s — tintg. }- (4) 
Ag = t3it3) — tistds 


If t15, t13, 91, t23 t31, 32 are assigned arbitrary values, (3) is the equation of a 
conic in two-dimensional projective space. All rational points can be found if one 
is known. It can be checked that (¢,3, t3,,¢,.) is a solution of (3), ie., if we put 
ti, = to3, top = ba, t33 = ty, then (3) is identically satisfied. This corresponds to 
the trivial solution of (1) in which (y,, y5, y;) is a permutation of (x,, x», x3). Any 
line with rational slope through (¢,;,¢3,,¢,,) intersects the conic at another rational 
point and any rational point may be so obtained. A parametric form for a line with 
rational slope through (¢,3,t3,, t1>) is 


top = Ant + Cs) }. (5) 
C33 = Azl + a0) 


where a,,@5, @, are arbitrary integers. The line given by (5) intersects the conic 
given by (3) at a point where 


—2(@,A jto3 + @2 Antz, + @3A3l17) 


t= 
A,a? + A,as + A303 


Insertion in (5) gives 


— —afAjty; + @,A,(Ayt73 — 2ayts,) + @3A3( 323 — 2ayt,.) (6) 
‘t A,a? + A,a2 + Aza 
with similar expressions for ¢,, and f33. 

Equation (6), together with (4), gives the general solution of (3) in rationals in 
terms of the 9 integer parameters ¢,,, ¢13, t51, to3, t31, 39, @1,@, and a. If the 
denominators in (6) are removed then integral solutions, but not necessarily all 
integral solutions of (3) are obtained. All integral solutions can be put in the form 


f1= B( -atA jt, + @,A,( ato, — 2ajtz;) + @3A3( at. — 2ajt1)) 
lo. = B(a,A(ayts, — ant) — aS Apts, + a@3A3(azts, — 2ayt1)) , (7) 
[33 = B(a,A,(ayt1 — 2aztr3) + @,A,{Agty, — 2a3¢3,) — a3A sty) 


where £ is rational. The general solution of (1) is given by (7) and (2). 

It follows easily from (2) and (4) that if A, = A, = A; = 0, only trivial solutions 
of our original system are obtained. Formula (6) is meaningless when A,a? + 
A,a + A,a% = 0. A separate analysis of this case yields no solutions not already 
contained in our general solution. Indeed, only the case where there is a line of 
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solutions is non-trivial, i.e., 
A,Qyb3 + Arab 3, + A,a3t1 = Q. 

Here, the line (5) is part of the conic (3), yet substitution directly in (7) yields 

f= B(t23(.A,a? + A,a3 + A303) — 2a,(A,aytz, + Az@2ts; + Azazty)) 

= 0 = ty = bay 

so that the points of (5) cannot be obtained from (7) in this way. However, unless 
to, = 0, the point (—t3, t3), t,)) is not on the line (5), though still on our conic (3), 
and the points of (5) may be obtained from (7) by choosing:-(—tf,3, t3, t,)) as base 


point with varying a,,a, and a3. If t,, = 0, we may, instead, make a similar use of 
t,, or t,, unless they, too, are 0, in which case we have A, = A, = A, = 0. 


3. Partial Solutions. In [2], Dubouis and Mathieu presented (separately) two- 
parameter solutions of (1). A linear transformation of the parameters takes one 
solution into the other so that they are essentially equivalent. Dubouis’ solution 
corresponds, in our notation, to the matrix 

1 2b-a a-b 
a 1 2a —b 
a+b b 1 
This solution can be obtained from the general solution (7) if one chooses 
a, =a, =a, = 1 and 
1 
P = 8(a@? — ab + b*)(b ~a)(2a —b)(a — 2b) 


Zeitlin [4] gave two one-parameter families of integer solutions of (1). These 
correspond to the matrices 


1 1 2a+1 1 1 2a—-—1 
T(a)=}]1 1 3a —2|andT’(a) =] 1 1 3a+2\], 
a 6a-I1 1 a 6a-+1 1 


respectively. They can be realized from (7) when a, = a, = a; = 1 and B has the 
respective values 
1 1 


12(3a + 1)(a — 1)° and 4(3a + 1)(a — 1)(3a — 1) 


Zeitlin’s solutions cannot be obtained from those of Dubouis and Mathieu by 
specializing the parameters. In the latter solutions x? + x4 + x3 is always 5 times 
a square. This is not the case with Zeitlin’s solutions. 
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A Coloring Problem 


Ira M. GEssEt! 


Department of Mathematics, Brandeis University, Waltham, MA 02254 


Introduction. A well-known algorithm for coloring the vertices of a graph is the 
“greedy algorithm”: given a totally ordered set of colors, each vertex of the graph 
(taken in some order) is colored with the least color not already used to color an 
adjacent vertex. When applied to a path graph with at least two vertices, the 
algorithm uses either 2 or 3 colors, depending on the order in which the vertices 
are colored. 

J. Bouwer and Z. Star [1] solved the problem of counting the number of vertex 
orderings for a path of n vertices (out of n! possible vertex orderings) for which 
the greedy algorithm uses only two colors. They expressed the number of 2-color 
vertex orderings in terms of the number O(n) of 2-color vertex orderings in which 
the first vertex to be colored occurs in an odd position. They then found 
recurrences for O(2m) and O(2m + 1) that led to differential equations for the 
exponential generating functions 


ora) 2m+1 
G(t) = L Olam + Gms dD! 
and 
pom 
H(t) = r O(2m) (am)! 
(with O(0) = O(1) = 1) which they solved to obtain 
1 7 1 ; 2t ' 
aay 77t- P+ a] (1) 
and 
H(t) =e? exp| ['72G(2) ar) (2) 
0 


When a simple generating function can be found by solving a differential 
equation, it can often be found more directly by a combinatorial argument. 
Although (1) and (2) don’t look particularly simple, (1) can be rewritten as 

sinh ¢ 
cosh t + ¢t sinht 


-E ae /[t- Berg (3) 


mug (2m + 1)! 


G(t) 


‘Partially supported by NSF grant DMS-8703600. 
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tions of the previous paragraph imply that a permutation 7 of 1,2,...,n is a 
coloring permutation if and only if the following two conditions are satisfied: 

(i) Every hook in the hook factorization of 7 has even length. 

(ii) If n is even, then 7 has an empty tail. 
Thus by (5), the number of coloring permutations whose hooks have lengths 
2k,,...,2k, and whose tail has length 2m + 1 is 


2k, t+ +2k, + 2m +1 
2k,,..-,2k,,2m +1 
As before, multiplying (7) by ¢7*%1t 7" t74+2m*l (Qk, + +++ +2k.4+ 2m + 1)! 


and summing on k,,...,k, > 1, m > 0, r > 0 yields (3), and (4) can be derived 
similarly. 


(2k, — 1) +++ (2k, — 1). (7) 
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Norm Inequalities for the Powers of a Matrix* 


MAN KAM KwonG 
Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439 


A. ZETTL 
Department of Mathematical Sciences, Northern Illinois University, DeKalb, IL 60115 


The inequality 
|| Axil? < Kl xl | A2xll (1) 


has been studied extensively by many authors for operators A acting on a Banach 
or Hilbert space. Of particular interest are the cases when A is the differentiation 
operator Ax = x’ on the space of differentiable functions x defined on (—~, ©) or 
(0,2), or the forward difference operator A(x,) = (x,,, —x,) on the space of 
infinite sequences x = {x}. 

The basic questions are as follows: 


1. Given an operator A acting on a normed space X, does there exist a 
constant K,0 < K < ™, such that (1) holds for all x in X? 


*This work was supported by the Applied Mathematical Sciences subprogram of the Office of 
Energy Research, U.S. Department of Energy, under contract W-31-109-Eng-38. 
Fic. 1. Graph of K vs. b with h = 2 
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The inequality 
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has been studied extensively by many authors for operators A acting on a Banach 
or Hilbert space. Of particular interest are the cases when A is the differentiation 
operator Ax = x’ on the space of differentiable functions x defined on (—~,) or 
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2. If the answer to the first question is yes, what is the best, i.e., the smallest 
such K? 


In the case of a finite-dimensional normed vector space, question 1 was 
answered by Ljubic [3] who found that (1) holds with a finite K for all x in X if 
and only 


ker( A) = ker(A’), (2) 


where ker(L) = {x : Lx = 0} denotes the kernel of the operator L. In the simple 
case where 4 is a 2 X 2 matrix, (2) is equivalent to the fact that at most one 
eigenvalue of A is zero, which holds if and only if either the trace or the 
determinant is not zero. 

The reader is referred to the survey paper [1] for further information and 
references on inequality (1). 

This note reports on our effort in finding the best constant K = K(A) for 2 x 2 
real matrices acting on the Euclidean vector space R*. One of our purposes is to 
point out that even for 2 X 2 matrices, there are still non-trivial problems worth 
studying. The result we obtained is interesting. Indeed, we are rather surprised to 
find that K(A) is not given by one single formula and that it is constant for certain 
ranges of the elements of A. 

But the more important purpose of the note is to draw attention to our 
methodology, the use of symbolic manipulation software. Many people fear that 
too much reliance on the computer may suppress the need to think and reason, 
leading to a possible suffocation of creativity. We believe, however, that when used 
with discretion, computer software can enhance rather than hinder our thinking 
process. The work reported here is a good example. 

The problem of determining K(A) for a matrix has been around for some time. 
Although there is an elementary and straightforward line of attack, the computa- 
tion anticipated is huge enough to discourage anyone from pursuing that approach. 
Very little progress had been made before we enlisted the help of MAPLE to 
successfully solve the problem for 2 X 2 matrices. The computation needed for the 
most general case is too much even for the computer equipment available to us. 

MAPLE is symbolic manipulation software developed at the Department of 
Computer Science, University of Waterloo, Ontario, Canada. The version we used 
is 4.1 (released in May 1987) installed on a SUN Workstation model 3 /60. What it 
can carry out is exact algebraic (as opposed to numerical) computation, involving 
symbols used as unknowns. Besides ordinary algebraic operations like multiplying 
polynomials or rational fractions, it can handle trigonometric and other elementary 
functions, solve algebraic equations, differentiate and integrate given functions, all 
with amazing speed and accuracy. 

MAPLE did not take over the entire reasoning process. We used MAPLE first 
to investigate special test matrices. Then after successfully determining K(A) for a 
few matrices, we mechanized the procedure. Thus we were able to efficiently 
repeat our experiments with a large number of examples. From the output we 
detected patterns and formulated conjectures which were later confirmed theoreti- 
cally. 

The role played by MAPLE in this work was in extending our computational 
capability. Equipped with added confidence, we ventured into and explored 
avenues that previously appeared inaccessible. In the absence of MAPLE the 
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problem would probably have remained unsolved for a while longer. On the other 
hand, human intervention is essential in reformulating the problem to be suitable 
for MAPLE, in interpreting the output, in stating conjectures, and finally in 
proving those conjectures. Although a symbolic manipulation program frees us 
from some technical routines, it has its limitations. Enough problems always 
remain to challenge our ingenuity. An example is the higher dimensional analog of 
our problem. 

More details of the involvement of MAPLE in this work are presented in [2]. 

We first show that the general case can be reduced to two special ones. These 
special cases are then solved completely. 

Clearly K(A) = 0 if and only if A = 0. Let 


O( A, x) | Axl f € R*, A*x #0 (3) 
A, i rrr Or X 3 Xx . 
|| x|| || A? 
From (1), we have 
K(A) = max{Q(A,x):x © R’, A’x # 0}. (4) 


Lemma 1. Assume that A satisfies (2). Then for any orthogonal matrix S and any 
a € R, a # 0, we have 
K(A) = K(a@SAS~'). (5) 
Proof. The invariance with respect to a is clear from the homogeneous nature 
of (1). The invariance with respect to orthogonal similarity transformations follows 
directly from the characterization (4) and the norm preserving properties of 
orthogonal] matrices in Euclidean vector spaces. 


Although Lemma 1 sounds obvious once it is stated, it has not been noticed by 
earlier authors. We suspected its validity only after scrutinizing many special cases. 
Some simple consequences of Lemma 1 are stated in the next lemma. 


LEMMA 2. 
1. If A~' exists, then K(A) = K(A“'). 
2. Assume that A satisfies (2). Then 


K((t 2))=«((§ 4))=«((¢ 8) 
=K((_2 a) (2% 22)) (6) 


Proof. Part 1 is obvious if,we rewrite (3) in the following form which is 
symmetric with respect to A and A7!: 


lIyll? 
A,y) = —————. fory = Ax E R’, Ay # 0. 7 
C49) = Taya ” 
Part 2 follows by applying Lemma 1 repeatedly with 
_{ 0 1 1 O 0 1 
s= (2 oblo S}(D o}) (8) 


A more useful consequence of Lemma 1 is that the most general matrix A can 
be reduced to one with equal diagonal elements. 
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THEOREM 1. Given any matrix A there exists an orthogonal matrix S such that 


SAS~' has a constant diagonal, i.e., SAS~' has the form ( \ “J. 


Proof. Rather than using an orthogonal matrix, we can use some constant 
multiple of the form T = (_, . ) Note that T/ ¥1 + B” is orthogonal. Then 


B’d+B(bt+c)+a —b*c + B(d-—a) +1 
—B°b+B(d-a)+c B’a-B(bt+c)+d 
Setting the diagonal elements in (9) equal to each other yields a quadratic equation 
in B whose discriminant D is 

D=4((b +c)’ + (a—d)’)>0. (10) 


Hence a real root exists. 
Lemmas 1 and 3 reduce the problem of determining K(A) to one of the two 


cases A = ( . ° and A = (? °). Let us take care of the former case first. 


TAT' = . (9) 


THEOREM 2. For any b,c € R, b #0, and c # 0, 
0 b)\\_ b| \c 
K((? Al ~ max{ = 15 }. (11) 


Proof. Note that condition (2) is satisfied under the hypotheses of the theorem. 
For vectors of the form 


Cc 


9 


x= ( 1), (12) 
let 
(b2x? + ¢?)° 
(b2c?)(1 +. x2)” 
If b? =c*, then g(x) = 1 for all x in R; hence K(A) = 1, and the theorem is 
proved. If b? #c?, then x = 0 is the only real critical point, the other roots of 


q(x) = 0 being +ic/b. Since q(0) = c?/b* and lim, ,,, q(x) = b*/c?, it follows 
that 


q(x) = Q(A,x) = (13) 


b? c¢? 

K(A) = dae = max| 5}, (14) 
c“ b 

completing the proof of the theorem. 

Observe that Theorem 2 shows that every K > 1 is realized as a best constant in 
inequality (1). That K cannot be less than 1 for any A ¥ 0 satisfying (2) can be 
seen as follows: Let ||.A|| = max{||_Ax||: ||x|| = 1} denote the usual operator norm of 
A induced by the Euclidean vector norm. Then 


| Axl < [|All lxl| for all x. (15) 


|| Al || Ax||, By compactness, there exists an x, such that 
1. If furthermore A’x, # 0, then 


| Ax all” . Axl] 
|A*xoll ~ |All 


In particular, ||A7x|| < 
|X oll = 1 and || Axgll = 


K(A) > (16) 
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If A’x, = 0, using a sequence of vectors approaching x, will yield the same 
conclusion. 
The remaining case is more intriguing. 


THEOREM 3. For b,h € R with h > 0, let 


1 b 
fd) =K((, 1, 7)}: (17) 
This function is symmetric about the point b = —h/2, i.e., 
h h 
f{-b- 5) =1,(6 - 5). (18) 
Let 
hy (1-1) (1 +r) 
r= (1+ 5] , b= b= TFT. (19) 


We have, forh > 0, and b > —h/2, 


r b,<b<b, 


< 
b) = |1-—b(b+h)| h 20 
fix) (+h boy oy and bp eb. (20) 
1+ 5 2 


Before discussing the proof we illustrate the theorem by sketching the graph of 


f(b): 


value of K 


—-11 -9 -7 -5 -3 -1 1 3 5 7 9 
Fic. 1. Graph of K vs. b with h = 2. 


Proof. We do not have a simple and short proof for formula (20). The only one 
we have is obtained with the help of MAPLE. Since the computer output is 23 
pages, we can give only a brief outline here. We refer the readers to [2] for more 
details. 
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We again used normalized vectors of the form (12). The corresponding function 
to be maximized is then 


2 

((1 + bx) + (b+h+x)’) 
(1 +x7)(1+ b(b +h) + 2bx)? + (x + (D+ A)(2 + dbx))’ 
All that remains to be done is to obtain the real roots of qg'(x) = 0, compute the 
values of q(x) at each of these critical points, as well as at x = ©, and then pick 
out the largest one. We accomplished these using MAPLE. For b #0, q‘(x) 


reduces to a rational function with a polynomial numerator of degree 6. Two of its 
factors are 


. (21) 


q(x) = 


(bx +1) and (x+bd+hy). (22) 


When b = 0, the numerator of q‘(x) is only of degree 5, and the first factor in (22) 
is absent. Another factor is 


(1 + b?)x* + (4b + 2h)x + (14+ 57 + 2bh +h’), (23) 
which can be ignored since it has only complex roots. The remaining factor 
(2b°h + 3b*h? + 8b? + 6bh + bh? + h?)x? + (8b7h + Bbh? + 2h°?)x 
+(2b°h — 8b? + 3b*h? + bh? — 10bh — 3h”) (24) 


may or may not furnish real roots, depending on the sign of its determinant. More 
precisely, in the interval [b,,b,] and its symmetric partner, the roots are real; 
otherwise they are complex. In the former case, the maximum of q(x) is attained 
at one of these two roots. This maximal value, 1 + h?/4, is independent of b in the 
entire interval. It is interesting to note here that MAPLE was unable to simplify 
the expression for g(x) without external help when one of these roots was 
substituted for x. In the latter case, the maximum is attained at one of the two 
roots given by (22). 

The fact that the maximum is not attained at x = ~ can be seen as follows. If 
the maximum is attained at x = ©, then y = 0 will be a critical point for the 


function q(y) obtained by using x = ( ; instead of (12). This possibility can be 
ruled out with the help of MAPLE. 
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With Supercalculators 
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1. A sobering experience. The ever easier access to supercalculators enables 
instructors and students to delegate lengthy calculations to the machines while 
spending more time thinking about concepts. Yet supercalculators also introduce 
another phenomenon: large, inconspicuous numerical errors. For example, all 
digits may be wrong in all the entries of the computed inverse, A’ = A~', of a 
nearly singular matrix A, even though A‘: A = to all twelve displayed digits. 
The analysis of such errors appears in detail in numerical analysis texts [3, 14] and 
briefly in Hewlett-Packard’s manuals [5, 6, 7], and certain applications do not 
entail or do not suffer from such errors [3, p. 124]. However, most students who 
use supercalculators never study numerical analysis; consequently, they may need 
to learn at least that each digital algorithm fails in certain situations, and they 
should be alerted to the possibility of nonsense in displayed results. 


EXAMPLE 1. Consider the following system (abbreviated A¥ = b in the sequel): 


888,445x + 887,112y = 1, 
887,112x + 885,78ly = 0. 


Even with the value of the determinant (one) given to them, no one in a class of 
twenty juniors and seniors, all majoring in mathematics or computer science, some 
with supercalculators, produced any values of x and y anywhere close to the 
solution. 

Such a classroom experience demonstrates that, while supercalculators have a 
key purporting to solve linear systems, students have problems choosing the best 
numerical method and assessing its accuracy. This phenomenon—the absence of 
any infallible algorithm—constitutes a new development. For, hitherto, the keys on 
calculators represented only real, complex, direct, or inverse elementary functions 
and returned results accurate to all but perhaps the last displayed digit [5, p. 179]. 
Now, some keys perform operations with matrices, and with these, all digits may be 
wrong. 

The next section shows the extent of the problem, and the last section suggests 
explanations that may help students think about concepts while using supercalcula- 
tors. 


2. Unsettling experiments. Widely different algorithms applied to the system in 
Example 1 produce wildly different results, as the following experiments show. 
(Since the last few displayed digits may depend upon the particular implementa- 
tion of each algorithm, readers may request from the author listings of the 
programs used below.) 
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2.1. Supercalculator keys. On Hewlett-Packard’s HP-28 (C & S) machines, the 
division key (+) also solves linear systems. For the system in Example 1, it yields 


x = 1,279,847.88527 and y = —1,281,771.0215. 


(Then the experienced user may apply iterative refinements [3, p. 126; 5, p. 120; 
6, p. 111; 7, p. 68], which wander in the range 884,000 <x < 887,000 and 
— 888,000 < y < —886,000.) 


2.2. Gaussian elimination. Programmed on the HP-28, Gaussian elimination 
and backward substitution gives (without pivoting, without scaling) 


x = 997,001.502623 and y = —998,499.625748. 


(Hence, iterative refinements behave as described in the preceeding paragraph.) 


2.3. Singular value decomposition. Also programmed on the HP-28, the singular 
value decomposition, with annihilation of the smaller singular value [10, p. 56-57] 
produces 


x = 2.82236169408 x 10°’ and y = 2.81812709528 x 107’ 


This result accurately represents the orthogonal projection of the solution on the 
eigenspace of A with the larger eigenvalue, ,.., = 1,774,226.00002, spanned by 
Vax ~ (0.707637442, 0.706575721)", and ignores (as it should) the other, numeri- 
cally unstable projection on the eigenspace with the smaller eigenvalue, A,.. = 
5.63513515643 x 107’, spanned by Vz, ~ (0.706575721, — 0.707637442)’. This 
amounts to one accurate coordinate instead of two inaccurate ones. To check these 
results, observe that the symmetric matrix A is well conditioned for the eigenvalue 
problem because it may be diagonalized by an orthonormal change of coordinates 


[15, p. 87]. Also, observe that 


A = A = 887,112.5 x (| i): 
with eigenvalues 1,774,225 and 0, and eigenvectors (1/V2,1/ V2)’ and 
(1/ ¥2,- 1/ v2)". 


2.4. Eigenvalue diagonalization. Since the matrix A in Example 1 is symmetric 
(and positive definite), it has a matrix 0 = (V¥,,,,, V,,;,) Of orthonormal eigenvec- 
tors. Programming Jacobi’s method [14, p. 341-344] to compute ©, and invertin 
the (positive) diagonal matrix QAO, all on the HP-28, gives X = 0(0740)~'O"b 
with 

x = 885,957 898511 and y = — 887,289.164320. 


2.5. Cramer’s rule on the HP-28. The HP-28 correctly evaluates the determinant 
(«<A DET)> returns 1) and the transpose (((A TRN))); hence changing two signs 
yields A~', in the first column of which appears, finally, the solution: 


x = 885,781 and y = — 887,112. 
2.6. Question. What might instructors and textbooks tell students, who carefully 


copy down their supercalculators’ results without ever suspecting that they are 
copying random digits? The next section suggests answers at various levels. 


1991] THE TEACHING OF MATHEMATICS 541 


3. Close encounters of the singular kind. Though the matrix A in Example 1 
has only six-digit integral entries, the following phenomenon explains why compu- 
tations carrying twice as many digits must err: there exists a singular matrix S that 
lies so close to A that the supercalculator cannot distinguish A from S. Thus, 
computing xX = A~'b through the command bA + amounts to the nonsense bS + 
[5, p. 102]. 


EXAMPLE 2. Retaining at most twelve digits from each input, the HP-28 does 
not distinguish A from the singular matrix 


g . (888445 — 13548450 887112 — 1/3548450 
~~ | 887112 — 1/3548450 885781 — 1/3548450 


~ | 888444 9999997 887111.9999997 
887111.9999997 885780.9999997 } ° 


In general, the same phenomenon occurs if the matrix A lies too close to the 
set of singular matrices, D. To visualize this situation, restrict the discussion to the 
space S(2,R) of symmetric 2 X 2 real matrices, which has dimension three and 
may be pictured as R?: 

S(2,R) = {s EM,,.,(R): S = (*, "3 u,v,w R} = R?, 
In that space, the singular matrices form the variety D where the determinant 
uv — w* vanishes. The change of coordinates u:=r+s and v =r —s reveals 
that D is the right circular cone about the r-axis, with vertex at 0 and equation 
7? = 5? + w?, 

Thus, computations will fail if the distance from the point A to the cone D is 
less than the relative accuracy of the supercalculator. The next section explains 
how to calculate this distance from other quantities. 


4. Condition numbers. Consider a linear system AX = b and denote by r the 
result returned by a supercalculator attempting to solve for x. For convenience, let 
Ax = r— x and Ab = Ar — b = A(X + AX) — b = A(AX). One method to esti- 
mate the discrepancy Ax between the result r and the solution x, widely applied by 
students and considered as a benchmark by some mathematicians [4, p. 1341; 
12, p. 992; 13, p. 395], consists of substituting r for x and evaluating the “residual” 
Ab = Ar — b. A standard argument [14, p. 175—183] reveals the worthlessness of 
the method, by means of “subordinate norms.”’ 


DEFINITION 1. Let |||| represent any norm on R”. Its subordinate norm, also 
denoted by ||||, is defined on the matrices in M,,,.,(R) by 
|All = max{||Avll: ¥ © R”, |l¥ll = 1} = max{I|AvI|/|vil: ¥ eR” \ {0}. 
A subordinate norm measures the maximum amount by which a matrix may 
stretch any nonzero vector x: 
|| Axl 


x! 


In the particular case AX = b, this inequality gives ||b|| = || Axl < || All - |fxl|. If, 
moreover, A has an inverse, A~', then the same inequality applied to AX = 


|| Ax] = xl < max{||AvI|/l¥ll: ¥ © R” \ (0}} - [xl] = ILAll - IIL. 
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Then let X, = A~'y, and choose a matrix F € M,,.,(R) such that FX, = Ax, and 
IF ll = 1/|| A? (Observe that ||FXoll = ||AXoll = ll¥oll = ILA~*YpII/I|A7 "Il = 
IX gI|/|A~ "I; consequently, ||F|| = 1/||A~'|l on Span(X,). Then construct F by 
finitely many such induction steps as those in the standard proof of the Hahn- 
Banach Theorem [1, p. 149-—150].) Finally, let S := A — F. Then S € D because 
S¥, = AX, — FX, =0 and ¥,=A7~'y, #0. Also, ||A — S| =||Fll = 1/|Am HL 
Therefore, d(A, D) = 1/||A7'|l. 


EXAMPLE 4. With the “maximum” norm on R? and with A as in the previous 
examples, d(A, D) = 1/\|A7'|| = 1/1,775,557. Incidentally, notice that 
1/1,775,557 < 1/1,774,225 = 2/3,548,450 = ||A — S||, which means that the sin- 
gular matrix S in Example 2 is not quite the one closest to A (with distances 
measured by the row-sum norm). 


5. Conclusions for students. Unfortunately, there exists no general algorithm to 
estimate errors [9]. Therefore, the use of a supercalculator, and the scant discus- 
sion in its manual about the possible ill-condition of the matrix under considera- 
tion, demand an understanding of why and how computations may fail (in other 
words, theory). At the least, a manual (or an instructor) could recommend the 
following simple test to reveal the possibility of hidden computational failures: 
compute the logarithm of the condition number, Log,,.(cond(A)). (On the HP-28, 
run < A RNRM LOG A INV RNRM LOG + > , replacing Log,,(\l.All - | 47 "ID 
by Log,o||.All + Log,oll-A~ ‘|| to avoid overflow.) If the result exceeds the number of 
digits carried by the supercalculator, then the computation of X = A~'b through 
bA + might be meaningless, as explained in Example 3. For instance, for the 
matrix A in the preceding examples, Log,,(cond(A)) returns 12.6585012324, which 
exceeds 12. Observe that the same possibility of failure remains even if the large 
value of the test just described arises from errors in the computation of A7! 
(because then an example of a failure has already occurred). 
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Gram-Schmidt Orthogonalization by Gauss Elimination 
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Department of Mathematics and Statistics, University of Missouri, Rolla, MO 65401 


1. Introduction. In classroom work, attempts by students to use the Gram- 
Schmidt process typically lead to messy and unorganized computations. Little 
effort is made to connect this process to the already familiar process of Gauss 
elimination. In what follows, we show how to use Gauss elimination to orthogonal- 
ize a given set of vectors. While it is not numerically preferable to existing 
algorithms, our algorithm is simple and has some surprisingly happy side effects. 
These are discussed in the conclusion. 


2. An Example. Suppose 


0 —2 1 
1 3 1 
a=] [> @2:= oP = 14 
1 1 5 
Let A be the 4 X 3 matrix containing a, as its kth column. Then 
2 4 6 
A'A=|]4 14 6}. 
6 6 28 


Applying Gauss elimination on [ A‘A|A’], we have 


2 4 6 0 1 0 1 2 4 6 0 1 0O 
4 14 6 |-2 3 0 1/7] O 6 -6!] -2 #1 0 -1 
6 6 28 | 1 1 1 5 0 -6 10} 1 -2 #1 
2 4 6 0 1 0O 1 
—-10 6 -6|] -2 1 0 -1 
0 O 4; -1 -1 1 1 
The rows on the right side of this last matrix are mutually orthogonal, and the 
squares of the lengths of these rows appear on the diagonal of the left side of the 
matrix. The readers can check for themselves that these rows are precisely 
the vectors produced by using Gram-Schmidt orthogonalization (without normaliz- 
ing) on a@,,a,, and a3. Note that the only row operation used in our process is the 
subtraction from one row of a scalar multiple of a row above it. We do not use the 
interchange of two rows or the multiplication of a row by a scalar. 
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Schmidt process typically lead to messy and unorganized computations. Little 
effort is made to connect this process to the already familiar process of Gauss 
elimination. In what follows, we show how to use Gauss elimination to orthogonal- 
ize a given set of vectors. While it is not numerically preferable to existing 
algorithms, our algorithm is simple and has some surprisingly happy side effects. 
These are discussed in the conclusion. 


2. An Example. Suppose 


0 —2 1 
1 3 1 
1 1 5 
Let A be the 4 X 3 matrix containing a, as its kth column. Then 
2 4 6 
A'A=14 14 6 
6 6 28 


Applying Gauss elimination on [A‘A|A7], we have 


2 4 6 0 1 0 1 2 4 6 0 1 0O 
4 14 6 |-2 3 0 1/7] O 6 -6!] -2 #1 0 -1 
6 6 28 | 1 1 1 5 0 -6 10} 1 -2 1 
2 4 6 0 1 0O 1 
—>-10 6 -6|] -2 1 0 -1 
0 0 4; -1 -1 1 1 
The rows on the right side of this last matrix are mutually orthogonal, and the 
squares of the lengths of these rows appear on the diagonal of the left side of the 
matrix. The readers can check for themselves that these rows are precisely 
the vectors produced by using Gram-Schmidt orthogonalization (without normaliz- 
ing) on a,,a,, and a3. Note that the only row operation used in our process is the 
subtraction from one row of a scalar multiple of a row above it. We do not use the 
interchange of two rows or the multiplication of a row by a scalar. 
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3. The justification. Let m and n be positive integers, let R” be the Euclidean 
m-space of real vectors, and let a,,...,a, be n linearly independent vectors in R”. 
Define A to be the m Xn matrix whose jth column is a,, and let F(A) be the 
column space of A. 

The linear independence of the columns of A implies that A’A is positive 
definite. It follows that Gauss elimination can be carried out on A‘4 without row 
exchanges [1, p. 413], [3, p. 331]. In fact, the only row operation used is that of 
subtracting from one row a scalar multiple of a row above it. Hence, there are 
unique n Xn matrices, L and U, such that 


ATA = LU, (1) 


L being a lower triangular matrix with 1’s on the diagonal and U being an upper 
triangular, invertible matrix. Define 


O:=A(L7)". 

Using (1), it follows that O70 = U(L~')". Since O7Q is symmetric and U(L~!)? 
is upper triangular, O'O must be a diagonal matrix, 1e., the columns of O are 
orthogonal. Since QO? = L~ 14’, it follows that Q? will be computed automatically 
if Gauss elimination is done on the n X (n + m) matrix, [A“A|A’]. Specifically, 
Gauss elimination turns [.A74|A‘] into [U|O7]. 

Actually, more than this happens. Recall that, since the columns of A are 
linearly independent, there are two unique matrices, Q and R, such that 


A =QR, (2) 


Q being an m Xn matrix with orthonormal columns and R being an upper 
triangular matrix with positive entries on its diagonal. The columns of Q are 
precisely the vectors obtained from doing the Gram-Schmidt process on the 
columns of A [1, p. 237], [3, p. 175]. Defining 


R:=L', 
it follows at once from our definition of Q that 
A = OR. (3) 
Comparing (2) and (3) and defining 
D:= Q"Q, 


it is immediate that 
Q=QOD-'? and R=D!R. 


Since U = DL’, the entries on the diagonals of D and U are the same, i.e., the 
squares of the lengths of the columns of O are the diagonal entries of U. (See 
[3, p. 196].) Since the entries below the diagonal in L are the negatives of the 
multipliers used in Gauss elimination as A‘4 is turned into U, and since R= L’, 
the QR factorization of A is obtained with no extra work. In our earlier example, 


i oO 0 
L=;2 #1 Oj. 


3-1 I 


Hence, as a by-product of our method of doing Gram-Schmidt orthogonalization 


546 PURSELL AND TRIMBLE [June-July 


on the columns of A, we have 


0 -2 1 0 -2 -1 12 3 
A= 1 3 1;]_]1 1 -1 01 -1 
0 0 1 0 0 1 0 0 1 
1 1 5 1 -1 1 
Our process can still be used even if the vectors, a,,...,a,, are not linearly 


independent. In doing Gauss elimination on a matrix, it is normally necessary to 
switch rows if 0 should appear on the diagonal in a place where a non-zero pivot is 
needed. However, as the following theorem shows, if the matrix is positive 
semidefinite, it is never necessary to make this switch. 


THEOREM. Let B be a positive semidefinite matrix. Then there are a lower 
triangular, invertible matrix, L, and an upper triangular matrix, U, such that 


B=LU. (*) 


Further, if the diagonal entry of the ith row of U is 0, then all of the entries in that 
row of U are 0. 


Proof. Suppose 0 appears as the diagonal entry on the kth row of the matrix 
that is the result of doing Gauss elimination on the first kK — 1 rows of B and that 
no interchange of rows has yet taken place in doing this Gauss elimination. Then 
B=L,B8B,, where L, is the identity matrix, except that entries below its diagonal 
in its first kK — 1 columns need not be 0, and the entries of B, below its diagonal in 
its first k — 1 columns are all 0. (If k = 1, then L, is the identity matrix and 
B, = B.) Further, the diagonal entry in the kth row of B, is 0. 

Now (L,')’ is the identity matrix except that entries to the right of its diagonal 
in its first k — 1 rows need not be 0. The definition of matrix multiplication then 
implies that the kth rows of B, and B,(L;,')’ are the same. In particular, the kth 
diagonal entry of B,(L;,')" is 0. 

Since B,(L;, 1)" = L,'B(L, 1)", it follows that B,(L,')" is positive semidefi- 
nite. So, there is a matrix, C, such that B,(L,')’ = C'C, [3, pp. 339-340]. Let e, 
be the column vector with the same length as a column of B, with 1 in its kth 
position, and with 0’s elsewhere. Then the kth diagonal entry of B,(L;,')" is 
e/(C’C)e,. From above, this is 0. Hence, Ce, = 0. Since Ce, is the kth column of 
C, it follows from the definition of matrix multiplication that the kth row of C’C 
is 0. Hence, the same is true of the Ath row of B,. 

Since B,(L;')" is symmetric, its Ath column must also be 0. But the definition 
of matrix multiplication and the shapes of B, and (L,')" imply that the entries in 
B, and B,(L;')" in their kth columns below the diagonal must be the same, i.e., 
the entries in B, in its kth column below its diagonal are all 0. Consequently, 
there is no need to do Gauss elimination on this column. The Gauss algorithm now 
proceeds to the (kK + 1)st column with L, = L,,,, B, = B,,,,and B = L,,,B,41. 
Continuing this process, we conclude that (*) holds and that, if the diagonal entry 
of the ith row of U is 0, then the ith row itself is 0. 0 


Now A?A is always positive semidefinite, so the previous theorem shows that (1) 
always holds, and the proof shows that (1) can be obtained by Gauss elimination 
without any row exchanges. The definition of Q below (1) does not depend on the 
existence of U~', and in all cases the columns of O form an orthogonal set of 
vectors. The way in which Gauss elimination is done shows that the ith column of 
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Applying Gauss elimination on A’, we discover that, if 


—] 0 
—] 0 

b,:=| -3 and b,:= | 4 |, 
1 0 
0 1 

then {b,, b,} is a basis for .“(A‘). Defining 

0 --2 1 -1 O 
1 3 1 -1 O 
B:=1|0 0 1 -3 4 
1 1 5 1 O 
0 0 —-4 0 1 


and using Gauss elimination on [B’B|B*], we have 


2 4 #6 0 0 0 1 0 1 0 
4 14 6 0 0 |-2 3 0 1 0 
6 6 44 0 0 1 1 1 5 -4 
0 OO O 12 -12 |-1 -1 -3 1 0 
0 OO O -12 17 0 0 4 0O 1 
2 4 66 lO 0 0 1 0 1 0 
0 6 -6 QO 0 |-2 1 0 -1 0 
>-!10 0 20 £0 0 {|-1 -1 1 1 —-4/|. 
0 O 0 12 -12 j-1 - —3 1 0 
0 O 0 O 5 |-1 -1 1 1 1 
So, the desired orthogonal basis is 
0 —2 —] 
141 1 1 1 —1 
W= FO) i= 7) OF, a3:= =e] 1), 
v2 14 6 | _4 v20} , 
0 0 —4 
—] —] 
1 —1 1)\-1 
D4 —3 » 45.= 1 
V2} 3 el 
0 1 


6. Conclusion. We would like, to make it clear we are not proposing that the 
current method ({1, p. 229], [3, p. 172]) of presenting the Gram-Schmidt process be 
abandoned; students have a real need to understand the geometry of this process. 
We are suggesting that our method, presented above, of hand-calculating the 
vectors is superior to current hand-calculating methods. Its value does not lie in a 
shortening of the students’ arithmetic work nor in greater computer precision. 
(Our calculations suggest that the number of multiplications is actually increased 
by O(n°/6). And while our method is equivalent to the modified Gram-Schmidt 
process ([1, p. 233], [2], [3, p. 181]), the increased precision of that modified process 
is lost because of the necessity of finding pivots in doing the LU-factorization of 
A‘A.) Instead, the value of our method lies in the organized format that Gauss 
elimination naturally brings to a problem. 
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Perhaps more importantly, by being exposed to our method, the students gain a 
much deeper understanding and appreciation of the uses of Gauss elimination. 
Our method unifies the ideas of two apparently unconnected processes and 
demonstrates the power of viewing matrix transformations as factorizations. 
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On Finite Groups and Finite Fields 


J. D. Rep 
Department of Mathematics, Wesleyan University, Middletown, CT 06457 


Underlying one of the proofs, perhaps the most elementary one, of the fact that 
the multiplicative group of nonzero elements in a finite field is cyclic is an idea 
that we think perhaps deserves to be made more explicit. This can be done in 
terms of certain functions defined by finite groups, functions that might have some 
interest in their own right (see, for example, the Remarks below). Furthermore this 
approach to cyclic groups suggests an analogous treatment of a more purely field 
theoretic problem, namely the derivation of a formula for the number of irre- 
ducible polynomials of degree n over a finite field of cardinality g. While all this is 
essentially just a reformulation of standard results (cf. [1], pp. 259-261, or [3], p. 4, 
for example), we hope nonetheless that some readers will find this point of view 
interesting. 


Groups. Suppose then that G is any group of order n. For elements a of G we 
denote by 0(a) the order of a and by (a) the cyclic subgroup of G generated by a. 
Further, let #(G, d) denote the number of cyclic subgroups of G of order d. Now 
define two equivalence relations on G as follows: 


a ~ b, if and only if O(a) = 0(b) 

a=b_ if and only if (a) = (b> 
Then a = b implies a ~ b so each ~ class is a union of = classes, and there is 
(potentially) one ~ class for each divisor d of n. Since each cyclic group of order 


d has ¢(d) generators, where ¢ is the Euler function, it follows that the ~ class 
corresponding to d contains #(G, d)p(d) elements. Hence we have 


n= ) #(G,d)o(d) (1) 
d\n 


for any finite group G of order n. In particular, taking G to be the cyclic group of 
order n we have Gauss’s formula 


n= di o(d) (2) 


d\n 
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Perhaps more importantly, by being exposed to our method, the students gain a 
much deeper understanding and appreciation of the uses of Gauss elimination. 
Our method unifies the ideas of two apparently unconnected processes and 
demonstrates the power of viewing matrix transformations as factorizations. 
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Underlying one of the proofs, perhaps the most elementary one, of the fact that 
the multiplicative group of nonzero elements in a finite field is cyclic is an idea 
that we think perhaps deserves to be made more explicit. This can be done in 
terms of certain functions defined by finite groups, functions that might have some 
interest in their own right (see, for example, the Remarks below). Furthermore this 
approach to cyclic groups suggests an analogous treatment of a more purely field 
theoretic problem, namely the derivation of a formula for the number of irre- 
ducible polynomials of degree n over a finite field of cardinality g. While all this is 
essentially just a reformulation of standard results (cf. [1], pp. 259-261, or [3], p. 4, 
for example), we hope nonetheless that some readers will find this point of view 
interesting. 


Groups. Suppose then that G is any group of order n. For elements a of G we 
denote by 0(a) the order of a and by (a) the cyclic subgroup of G generated by a. 
Further, let #(G, d) denote the number of cyclic subgroups of G of order d. Now 
define two equivalence relations on G as follows: 


a ~ b, if and only if O(a) = 0(b) 

a=b_ if and only if (a) = (b> 
Then a = b implies a ~ b so each ~ class is a union of = classes, and there is 
(potentially) one ~ class for each divisor d of n. Since each cyclic group of order 


d has g(d) generators, where ¢ is the Euler function, it follows that the ~ class 
corresponding to d contains #(G, d)e(d) elements. Hence we have 


n= )#(G,d)o(d) (1) 
d\n 


for any finite group G of order n. In particular, taking G to be the cyclic group of 
order n we have Gauss’s formula 


n= di o(d) (2) 


d\n 
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since #(G, d) = 1 for all d in this case. Subtracting (1) from (2) gives 


0= D[1- #(G,4)] (a). (3) 
d|n 

The relations (1) and (3) are what we had in mind. As immediate consequences 
we have the two standard results: 


THEOREM. G is cyclic if and only if #(G, d) < 1 for all d. 


Proof. If G is cyclic then #(G, d) = 1 for all d. Conversely if #(G, d) < 1 for 
all d then all coefficients of the y(d) in (3) are non-negative. Since the sum is zero, 
all coefficients must be zero. In particular, #(G, n) = 1, i.e., G is cyclic. D 


CoROLLARY. Any finite subgroup of the multiplicative group of a field is cyclic. 


Proof. lf G is such a subgroup the existence of two distinct cyclic subgroups of 
G of order d would lead to more than d solutions to the equation X¢ — 1 = 0 in 
the field, which is impossible, i.e., #(G,d) < 1 for all d. 


Remarks. It may be interesting to note that, for an arbitrary group G of order 
n, we obtain a parity of sorts between divisors d of n for which there is more than 
one cyclic subgroup of G of order d and those for which there are none. This is 
expressed in the following equation which comes directly from (3): 


u ¢(d4)= Ye [A#(G,d) - 1]e(4). (4) 
#(G, d)=0 #(G,d)>1 
Secondly, from the structure theory of finite abelian groups it is clear that, for 


any abelian group, the function #(G, d) is a multiplicative function, i.e., that if d 
and d’ are divisors of n and gcd(d, d') = 1, then 


#(G,dd') = #(G,d)#(G,d’). 


Fields. For a second application of this idea, let F be a finite field of cardinality 
q and let E be the extension of F of degree n. For a in E let f,(x) be the 
polynomial of least degree with coefficients in F having a as root and denote by 
deg f(x) the degree of this polynomial. Finally, let N(q, d) denote the number of 
irreducible polynomials of degree d with coefficients in F. This depends only on q 
and d. Much as before define two equivalence relations on E by 


a ~ bif and only if deg f(x) = deg f,(x). 

a = bif and only if f,(x) = f,(x). 
Note that a ~ b if and only if the fields F(a) and F(b) are equal. Again, a ~ b 
implies a ~ b and now there are N(q, d)d elements in each ~ class since each of 


the N(q, d) irreducible polynomials of degree d have d (distinct) roots and distinct 
polynomials are relatively prime. Hence, we obtain an analog of (2), namely, 


q” = UN(q,4)d. (5) 
d\n 


Now, in the usual way, Mobius inversion gives 


ae ° 


1 
N(a,n) =~ dip d 


d\n 
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where yu is the MObius function. For another approach to the formula (5), and 
further results, see for example [2, p. 289]. 

Formulas (5) and (6), as well as (2), are due to Gauss. The author thanks the 
referee for informing him that the usual name for (2) is: Gauss’s formula. 
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ELEMENTARY PROBLEMS 


E 3445. Proposed by Ronald A. Jansen and Pieter Moree, University of Leiden, 
The Netherlands. 


(a) Suppose p is an odd prime and k = 1 (mod p). Prove that for any positive 
integer n the highest power of p dividing n is equal to the highest power of p 
dividing 1 +k +k?74+--- +k"7}, 

(b) Suppose k = 1 (mod 4). Prove that for any positive integer n the highest 
power of 2 dividing n is equal to the highest power of 2 dividing 1+k + 
ke pire +krot, 


E 3446. Proposed by Jean-Pierre Grivaux, Lycée Chaptal, Paris, France. 
Suppose 0 < uy < 1 and 0 <A < 1. Define a sequence {u,}"_, by putting 


Uns, =U, t+AYV1 — uz (n = 0,1,2,...). 


For small n the sequence {u,} is real and increasing. Let N = N(A, u,) be the first 
integer for which u,, > 1. For fixed uy find 


lim AN(A, Uo). 


A->0+ 
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Solution by Kee-Wai Lau, Hong Kong. For n = 2 we show that 2 —-L, = 
n~\{Inn + 1—1In2 + Odn?n/n)}. The standard formula for arc length yields 
L, = fg + n?x2"~7)!'/* dx. Letting f(z) = 1+ z— (1 + z*)'”’, this implies that 
2-L, = {if(nx"—') dx. Setting u = nx"~', we can rewrite the preceding integral 
as n'/O—-™(n — 1)-1C7 + J), where 


I= [fur du and J= [fuze — 1) du. 
0 0 
Since f(u)u~! has the primitive u — (1 + u’)'”* + In{1 + (1 + u?)!”7}, we have 
T=n-(14n?) + inft + (14+ n?)} +1-1n2 


=Inn+1-—In2+ O(n"). 


Note also that n'/¢~n - 1)7' =n714+ Odnn/n’). 

To bound J, we break the integral at u = 1. Note that f(u)u~'! <1 when 
u<1. Since ff -— u/¥@-?) du =n! and (n'7@—) — 1)f?f(wu-! du = 
O(In? n/n), we obtain the formula claimed. 


Editorial comment. A few solvers obtained more precise results by more 
refined arguments. M. Ashbaugh showed that the terms described as O(In? n/n) 
above are —(In $n)?/(2n) — (ar? — 6)/(12n) + O(n? n/n”). Results equivalent to 
2—L, =Inn/n + O(n“) or stronger were obtained by those listed below; many 
different methods were used. Note that the inequality 2 — L, < Unn)/n + 1/n is 
trivial, since it follows from the fact that L,, exceeds the length of the broken line 
with vertices (0, 0), (n~!/”, n~1), and (1, 1) and so is greater than n7~ 17" + 1 -—n7!. 

Solved also by J. Anglesio (France), M. Ashbaugh, D. Borwein (Canada), P. Bracken (Canada), 
E. R. Canfield, Z. Franco, M. Golomb, P. G. Kirmser, M. E. Kuczma (Poland), E. Levine, F. Levron 


(France), A. Meir (Canada), O. P. Lossers (Netherlands), S. Philipp, C. C. Rousseau & J. Warren, D. B. 
Tyler, the Chico Problem Group, an anonymous solver, and the proposer. 


A Bijection between Matchings and Rooted Binary Trees 


E 3349 [1989, 837]. Proposed by Ldszlé A. Székely, University of New Mexico, 
Albuquerque. 


Consider the set of matchings M on 2n labeled vertices and the set T of rooted 
binary trees with n + 1 labeled leaves (i.e., having n unlabeled nonleaf vertices 
and 2n edges). It is well known,that [M| = (2n)! /(n!2”). Prove that T has the same 
size by giving a constructive bijection between M and T. 


Solution by Jerrold W. Grossman, Oakland University, Rochester, MI. The binary 
trees must be unordered in the sense that interchanging the subtrees of an internal 
vertex does not change the tree; this is the same as viewing the tree as a partially 
labeled graph. Assume that the leaves of the trees are labeled from 1 to nm + 1 and 
the vertices to be matched are labeled from 1 to 2n. We construct ¢: T — M and 
Ww: M — T. It will be clear from the construction that these are inverse functions. 

Given T € T, we successively assign the labels m + 2,...,2n to the nonleaf 
vertices other than the root as follows: having assigned labeis up to j — 1, we 
assign j to the unlabeled vertex among those whose children are both labeled that 
has the child with the smallest label. Such a vertex exists, since we begin with all 
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leaves labeled and there are exactly n — 1 nonleaf vertices to be labeled. Define 
¢(T ) to be the matching that pairs each vertex with its sibling. 

Conversely, given a matching M, we successively assign to the vertices n + 2 
through 2n a matched pair of children as follows: having assigned pairs to vertices 
up to j — 1, we assign to j the unassigned pair among those whose labels are both 
less than j that has the vertex with the smallest label. Such a pair exists, since 
when j < 2n at most 2n — (j — 1) pairs are ineligible by having a number greater 
than j — 1 and exactly (j — 1) — (n + 1) pairs are ineligible by being already 
assigned, leaving at least 1 of the n pairs available. Define %(m) to be the tree 
such that the root has the only unassigned pair as its children and any other 
nonleaf vertex with label j has the pair assigned to j as its children. 


Editorial comment. A more general version of this result will appear in a paper 
by the proposer. 


Solved also by F. Bernhart, D. Callan, D. S. Franxblau, I. Gessel, R. J. Hendel, M. D. Hendy (New 
Zealand), K. M. Levasseur, D. Magagnosc, R. Stong, and the proposer. 


A Sequence of Least Common Multiples 


E 3350 [1989, 837]. Proposed by Paul Erdds, Hungarian Academy of Sciences, 
Budapest. 


For positive integers n and k with 1<k <n let A(n,k) denote the least 
common multiple of n,n — 1,...,n —k +1. 

(a) Prove that 14 is the largest value of n for which there exists a positive 
integer k <n such that 


A(n,1) <A(n,2) < +++ <A(n,k) =A(n,n), 


i.e., 14 is the largest value of n for which A(n, j) is strictly increasing in j until it 
reaches its maximum value A(n, n). 
(b) Let f(n) be the largest value of k such that 


A(n,1) <A(n,2) < ++: <A(n,k). 


Prove that f(n) < Cvn for some constant C. 
(c) Prove that f(n) tends to infinity with n. More precisely, prove that if 
n>k!4+k, then f(n) > k. 


Composite Solution by Richard Stong, University of California at Los Angeles, 
and John Henry Steelman, Indiana University of Pennsylvania, Indiana, PA. Ob- 
serve that 


A(n,k +1) = Ilem{n —k, A(n,k)} forl <k <n. (*) 
Thus, A(n,k) < A(n,k + 1) and equality holds if and only if n —k divides 
A(n, k). We first solve (b) by proving that f(n) < 3Vn. Clearly, f(1) = 1. Note that 
for m > 1, 
A(m?*,2) = m?(m? — 1) = (m* — m)(m? +m). 


Thus, m? — m divides A(m7?,2) and hence it also divides A(m?,m). So we have 
from (*) that A(m?, m) = A(m?, m + 1). Hence, f(m7) < m for all positive inte- 
gers m. Note that (*) implies that if A(n,k) = A(n,k + 1), then A(n + j,k +/j) 
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= A(n + j,k + j + 1) for all positive integers j. Thus, f(n + j) < f(n) + j for all 
positive integers n and j. Now, if m* <n <(m + 1)’, then n = m? + j, where 
0 <j <2m. Hence, f(n) < f(m?) +j < 3m < 3vn, where one of the last two 
inequalities must in fact be a strict inequality. 

To get the required proof for (a), first observe that 


A(14,1) <A(14,2) < ++: <A(14,7) = A(14, 14). 


Now, let 1 be an integer as in (a) with n > 14, and consider any prime p <n. Let 
k be the largest integer for which p* <n, and let m(p) be the largest multiple of 
p* which is <n. It is easy to see that A(n,n — m(p)) < A(n,n — m(p) + 1). 
Thus, for each prime p <n, we find that m(p) must be in the set {n,n — 1,..., 
n — f(n) + 1}. We now show that if p and q are distinct primes <n, then 
m(p) # m(q). Assume otherwise. We may suppose that g > p. Then for some 
integers r, s, k, and t, we get that m(p) = p*r = q's = m(q) so that gq divides r. 
But then r > gq > p—p implies a contradiction to the maximality of k. Hence, for the 
a(n) primes p <n, we get distinct integers m(p) in the set {n,n — 1,...,n — 
f(n) + 1}. This implies that f(m) > w(n). Recall that, from the result of part (b) 
already established, we have f(n) < 3v¥n. Thus, we get that a(n) < 3vn. The 
estimate w(x) > x/log x for all x > 17 (See (3.5) of J. B. Rosser and L. Schoenfeld, 
“Approximate formulas for some functions of prime numbers,” Illinois J. Math. 6 
(1962), 64-94) implies that a(x) > 3Vx for all x > 289. A direct computation 
shows that w(x) > 3vx for all x © [179, 289). Thus, for (a), we need only consider 
n & (14, 179). These can readily be handled by using (*). For example, if 15 <n < 
17, observe that 9 does not divide A(n, n — 9) and 10 divides A(n, n — 10) so that 
A(n, n — 10) = A(n,n — 9) < A(n,n). Thus, each of 15, 16, and 17 cannot be an 
integer n as in (a). Similarly, we can handle the cases n € [18, 179) by observing 
that for 18 <n < 21,11 + A(n,n — 11) and 12|A(n,n — 12); for 22 <n < 25, 
13 + A(n,n — 13) and 15|A(n,n — 15); for 26 <n < 37,19 + A(n,n — 19) and 
20|A(n, n — 20); for 38 <n < 57,29 + A(n,n — 29) and 30|A(n,n — 30); for 58 
<n < 85,43 + A(n,n — 43) and 45|A(n,n — 45); for 86 <n < 141,71 + 
A(n,n — 71) and 72|A(n,n — 72); and for 142 <n < 178,101 + A(n,n — 101) 
and 102|A(n, n — 102). 
For (c), we use (*) to obtain 
A(n,k +1) =1 k, A(n, k Att, k(n = 8) 
(1, k +1) = lem{n — k, A(n, k)} = ean kn =] 1) 
A(n,k)(n — k) 
= i 
k! , 

Therefore, if n >k!+k, then A(n,1)< --- <A(n,k) <A(n,k + 1) so that 
f(n) > k. 


Solved also by the proposer. James Duemmel supplied solutions to parts (b) and (c). 
Weighing All the Possibilities 


E 3365 [1990, 64]. Proposed by Dean S. Clark, University of Rhode Island, 
Kingston. 


Let S denote the sum of the integer weights that come with a two-pan balance 
scale. Give a necessary and sufficient condition on the weights that assures that all 
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integer-weighted objects in the range of 1 to S can be weighed exactly. Use of 
weights on both pans is permitted. 


Solution by R. J. Chapman, University of Exeter, UK. Let w, < +:: <w, be the 
weights, and let S; = L/_,w,. Then every integer amount from 1 to S can be 
weighed if and only if w, = 1 and w,,, < 28,+ 1 forl <j <xa. 

For sufficiency, we induct on j to show that every amount between 1 and S, 
inclusive can be weighed using {w,,..., w,}. This is plainly true for j = 1. If we can 
weigh i using {w,,...,w,}, then we can also weigh w,,, —i and w,,, + 7% when 
W;41 18 also available, so we can weigh every amount from w,,, — S$; to wj4, + 
S; = S;., using {w,,...,w,,,}. Since w,,, — 8S; < S; +1 by hypothesis, we can 
weigh every amount up to S;,,j. 

For necessity, suppose we can weigh all amounts from 1 to S. The second 
largest weighable amount is S — w,, which must be S — 1, so w, = 1. If wi, > 
2S; + 1 for some j, let W = S — 2S; — 1; we claim W cannot be weighed. The 
largest amount that can be weighed without putting all of {w, tbee+>W,¢ in one 


pan is 


n J 
LL wet Lew, =S— wi. < W, 
i=j+2 i=1 
but the smallest amount that can be weighed using all of {w,,,,...,w,} in one 
pan is 


n J 
~ w- Lew, =S- 28, > W. 


i=j+1 i=1 


Solved also by D. M. Barrington, D. Callan, M. Falkowitz (Israel), R. Galia, M. Golomb, R. High, 
B. G. Klein, K. M. Levasseur, O. P. Lossers (The Netherlands), H. Noland, I. Schiermeyer (Germany), 
J. H. Steelman, R. Stong, D. P. Sumner, J. S. Sumner & K. L. Dove, H. H. Tehrani (Iran), A. Tissier 
(France), J. T. Ward, E. A.Weinstein, NSA Problems Group, University of Wyoming Problem Circle, 
and the proposer. One incorrect solution was received. 


Heuristic vs. Rigorous 


E 3380 [1990, 342]. Proposed by Nick MacKinnon, Winchester College, U.K. 


Let N be the largest number of consecutive zeros occurring among the first 
101° decimal places of (1 + ¥5)/2. 

(i) Give a heuristic argument for the order of magnitude of N to be expected. 

(ii) Give a rigorous upper bound for N. 


Solution by James G. Merickel (student), Temple University, Philadelphia, PA. 
(i) Assuming that the digits of (1 + ¥5)/2 do not exhibit irregular statistical 
behavior, we would expect N to be around 1000. For in a randomly selected 
sequence of decimal digits a given string of 1000 digits occurs on average once 
every 10/°° digits. Moreover if h > 0 and if we consider the 10!°°*” possible 
strings of 1000 + A decimal digits, then fewer than 1/10” of these strings can 
occur among the first 10° digits of a sequence of decimal digits, and so it would 
be reasonable to wager 10” — 1 to 1 against the occurrence of a particular one, 
such as the string of 1000 + / zeros. 
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integer-weighted objects in the range of 1 to S can be weighed exactly. Use of 
weights on both pans is permitted. 


Solution by R. J. Chapman, University of Exeter, UK. Let w, < +-: <w, be the 
weights, and let S; = L/_,w,. Then every integer amount from 1 to S can be 
weighed if and only if w; = 1 andw,,, <258,+ 1 forl <j <a. 

For sufficiency, we induct on j to show that every amount between 1 and 5S, 
inclusive can be weighed using {w,,..., wi}. This is plainly true for j = 1. If we can 
weigh i using {w,,...,w,}, then we can also weigh w;,, —i and w,,, +7 when 
W;,, is also available, so we can weigh every amount from w,,, — S$; to w,,, + 
S, = S;,, using {w,,...,w,,}. Since w,,, — S; < S$; + 1 by hypothesis, we can 
weigh every amount up to S,,). 

For necessity, suppose we can weigh all amounts from 1 to S. The second 
largest weighable amount is S — w,, which must be S — 1, so w, = 1. If wi, > 
2S; + 1 for some j, let W = S — 2S; — 1; we claim W cannot be weighed. The 
largest amount that can be weighed without putting all of {w, tp-:+>W,} in one 


pan is 


n J 
VL wet Lew, =S— wi, < W, 
i=j+2 i=] 
but the smallest amount that can be weighed using all of {w,,,,...,w,} in one 
pan is 


n J 
~ w- Lw,=S-28,> W. 


i=j+1 i=1 


Solved also by D. M. Barrington, D. Callan, M. Falkowitz (Israel), R. Galia, M. Golomb, R. High, 
B. G. Klein, K. M. Levasseur, O. P. Lossers (The Netherlands), H. Noland, I. Schiermeyer (Germany), 
J. H. Steelman, R. Stong, D. P. Sumner, J. S. Sumner & K. L. Dove, H. H. Tehrani (Iran), A. Tissier 
(France), J. T. Ward, E. A.Weinstein, NSA Problems Group, University of Wyoming Problem Circle, 
and the proposer. One incorrect solution was received. 


Heuristic vs. Rigorous 


E 3380 [1990, 342]. Proposed by Nick MacKinnon, Winchester College, U.K. 


Let N be the largest numher of consecutive zeros occurring among the first 
101° decimal places of (1 + ¥5)/2. 

(i) Give a heuristic argument for the order of magnitude of N to be expected. 

(ii) Give a rigorous upper bound for N. 


Solution by James G. Merickel (student), Temple University, Philadelphia, PA. 
(i) Assuming that the digits of (1 + ¥5)/2 do not exhibit irregular statistical 
behavior, we would expect N to be around 1000. For in a randomly selected 
sequence of decimal digits a given string of 1000 digits occurs on average once 
every 101° digits. Moreover if h > 0 and if we consider the 10'°°+” possible 
strings of 1000 + A decimal digits, then fewer than 1/ 10” of these strings can 
occur among the first 10/°° digits of a sequence of decimal digits, and so it would 
be reasonable to wager 10” — 1 to 1 against the occurrence of a particular one, 
such as the string of 1000 + / zeros. 
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(ii) We prove rigorously that N < 5 - 10°”. Since 


(1+ v5)/2=1+ 1/{(1 + V5 )/2}, 


all partial quotients in the continued fraction expansion of (1 + V5) /2 are 1. Thus 
it follows from the proof of Theorem 188 in [3] that for any positive integers p and 
q we have 


lp/q — (1 + V5)/2| > 1/(10q7). 


(In fact it is known that 10 can be replaced by V5 +e for sufficiently large q. Cf. 
Theorem V of Chapter I of [1].) In particular 


|p/10" — (1 + ¥5)/2| > 1/102"*! 


for any positive integer n. This means that if the mth digit in the decimal expansion 
of (1 + ¥5)/2 is non-zero, then no more than the next n consecutive digits can be 
zero. Thus a string of more than 5 - 10°”? consecutive zeros among the first 1010” 
decimal digits of (1 + 75 )/2 is impossible. 


Editorial comment. If N, denotes the length of the longest string of consecutive 
zeros among the first 10! decimal places of (1 + V5 )/2, it is easy to determine that 
N, = 1, N, = 1, and N, = 3. These data fit in with the conjecture made in the 
above solution that N, has the same order of magnitude as 1. Unfortunately it is 
not easy to extend these calculations as far as Nig = N. 

If r is around 1000, there is an approximate formula for the probability of at 
least one occurrence of r consecutive zeros among the first 10!° places of a 
random sequence of decimal digits, that is, the probability of at least one success 
run of length 7 in 10/°°° Bernoulli trials each with success probability 1/10. In 
fact, according to the last problem of Chapter XIII of [2], 


p, = 1 — exp{—101°(9/10)(1/10)"} = 1 — exp{—9 - 10°9°~"}. 


Thus Po 9 = 0.99988, D999 = 0.59343. If h is a small positive integer, the probabil- 
ity Diooo+, Of at least one occurrence of 1000 + h consecutive zeros among the 
first 10'°° places of a random sequence of decimal digits is around 


1 — exp{-—9-10-"*-1} =9-107"7!; 


this agrees with the remark in the above solution of (i) that pj 9, < 107”. 


REFERENCES 


1. J. W. S. Cassels, An Introduction to Diophantine Approximation, Cambridge Tract No. 45, 1957. 
2. William Feller, An Introduction to Probability Theory and Its Applications, Volume I, Wiley, 1968. 
3. G.H. Hardy and E. M.Wright, An Introduction to the Theory of Numbers, Oxford, 1960. 


Solved also by the proposer. 
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SOLUTIONS OF ADVANCED PROBLEMS 
Two Sets of Density One 


6583 [1988, 880]. Proposed by Moshe Laub, Jerusalem, Israel. 


(a) Let A be the set of positive integers z such that z* is expressible as a sum 
of two nonzero squares. Prove that A has asymptotic density one. 

(b) Let B be the set of positive integers z such that the triangular number 
T, = z(z + 1)/2 is expressible as a sum of two nonzero triangular numbers. Prove 
that B has asymptotic density one. 


Solution by L. E. Mattics, University of South Alabama, Mobile. (a) We begin by 
showing that z € A if and only if z is divisible by at least one prime congruent to 1 
modulo 4. On the one hand, if z = jp, where p is a prime congruent to 1 modulo 
4, then p =a’ +b*, where a>b>O0, so that p* = (a* — b”)* + (Qab)* and 
hence z? = j7p” = (a’j — bj)? + (Qabj)*. On the other hand, if z is not divisible 
by any prime congruent to 1 modulo 4, then there are exactly four solutions of 
x’ +y* =z? in integers x, y, namely x = +z, y=O0O and x =0, y= +z. (Cf. 
§5.10 of [3]). 

Let C be the set of positive integers complementary to A, that is, C is the set of 
positive integers not divisible by any prime congruent to 1 modulo 4. Since the sum 
of the reciprocals of the primes congruent to 1 modulo 4 diverges, we see that C 
has natural density zero and so A has natural density one. (Cf. §§11.1 and 11.3 
of [3].) 

(b) We begin by showing that z € B if and only if 2z* + 2z + 1 is composite. 
For 8T, + 2 = (2z + 1)? + 1 = 2r, where all prime factors of r are congruent to 1 
modulo 4 (since —1 is a quadratic residue modulo any prime factor of r). Now 
T, = T, + T, for positive integers u and v if and only if 


2r = (2u + 1)° + (20 + 1)’. (*) 


Since r has only prime factors congruent to 1 modulo 4, the number of solutions of 
x* + y? = 2r in integers x, y (necessarily odd) is equal to 4d(r), where d(r) is the 
number of positive integral divisors of r. The equality 2r = (2z + 1)? + 1 provides 
eight solutions x = +(2z +1), y= +1 and x = +1, y= +z +1) of x7 + 
y” = 2r and so there is a solution of (*) in positive integers u and v if and only if 
d(r) > 2, ie., if and only if r is composite. 

Let D be the set of positive integers complementary to B, that is, D is the set 
of positive integers z such that 2z? + 2z+1 is prime. Now for any prime gq 
congruent to 1 modulo 4, gq <z, the primality of 2z” + 2z +1 requires that z 
must avoid the two residue classes s modulo gq such that 


2(252 + 2s +1) = (25 + 1)? +1 =0 (mod q). 


(If one of these two residue classes is represented by s(q), where 0 < s(q) <q, the 
other is represented by q — 1 — s(q).) Thus if q, is the ith prime congruent to 1 
modulo 4, the density of D does not exceed 


nbd 


i=1 q; 
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for any fixed m. Since the sum of the reciprocals of the primes congruent to 1 
modulo 4 diverges, we see that D has natural density zero and so B has natural 
density one. 


Editorial comment. It follows from Theorem 2.2 of [1] that if G(x) is the 
number of elements of the set C not exceeding x and H(x) is the number of 
elements of the set D not exceeding x, then 


G(x) = O(x(log x) 7"), H(x) = O(x(log x) '). 


The result of §183 of [2] shows that actually G(x) ~ ax(log x)~!/7 as x > ~, 
where a is a positive constant. Heuristically we would also expect an asymptotic 
formula H(x) ~ Bx(log x)~! as x > ©, where B is a positive constant. 

Thomas Maxsein and Paul R. Smith remarked that the equivalence “z € B if 
and only if 2z2 + 2z + 1 is composite” used in the solution of (b) has occurred at 
least twice in the literature, namely in [4] and [5]. 


REFERENCES 
1. H. Halberstam and H.-E. Richert, Sieve Methods, Academic Press, 1974. 


2. Edmund Landau, Handbuch der Lehre von der Verteilung der Primzahlen, Chelsea Publishing Co., 
1953. 

3. Ivan Niven and Herbert S. Zuckerman, An Introduction to the Theory of Numbers, Wiley, 1980. 

4. Daniel Shanks, An analytic criterion for the existence of infinitely many primes of the form 
(n* + 1)/2, Illinois J. Math., 8 (1964) 377-379. 

5. Waclaw Sierpinski, Sur les nombres triangulaire qui sont sommes de deux nombres triangulaires, 
Elem. Math., 17 (1962) 63-65. 


Solved also by the late Emil Grosswald, Thomas Maxsein & Paul R. Smith, and the editors. The 
Grosswald and Maxsein-Smith solutions were similar to that given above. The editors’ solution used the 
method of Orrin Frink, Almost Pythogorean triples, Math. Magazine, 60 (1987) 234-236. 


A Property of the Rationals 


6614 [1989, 846]. Proposed by Justin G. MacCarthy, Deming, NM. 


A proper subset A of an infinite abelian group G is called a “wedge” in G if 
the family of all translates A + x, x © G, is closed under finite intersection. Prove 
that in Q*, the additive group of the rationals, the only wedges are half-lines (open 
or closed). 


Solution by Richard Stong, University of California at Los Angeles. Let A be a 
wedge in Q*. Define B = {x € Q*: A + x CA}. We will show that B = Q 2 [0, ©) 
or B = ON (—~,0]. This clearly implies that A is a half-line. 

First we will show B is a wedge. Clearly 0 € B, so B is nonempty and B # QO”, 
since otherwise we would have A = Q*. Suppose y € Q”. Then clearly B + y = 
{x € O*: A+xCA+y}. Thus since A is a wedge 


NB +y) = [peOu Ate VA +y0)| 


={xeQ*:At+xcAt+z=B+z, 


for some z. That is, B is a wedge. Next note that B is closed under addition, i.e. B 
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is a submonoid of Q*. This is clear since A +x CA and 4+yCA clearly 
implies d+x+yCA+yCa. 

Suppose B contained both positive and negative elements, say a and —b. Since 
B is closed under addition B would contain their positive multiples and hence 
would contain c and —c for some positive c. Rescaling A, and hence B, we may 
assume +1 € B. Since B is a wedge, for any integer n we have 
= B+ P , 

q 

for some p/q € Q* (we may assume p > 0 by, if necessary, replacing A by —A). 
Hence p/q and p/q — 1/n belong to B. Since B is closed under addition, we 
would thus have 


1 
Bo (B+— 
n 


+ p(-1) EB 


1 D 
--~=(q-1)—+ 
n q 
and 


1 Pp p |i 
—=(nq-—n+1)—+(n—-1)]/—-— —] +np(-1) +16B. 
n q q fn 


Since B is closed under addition, this would imply B = Q*. 

Therefore B contains elements of only one sign. Replacing A by —4A, if 
necessary, we may assume B CQ 1 [0,~). Fix x € QM [0,), let y be such that 
Bo(B+x)=B+t+y, and let k be the least positive integer such that there is a 
nonnegative integer n with ny + kx © B (k exists since for some large K and M, 
Kx = My © B). Then 

B+(n4+1)y+(k-1)x =(Btnyt+ke) 0 (Btny+(k-1)x) CB, 
since B + ny + kx CB. In particular, (n + 1)y + (k — 1)x © B. This contradicts 
the minimality of k unless k = 1. Thus there is a least nonnegative integer n for 
which ny + x € B. However, since BN (B+x)=B+y and ny €B, this would 
imply ny +x &€Bty or(n —- 1)y +x €B. This contradicts the minimality of n 
unless n = 0 and hence x € B. Therefore B = Q 1 [0,), as desired. 


Editorial comment. Duane Broline and the proposer point out that the result is 
false for the additive reals (just subject the half-line wedge 0 <x to a totally 
discontinuous linear transformation—this yields an everywhere dense image 
wedge). On the other hand, F. J. Flanigan proved the result for an arbitrary 
subgroup of the additive rationals. 


Solved also by Duane Broline, F. J. Flanigan, and the proposer. 
Averages of the GCD and of the LCM 


6615 [1989, 847]. Proposed by Kee-Wai Lau, Hong Kong. 


Denote by (i, j) and [i, j] respectively the greatest common divisor and the least 
common multiple of the positive integers i and j. Prove that as n tends to infinity 


n7log n 


iu LGD ~ ay (i) 
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is a submonoid of Q*. This is clear since A +xCA and 4 +yCA clearly 
implies dA+x+yCAt+yCa. 

Suppose B contained both positive and negative elements, say a and —b. Since 
B is closed under addition B would contain their positive multiples and hence 
would contain c and —c for some positive c. Rescaling A, and hence B, we may 
assume +1 € B. Since B is a wedge, for any integer nm we have 
= B+ P , 

q 

for some p/q € Q* (we may assume p > 0 by, if necessary, replacing A by —A). 
Hence p/q and p/q — 1/n belong to B. Since B is closed under addition, we 
would thus have 


1 
Bo (B+ — 
n 


+ p(-1) €B 


1 Dp 
--—=(q-1)—+ 
n qd 
and 


1 Pp p i 
—=(nq-—n+1)—+(n-1)|—-- —] +np(-1) +1648. 
n q q wn 


Since B is closed under addition, this would imply B = Q”. 

Therefore B contains elements of only one sign. Replacing A by —4A, if 
necessary, we may assume B CQ 1 [0,~). Fix x € QM [0,), let y be such that 
Bo(B+x)=Bty, and let k be the least positive integer such that there is a 
nonnegative integer n with ny + kx € B (k exists since for some large K and M, 
Kx = My &€ B). Then 

B+(n+1)y+(k-1)x=(Bt+nyt+kxe) N(Bt+ny + (k-1)x) CB, 
since B + ny + kx CB. In particular, (n + 1)y + (k — 1)x © B. This contradicts 
the minimality of kK unless k = 1. Thus there is a least nonnegative integer n for 
which ny + x € B. However, since BN (B+x)=B+y and ny © B, this would 
imply ny+xEeBt+y or(n — 1)y +x €B. This contradicts the minimality of n 
unless n = 0 and hence x € B. Therefore B = Q 1 [0,~), as desired. 


Editorial comment. Duane Broline and the proposer point out that the result is 
false for the additive reals Gust subject the half-line wedge 0 <x to a totally 
discontinuous linear transformation—this yields an everywhere dense image 
wedge). On the other hand, F. J. Flanigan proved the result for an arbitrary 
subgroup of the additive rationals. 


Solved also by Duane Broline, F. J. Flanigan, and the proposer. 
Averages of the GCD and of the LCM 


6615 [1989, 847]. Proposed by Kee-Wai Lau, Hong Kong. 


Denote by (i, j) and [i, j] respectively the greatest common divisor and the least 
common multiple of the positive integers 7 and j. Prove that as n tends to infinity 


n7log n 


iL LGD) ~ sa (i) 
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Using (2), (3) and (4) we obtain 


n _ n? 1 n ; 
2X a in" nde og + O(n ) 
AD ——logn + O(n’), 


which proves (A). 
The proof of (B) uses the arithmetical function 


h(m) = di du(d), 
d|m 


where p is the Mobius function. Note that h(m) = ane — p), so that 


|a(m)| = [1 (p - 1) < Ile < 


p\|m 

Mobius inversion gives us 
1 h(d) 
m d|m d 


Take m = (i, j) in this formula, note that [i, 7] = #//(, j), and sum on i to obtain 


n n h(d n/a h(d 
Eid- Ler “8 -jiy ¥ aay 
i=1 i=1 di, j) d|j q=1 
j n\? jn’? _ h(d) jn _, h(d) 
= 5 EMa){ (7) +0(5)| = De of Fo“) 
jn? _ h(d) 
are pe + O(nir(i)), 


where 7(j) is the number of positive integers dividing 7. Summing on j and using 
Dirichlet’s estimate for r(j), we find 


non n2 n ne ’) 
LY Liki, => Yi i 3 + O(n? log n) 
j=li=l j=1 dij 
n* h(d) 
2 qd<n d’ 
n2 ” h(d 
-—y hd) q + O(n? log n) 
2 d=1 d q<n/d 
n* "2 h(m n \lh(m)| 
=—) a + 0[m S ( + O(n toe) 
4 m=1 m m= 
n* ” h(m) n 
_ a 3 +O me ~ + O(n? log n) 
n* 2 him 
= 7 » O(n? > log n). 
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The principal term is 


nn? h(m) n* 2 1 n* 

— =— Y —¥ du(d) = — du(d 
4 XL m° 4 m° X a 4 qd<n ( d)° a 
n* ” wld d? 

EAD olf 
n* n wd) ; 
= 7é@) Lu da + O(n") 

m3), 


from which we obtain (B). 


Editorial comment. A number of solvers found various error terms and general- 
izations. In particular, a collection of results on these sums and related ones may 
be found in Persi Diaconis and Paul Erdos, “On the distribution of the greatest 
common divisor,” Technical Report No. 12, Department of Statistics, Stanford 
University, Stanford, 1977. They trace results of this type back to E. Cesaro, Etude 
moyenne du plus grand commun diviseur de deux nombres,’ Ann. Mat. Pura 
Appl., (2) 13 (1885) 235-250. 

Solved also by Persi Diaconis and Paul Erdés Gointly), Mark C. W. Jones (U.K.), Haseo Ki (Korea), 


Enoch Kin-S: Lee, O. P. Lossers (The Netherlands), Richard Stong, Alain Tissier (France), T. P. T. 
Williams, and the proposer. 


A Formula for the mth Largest Prime Factor of an Integer 


6618 [1989, 942]. Proposed by Armel Mercier, Université du Québec a Chicoutimi, 
Canada. 


Let w(n) denote the number of distinct prime factors of n, p(n) the smallest 
prime factor of n, and P_(n) the mth largest prime factor of n. 
(i) Prove that for any nonnegative integer m we have 


Praia) = (ey D(0C4) ~ "u(y (a), 
din 


where « denotes the Mobius function. 
(ii) Prove that for any nonnegative integer m we have 


Y {o(d)}"u(d)p(d) = Y (-1)*'S(m + 1,7 + IP .(n), 
din jJ=0 
where S(m, j) denotes the Stirling number of the second kind. 


Solution by John Henry Steelman, Indiana University of Pennsylvania, Indiana, 
PA. It should probably be noted that these formulas assume that P,(n) = 0 if 
m > w(n) and that p(1) = 0. 
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(i) We establish the desired formula by induction on w(n). If w(n) = 0 (ie, 
n = 1), then trivially 


C—O (CCD ~ ud) p(d) = 0 = Pym) 


d\n 


for all nonnegative integers m. 

So assume that the formula has been established for all positive integers having 
fewer than w(n) distinct prime factors. We can factor n as n = aq’ where q isa 
prime larger than any prime factor of a. For the case m = 0, note that 


(1 (°C) ~ "\ucaypca) 


d\n 
=(-1))0 > [ota do | u(da’) aa’) 
dla J=0 
= (-1) Ea(a')a + » {u(d)p(d) — u(d) p(d)} 
dla 
d>1 


=q=P,(n). 


If m > 0 we have 


("D(C ~ 1d) oa) 


d\n 


-(-)"" DY [o( da") - | u(da’) p( da’) 


dla j=0 


=(-r" y [(0) 1) — (04) \ ayo) 


dla 
d>1 


= (-1" (0) 7 \ucaynca) 


dla 
— P,,(@) = Py+(n). 


(ii) We require the following basic property of the Stirling numbers of the 
second kind (which can be used as their defining property): If m is a non-negative 
integer and x is an indeterminate, then 


ae 


sim +1, DIL =i, (*) 


where the empty product (when j = 0) is understood to be 1. 
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The result of (ii) can be deduced from (*) and the result of (i) as follows: 


Y (-1)'**7!S(m + 1,7 + 1) P(r) 
J=0 


E(-asis(m + tp MVE (PO "acayoca) 


d\n 
=) y j1S(m +1,j +1) “a '\a(d)eCa) 
d|n j=0 
= Y{w(d)}"n(d) p(d). 
d\n 


Solved also by David Callan, Haseo Ki (Korea), and the proposer. 


The Ratio of the Roots of an Irreducible Quadratic 


6621 [1990, 76]. Proposed by William C. Waterhouse, Pennsylvania State Univer- 
sity, University Park. 


Which numbers can occur as a ratio of roots of an irreducible quadratic 
equation? More precisely, suppose K is an algebraically closed field. For each 
subfield E of K and each separable quadratic extension F of E with conjuga- 
tion , certain elements of K occur as ratios ¢/c with c in F. Which elements of 
K (besides 1) do arise in this way when we allow all possible choices of E and F? 


Solution by Dennis R. Estes, University of Southern California, Los Angeles. The 
problem is to determine the elements 6 (other than O and 1) in an algebraically 
closed field K having property 


A. There exist subfields E, F of K with F separable of degree 2 over F, and 
c © F such that 0 = c*/c where * is the generator of the Galois group of F 
over E. 


Let II denote the subfield of K generated by 1. We shall show that A is equivalent 
to 


B. Either 9 is transcendental over II, or 6 and @~! are roots of the same 
irreducible polynomial over II. 


It is easy to see that B follows from A. Assume @ is algebraic over II with minimal 
polynomial p(x) € I[x]; since * fixes p(x) we have p(@) = p(6*) = p(@—') = 0. 

To show that A follows from B, first consider the case 6 # —1. Observe that B 
implies the existence of an automorphism * of F = II(@) sending 6 to 6~'. Let E 
denote the fixed field of *. Since 6 # 9~! = 6*, the generator * has order 2 and by 
elementary Galois theory F is a separable degree 2 extension of E with Galois 
group generated by *. (Alternatively, 9 + @~' € E and since @ is a root of 


g(x) =x?-(6+07')x+1=0, 


we have [F: E] <2. Since 06 € E, we have [E:F]=2 and since g(x) has 
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The result of (ii) can be deduced from (*) and the result of (i) as follows: 


y (-1)'7!8Cm + 1,7 + 1) P(r) 
jJ=0 


© (-1)Y1s(m + 1,74 )(-1) YE jo - '\a(d) pCa) 
j=0 


d\n 
=) Y 18(m + 1,7 + 1) “a '\a(d)eca) 
d\n j=0 
= Li {o(d)}"u(4)p(4). 
d\n 


Solved also by David Callan, Haseo Ki (Korea), and the proposer. 


The Ratio of the Roots of an Irreducible Quadratic 


6621 [1990, 76]. Proposed by William C. Waterhouse, Pennsylvania State Univer- 
sity, University Park. 


Which numbers can occur as a ratio of roots of an irreducible quadratic 
equation? More precisely, suppose K is an algebraically closed field. For each 
subfield E of K and each separable quadratic extension F of E with conjuga- 
tion , certain elements of K occur as ratios €/c with c in F. Which elements of 
K (besides 1) do arise in this way when we allow all possible choices of E and F? 


Solution by Dennis R. Estes, University of Southern California, Los Angeles. The 
problem is to determine the elements 6 (other than O and 1) in an algebraically 
closed field K having property 


A. There exist subfields E, F of K with F separable of degree 2 over EF, and 
c € F such that 0 = c*/c where * is the generator of the Galois group of F 
over E. 


Let II denote the subfield of K generated by 1. We shall show that A is equivalent 
to 


B. Either 6 is transcendental over II, or @ and 67! are roots of the same 
irreducible polynomial over II. 


It is easy to see that B follows from A. Assume @ is algebraic over II with minimal 
polynomial p(x) € II[x]; since * fixes p(x) we have p(6) = p(6*) = p(@—') = 0. 

To show that A follows from B, first consider the case 6 # —1. Observe that B 
implies the existence of an automorphism * of F = II(@) sending 6 to 6° '. Let E 
denote the fixed field of *. Since 9 + 6~! = 6*, the generator * has order 2 and by 
elementary Galois theory F is a separable degree 2 extension of E with Galois 
group generated by *. (Alternatively, 9 + 0~! € E and since @ is a root of 


g(x) =x*-(64+0 ')x+1=0, 


we have [F:E] <2. Since 0¢£E, we have [E:F]=2 and since g(x) has 


568 PROBLEMS AND SOLUTIONS [June-July 


precisely 6 and 67! as its distinct roots, F is separable over E.) Now observe that 


(1+ 67')* 
—  14+67! 
(One could also handle this case with Hilbert’s Theorem 90). 
We now consider the case 0 = —1. Since 6 # 1 the field K does not have 


characteristic 2. Hence either II is a field with p elements where p is an odd 
prime, or II is isomorphic to the field of rationals. In either case II + II”. Take 
a — II with a € II”. Then 


(va )* 
Ta 


where * is the generator of the Galois group of the separable extension II(Va ) /II. 


Solved also by R. J. Chapman (U.K.) and the proposer. 
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As we wrestle with the revision of the mathematics major in response to the 
impetus of technological change and declining numbers of students, we face a 
variety of new curricular problems. But there are also some older problems which 
have never been successfully resolved. First, there are simply too many topics that 
need to be included in the training of any student who is serious about mathemat- 
ics. This problem has become particularly acute with the recent much-needed 
addition to the curriculum of courses in discrete mathematics and modeling. Our 
programs are structured in such a way that a student either takes a full course on a 
subject or else learns nothing about it at all. As a result, students often remain 
totally ignorant of such venerable but still important subjects as number theory, 
calculus of variations, and geometry—projective, differential, or hyperbolic. 

A second problem is that most students majoring in mathematics, even very 
good ones who take many courses in the discipline, complete their undergraduate 
programs with little sense of the sweep and unity of the subject. How can they 
develop such an overview when we offer them a hodgepodge of almost unrelated 
courses? It is rare to see any calculus or geometry used in an algebra course or to 
encounter a group or a matrix in undergraduate analysis. Even for those few 
students who continue on to graduate school and into the profession, this lack of 
perspective is rarely remedied. In too many graduate programs once the utilitarian 
basic training is completed, there is little encouragement to take or to audit 
courses away from a narrow research area. 

It has been proposed that an undergraduate course in the history of mathemat- 
ics, perhaps as a capstone course in the major, might partially solve these 
problems. But few of us have more than a rudimentary knowledge of the history of 
our subject or much of a desire to expend energy in learning it. One can also argue 
that a serious student is better off learning some new subject in depth than taking 
a course in mathematical history, which necessarily must be somewhat superficial. 
Thus such a course would be best for our less dedicated or weaker students who 
probably don’t have the necessary background or maturity to appreciate it anyway! 
For these reasons I have never felt very enthusiastic about this or any other 
capstone idea. But Stillwell’s book is making me think again. It is the first book 
concerned with the history of mathematics that I have found really captivating. 
This is partly because it uses a historical perspective to discuss a broad range of 
mathematical topics and in the process fills many of the gaps mentioned above. 

Let me illustrate the advantages of the historical point of view with two 
geometric examples, one with an algebraic flavor and the other from differential 
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geometry. We all learn about lines and conics in calculus courses, but for most of 
us the study of algebraic curves stops with degree two. What about cubics, the next 
most complicated case? Newton was the first to attempt a classification of cubics 
similar to that for conics as ellipses, parabolas, and hyperbolas. He wrote several 
papers on the topic over a period of twenty-five years, the last in 1695. He wished 
to classify according to their asymptotic behavior all curves defined by the general 
nondegenerate cubic equation in two real variables 


ay? + bry? +cx*y + dete’? +fytexrth+k+1=0. 


Here nondegenerate means that the equation does not factor over R, so that its 
graph contains no components which are lines or conics. The rotations and 
translations that simplify conics do not suffice for the more complicated set of 
cubics. But by employing a general affine change of coordinates Newton was able 
to reduce a nondegenerate cubic to one of four types: 


y = Ax? + Bx*+Cxe+D (1) 
xy = Ax? + Bx? + Cx +D (2) 
y? =Ax? + Bx? +Cx+D (3) 
Mxy? + Ny = Ax? + Bx? +Cx+D, M#0. (4) 


To do this, he performed a tour de force of classical algebra which involved 
passing through an equation with 84 terms. Today it is much less difficult to carry 
out this simplification. For a hint on how to do it efficiently (as well as for a lucid 
modern treatment of the theory of curves), see [2, p. 91]. Notice that by allowing a 
general affine change of coordinates, Newton enlarged the group of permissible 
transformations beyond the Euclidean group, which suffices to classify the conics. 

Newton then proceeded to divide these curves into 72 species according to their 
asymptotic behaviour. Actually, there should be 78 species in all according to his 
classification scheme—-Newton missed 6 species which were found by later au- 
thors. Types (1) and (2), which yield one species each, are curves that we all graph 
in elementary calculus. There are five species of type (3), depending on whether 
the right-hand side of the equation has three distinct real roots, two complex and 
one real one, a single real root of multiplicity three, or a real root of multiplicity 
two either greater or less than the other root of multiplicity one. It is a good 
exercise for calculus students to obtain this five-fold breakdown and to draw a 
graph of each species. Type (4), which yields the remaining 71 species, is not easily 
dealt with by hand (though of course Newton managed it!), but representative 
curves can be obtained with Derive or Mathematica, since one can complete the 
Square in y and thus express y in terms of x. For those looking for an interesting 
exploratory exercise using such a package, here it is. Newton presented a graph of 
each species (see [4], [5], or [2, pp. 93-98], where a portion of Newton’s article is 
reproduced). 

Newton’s classification scheme was criticized by Euler and others because his 
geometric criteria appeared to lack a unifying theme and resulted in so many 
different species. Later authors were undeterred however: Pliicker in 1835 gave a 
more refined classification and obtained 219 different species. For a discussion of 
Newton’s and Pliicker’s criteria see [3]. But the real answer to Euler’s criticism was 
already hinted at in a brief section of Newten’s paper entitled “On the Genesis of 
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Curves by Shadows’. Newton asserted that all nondegenerate cubic curves have as 
a projective image one of the five species of type (3). Thus by adopting the point of 
view of projective geometry or, equivalently, by enlarging the group of permissible 
transformations from the affine group to the projective group GL(3, R)/{k - Id}, 
the classification of cubics simplifies greatly. 

Considering the classification problem within the projective plane makes good 
intuitive sense, since that plane is formed by adding to the affine plane a point for 
each asymptotic direction, and it was the variety of asymptotic behaviour which 
gave rise to the large number of species of cubics. It was not until the eighteenth 
century that projective geometry was sufficiently well developed to provide a 
rigorous proof of Newton’s claim (see [1], pp. 122—124). The final step in the 
classification of cubics, a contribution of nineteenth century mathematics, was to 
introduce the complex projective plane with its associated larger group of complex 
projective transformations GL(3, C)/{k - Id}. Since the field of complex numbers is 
algebraically closed, the factorization of polynomials simplifies and all cubics fall 
into just three classes according to the number of distinct complex roots of the 
right-hand side of (3). In the case of a single root the cubic is projectively 
equivalent to Neil’s parabola y* = x°, which has a cusp at the origin, and in the 
case of exactly two roots, one having multiplicity two, to the curve y? = x*(x + 1), 
which has a node. Thus in both of these cases the curve has a singularity. In the 
remaining case of three distinct roots the curve is everywhere nonsingular and is 
equivalent to an equation of the form y? = 4x° — g,x — g, with g} — 27g? + 0, 
which is known as the Weierstrass normal form. Two such curves are projectively 
equivalent if and only if their J-invariants J = g3/(g3; — 27g?) are the same. For a 
full discussion of this classification see [2, pp. 283-305]. 

There are essential topological and algebraic differences between the singular 
and nonsingular cases. Singular cubic curves when viewed as surfaces in two- 
dimensional complex projective space have genus zero and possess a parametriza- 
tion by rational functions, while nonsingular curves have genus one and admit no 
rational parametrization. So how can a nonsingular curve be parametrized? By 
elliptic functions, and this leads us to yet another topic of nineteenth century 
mathematics. The genus one curves are called elliptic curves and are currently the 
object of intensive study because of their connections with such arithmetic phe- 
nomena as Fermat’s Last Theorem. By tracing the history of cubic curves we have 
seen the application of tools from calculus, linear algebra, group theory, topology, 
projective geometry, and complex variables, all to one very concrete problem. 
What better way is there to see that all of mathematics fits together? 

Our second example concerns the origins of the Gauss-Bonnet theorem, a 
fundamental result in differential geometry which links the curvature of a surface 
to its topological type. Although this theorem can be appreciated with only a 
smattering of differential geometry and topology, it is usually encountered only at 
the end of a full course in these subjects. One version of this theorem asserts that 
if § is a two-dimensional surface in R*? which is bounded by a finite number of 
smooth curves C;, then 


a; + Ll Ke ds + J [Kaa = 27y(S), (5) 


In this formula K denotes the Gaussian curvature function on the surface which at 
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each point measures the bending of S, and x, denotes the geodesic curvature 
function on the boundary curves C, which is the projection of the usual curvature 
Kk of a three-dimensional curve onto the tangent plane of the surface. Thus, in a 
planar region x, agrees with curvature as defined in calculus. The a, (if present) 
are the (signed) exterior angles at which the curves C; meet, and y(S) denotes the 
Euler characteristic of the surface S, which can be calculated from the familiar 
formula ,»(S) = V — E + F. For precise definitions of these concepts see any book 
on differential geometry (or the book being reviewed here). The important fact is 
that the curvature functions capture the geometry of the surface S and the curves 
C;, while the Euler characteristic determines its topology. In the form stated here 
the theorem is a late nineteenth century elaboration of work of Gauss in 1825 and 
Bonnet in 1848, but it had some very early antecedents which help considerably in 
appreciating the theorem and in making it plausible. 

Perhaps the simplest and earliest recognizable precursor is the familiar theorem 
in Euclidean geometry that the sum of the exterior angles of a convex polygon is 
equal to 27r. Since a polygon is bounded by straight lines and lies in a plane, both 
curvature terms are zero. Its Euler characteristic can be directly calculated to be 
one, and we see that this ancient result is a special case of the Gauss-Bonnet 
theorem. Notice that the angular terms in (5) are a reasonable measure of turning 
at those points where the geodesic curvature is not defined. Thus the Euclidean 
theorem can be interpreted as asserting that the total turning in traversing the 
boundary of a convex polygon is 277. 

Another simple special case is the observation that for a simple closed curve in 
the plane {x ds = 277. If curvature x is calculated by the formula d6/ds, where at 
each point on the curve @ is the polar angle and s denotes arc length, this result 
follows directly from the Fundamental Theorem of Calculus. 

A third antecedent is due to Thomas Harriot, a colorful figure in English 
mathematics who, undeservedly, is little known today. He was a protegé of Sir 
Walter Raleigh and wrote a book in 1588 about Raleigh’s first attempt to establish 
a settlement in Virginia. Harriot discovered several particular facts which have had 
a significant impact on the development of mathematics and which should have 
earned him a permanent place in mathematical history. One of these, which is 
sketched in Stillwell’s book, is an elegant geometric relation between the area, 
radius, and angles of a spherical triangle. A spherical triangle is a triangle on the 
surface of a sphere whose sides are formed by arcs of great circles. Harriot showed 
that the area A(T) of a spherical triangle 7, the radius R of the sphere on which it 
lies, and the interior angles B, of T are related by the equation 


B,+P,+B3;-7 = A(T) /R’. 


The left-hand side of this equation expresses the “angular excess” of the spherical 
triangle over the angle sum 7 for a plane triangle. A sphere has constant Gaussian 
curvature equal to 1/R’, and the second term in (5) vanishes, since the projection 
of a great circle onto the tangent plane is a line with curvature zero. Using the 
facts that at each vertex the interior and the exterior angles sum to 7 and that 
the Euler characteristic of T is one, we find that Harriot’s result is precisely the 
Gauss-Bonnet theorem for spherical triangles. 

Around 1630, Descartes generalized the Eucledean angle sum theorem from 
polygons to polyhedra. His result was that for any convex polyhedron the total 
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exterior solid angle is 477, where at each vertex the solid exterior angle is by 
definition the sector of the unit ball bounded by planes normal to the edges. This 
result was lost until 1860, so it is only in hindsight that we can see it as a 
forerunner of the Gauss-Bonnet theorem. The surface of the polyhedron is closed 
(without boundary), so the first two terms in (5) do not appear. To make the 
connection with the Gauss-Bonnet formula, it is necessary to view the polyhedron 
as the limit of smooth surfaces. If curvature is correctly named, it is then 
reasonable that the third term in (5) approaches the sum of the solid angles, and 
Gauss proved this to be true. Since the Euler characteristic of a polyhedron can be 
computed to be two, we again have a special case of the Gauss-Bonnet theorem. 

Let us now consider Stillwell’s book itself. It offers a guided historical tour of 
mathematics from the time of Pythagoras to that of Godel with separate chapters 
on subjects such as elliptic functions, differential geometry, and projective geome- 
try, which are totally absent from most modern curricula. Also present are 
discussions of standard historical topics such as Greek geometry and the origins of 
calculus, topology, and group theory. The chapters are arranged in rough chrono- 
logical order, and each concludes with a brief biography of one or two of the great 
mathematicians who have been most active in developing the topic discussed. 
Interspersing the mathematics with biographical sketches lightens the flow of ideas 
and reminds the reader that mathematics is created by interesting human beings. 
The exposition is masterfully linked together so that certain major themes such as 
curves and the solution of equations reappear in several chapters as the topic is 
attacked with different tools. Fundamental results such as Markov’s theorem 
asserting the unsolvability of the homeomorphism problem for four manifolds, 
Bezout’s theorem counting the intersections of algebraic curves, and Hermite’s 
solution of the quintic using elliptic functions are discussed with some background. 
There are also statements of unsolved problems such as the Riemann hypothesis 
and the Poincaré conjecture that are part of basic mathematical culture and which 
ought to be mentioned in the undergraduate training. The discussion is at a deep 
enough level that I suspect most trained mathematicians will find much that they 
do not know as well as good intuitive explanations of familiar facts. The careful 
exposition, lightness of touch, and the absence of technicalities should make the 
book accessible to most senior undergraduate mathematics majors. 

The selection of topics in a book of this sort is, of course, very much a matter of 
personal taste. Stillwell has chosen to say little about probability, statistics, or 
discrete mathematics, and applied mathematics is represented only by mechanics 
(in a very nice chapter which touches on Fourier series and the calculus of 
variations). I would have liked also to see a discussion of the origins of the Central 
Limit Theorem and of graph theory. The twentieth century is represented only by 
set theory, logic, and computation. A brief treatment of Lie groups and how they 
are used in modern physics would have presented a more balanced picture of 
contemporary mathematics. 

The book contains interesting exercises often on topics which would enliven 
standard courses, for example, continued fractions, tesselations, and the pretty 
series expansion 
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(if you are unable to verify this formula, see page 187.) The bibliography is 
excellent. It is possible, for instance, to begin a more thorough study of cubics (as I 
have done) by following up the references cited. 

There are of course slips and imprecisions in a book which strives to be a 
readable and nontechnical introduction to a very broad subject. In particular, there 
is an incorrect statement of the Riemann hypothesis ignoring the existence of the 
trivial zeros at the negative even integers (p. 129) and an assertion of the 
convergence of integrals in the Lebesgue theory with no hypothesis whatsoever 
(p. 318). Overall, however, the accuracy is good and the fluency of the writing and 
cogency of the arguments are extremely high. I recommend this book both to the 
professional for personal enjoyment and for use as a text to impart a large amount 
of mathematical culture to advanced students. If you are seeking a text for a 
capstone course, consider this book. 
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General, P*. Moving Beyond Myths: Revitaliz- 
ing Undergraduate Mathematics. National Research 
Council. National Academy Pr, 1991, ix + 65 pp, 
(P). [ISBN: 0-309-04489-8] The long-awaited “MS 
2000” report containing summary analysis and rec- 
ommendations from the National Research Council’s 
Committee on Mathematical Sciences in the Year 
2000. Challenges U.S. colleges and universities to 
achieve the same stature for mathematics education 
as has been achieved during the past four decades in 
mathematical research. LAS 


Elementary, T(13: 1). Geometry for College Stu- 
dents, Second Edition. Peter B. Geltner, Darrell 
J. Peterson. PWS-Kent, 1991, xiii + 336 pp, $29. 
[ISBN: 0-534-92472-7] Precise coverage of elemen- 
tary concepts of traditional Euclidean geometry for 
college students with no geometry background. This 
edition has two-color format, rewritten definitions, 
more examples and exercises, and a new section on 
transformations, but retains two-column proofs. JNC 


Education, P, L. Developments in School Mathe- 
matics Education Around the World, Volume 2. Eds: 
Izaak Wirszup, Robert Streit. NCTM, 1990, xiv + 
475 pp, $20 (P). [ISBN: 0-87353-302-X] 25 papers 
presented in April 1988 at the Second International 
Conference on Mathematics Education sponsored by 
UCSMP, the University of Chicago’s School Math- 
ematics Project, supplemented by other invited re- 
ports. In three sections: international reports on the 
process of curriculum reform, research in mathemat- 
ics education, and UCSMP reports. LAS 


Education, P*. A Call For Change: Recommenda- 
tions for the Mathematical Preparation of Teachers 
of Mathematics. Ed: James R.C. Leitzel. MAA Re- 
port. MAA, 1991, xiv + 47 pp, (P). [ISBN: 0-88385- 
072-9] Standards for undergraduate mathematical 
experiences for teachers in K-4, 5-8, and 9-12, build- 
ing on six standards appropriate for all mathematics 
teachers. Not organized by courses but by “experi- 
ences” requiring at least 9 semester hours for K-4, 
15 semester hours for 5-8, and a full mathematics 
major for 9-12. Written in parallel with the just- 
released NCTM Professional Standards for Teaching 


Mathematics (TR, May 1991). Emphasizes need to 
improve how college mathematics is taught in order 
that prospective teachers will learn by example how 
they can meet the NCTM Standards for school teach- 
ing. LAS 


Education, P*. Counting On You: Actions Sup- 
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Research Council, 1991, 17 pp, (P). A brief policy 
synopsis from MSEB that highlights changes recom- 
mended in the two NCTM Standards documents, in 
the new MAA recommendation on teacher education 
(A Call for Change), and in previous MSEB reports. 
Intended to help mobilize public and corporate sup- 
port for improvements in mathematics teaching. LAS 
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[ISBN: 3-525-40312-7] Revised texts of ten papers 
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Zariski. Carol Parikh. Academic Pr, 1991, xvii 
+ 264 pp, $32.95. [ISBN: 0-12-545030-3] A com- 
pelling account of Zariski’s extraordinary life (com- 
plementing his “real life” which is recorded in his 
scientific papers) based on interviews with family, 
colleagues, and students, and on tape-recorded in- 
terviews Zariski completed a few years before his 
death in 1986. Introduced by a foreword for non- 
mathematicians by Mumford; illustrated throughout 
with family photographs; concludes with a mathe- 
matical survey of Zariski’s mathematical work (by 
Artin, et al.) and a complete Zariski bibliography. 
Award-winning author Parikh brings to life the in- 
tellectual ferment and international dimensions of 
Zariski’s peripatetic career. LAS 


Logic, T*(15-17: 1), L*. Introduction to Modern 
Set Theory. Judith Roitman. Wiley, 1990, xiii + 
156 pp, $39.95. [ISBN: 0-471-63519-7] Well-written 
text provides introduction to modern and classical 
set theory. Includes models of set theory, consis- 
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tency results, large cardinals. No knowledge of logic 
assumed. KES 

Logic, T(15-17), P, L. Classical Recursion The- 
ory: The Theory of Functions and Sets of Natural 
Numbers. Piergiorgio Odifreddi. Stud. in Logic & 
Found. of Math., V. 125. North-Holland (US Distr: 
Elsevier Science), 1989, xvii + 668 pp, $110.50. 
[ISBN: 0-444-87295-7] Text and reference manual 
covers recursive and partial recursive functions, re- 
cursively enumerable sets, hierarchies, Turing de- 
grees, and m-degrees. Includes applications to logic, 
computer science, and other areas of mathematics. 
Note price. KES 


Foundations, S(16-18), P, L. Creative Un- 
derstanding: Philosophical Reflections on Physics. 
Roberto Torretti. Univ of Chicago Pr, 1990, xvi 
+ 369 pp, $32.50 (P); $79.95. [ISBN: 0-228-80835- 
1; 0-226-80834-3] Einstein noted that concepts and 
fundamental laws of physics are not derived from ex- 
perience, for they are “free inventions of the human 
mind.” The aim of this book is to elucidate the work- 
ings of that creative understanding. Divided into 
five chapters labelled Observations, Concepts, Theo- 
ries, Probability, and Necessity. The writing is com- 
pelling; the work makes a significant contribution to 
the philosophy of science. LCL 


Foundations, P. Higher Recursion Theory. Gerald 
E. Sacks. Perspectives in Math. Logic. Springer- 
Verlag, 1990, xv + 344 pp, $89. [ISBN: 0-387-19305- 
7] Extensions of classical recursion theory (CRT), 
including hyperarithmetic sets (lifting CRT to least 
admissible set after w), metarecursion (CRT on the 
recursive ordinals), a-recursion (CRT on the admis- 
sible ordinals), and H-recursion (extension to sets of 
arbitrary rank). RM 

Foundations, T*(13-14: 1), S*, L*. How To 
Read and Do Proofs: An Introduction to Mathe- 
matical Thought Processes, Second Edition. Daniel 
Solow. Wiley, 1990, xx + 242 pp, $19.95 (P). (ISBN: 
0-471-51004-1] Revised edition includes new chap- 
ter on nested quantifiers and more exercises. Excel- 
lent text for introduction to proof courses or supple- 
ment for other courses involving proof writing. (First 
Edition, TR, December 1982.) KES 


Combinatorics, P. Combinatorial Games. Eds: 
Richard K. Guy, et al. Proc. of Symposia in 
Appl. Math., V. 43. AMS, 1991, xii + 233 pp, $52. 
{ISBN: 0-8218-0166-X] Ten papers from an AMS 
short course, including analysis of Gd and connec- 
tions with coding theory, graph theory, number the- 
ory, and complexity theory. Contains unsolved prob- 
lems and a selected bibliography. JPH 


Combinatorics, T(16-17: 2), S. Combinato- 
rial Designs: Construction Methods. Ian Anderson. 
Math. & Its Applic. Ellis Horwood (US Distr: Pren- 
tice Hall), 1990, 290 pp, $79.95. [ISBN: 0-7458-0349- 
0] ‘Treats block designs, orthogonal Latin squares, 
Steiner systems, and tournament designs. Draws on 
research results through 1990. Suitable for seniors or 
graduate students. Prerequisite: linear algebra. AD 
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crete Algorithmic Mathematics. Stephen B. Mau- 
rer, Anthony Ralston. Addison-Wesley, 1991, xix 
+ 894 pp, $38. [ISBN: 0-201-15585-0] Algorithmic 
approach but programming not a prerequisite. Possi- 
ble use as a year-long alternative to calculus for well- 
prepared freshmen. Covers induction, graphs, count- 
ing methods, difference equations, probability, logic, 
sequences, series, and generating functions. Exten- 
sive chapter on basics and on linear algebra. AD 
Linear Algebra, T(17-18: 1, 2), P, L. Topics 
in Matriz Analysis. Roger A. Horn, Charles R. 
Johnson. Cambridge Univ Pr, 1991, viii + 607 pp, 
$59.50. (ISBN: 0-521-30587-X] A sequel to Matriz 
Analysis by the same authors (TR, January 1987). 
This volume contains a more specialized set of top- 
ics: the field of values, matrix stability and iner- 
tia, singular value inequalities, matrix equations and 
Kronecker products, Hadamard products, and ma- 
trix functions. Designed to be useful as a reference 
and as a graduate-level textbook. AO 


Linear Algebra, T(16-17: 2), L. An Introduc- 
tion to Linear Algebra. L. Mirsky. Dover, 1990, viii 
+ 440 pp, $10.95 (P). [ISBN: 0-486-66434-1] Ab- 
stract, rigorous approach to linear algebra. In addi- 
tion to studying vector spaces, linear operators, and 
determinants, text covers canonical forms, orthogo- 
nal and unitary matrices, and quadratic forms. LC 


Linear Algebra, S(14), C. MAX: The MAtrix 
Algebra Calculator. (IBM PC Software.) Charles 
H. Jepsen, Eugene A. Herman. Brooks/Cole, 1988; 
Linear Algebra Problems for Computer Solution, v 
+ 144 pp, (P). [ISBN: 0-534-09300-0] A straight- 
forward package for performing matrix calculations 
encountered in the typical first course in linear alge- 
bra. The accompanying text contains sample prob- 
lems paralleling the standard course syllabus, mostly 
based on 160 selected matrices that come as part of 
the package. (New matrices can be entered from the 
keyboard or from a file.) LAS 


Group Theory, S(18), P. Lecture Notes in 
Mathematics-1456: Groups—Canberra 1989. Ed: 
L.G. Kovacs. Springer-Verlag, 1990, xii + 198 pp, 
$25 (P). [ISBN: 0-387-53475-X] Collection of papers 
presented during the Australian National Univer- 
sity Group Theory Program in 1989, mostly during 
the Third International Conference on Group The- 
ory and Related Topics in Canberra. Contains eight 
survey articles and eight research papers. List of par- 
ticipants. JS 


Group Theory, $(18), P. Discrete Subgroups of 
Semisimple Lie Groups. G.A. Margulis. Ser. of Mod. 
Surveys in Math., Band 17. Springer-Verlag, 1991, ix 
+ 388 pp, $85. [ISBN: 0-387-12179-X] Deals mainly 
with structure and classification problems for dis- 
crete subgroups of Lie groups, concentrating on lat- 
tices, which are defined as discrete subgroups of fi- 
nite covolume. Topics include density and ergodicity, 
factor groups, boundary theory, rigidity, and arith- 
meticity. Appendices, historical notes, references, in- 
dex. JS 


Algebra, T*(16-18), S*, P*, L*. Algebra, Vol- 
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umes I-II. B.L. van der Waerden. Transl: John R. 
Schulenberger. Springer-Verlag, 1991, $29.95 each. 
Volume I, xiv + 265 pp, [ISBN: 0-387-97424-5]; Vol- 
ume II, xii + 284 pp. [ISBN: 0-387-97425-3] This 
classic work, a reprint of the Seventh Edition, still 
sets the standard for the discipline, and offers so 
much of the best mathematics that it will never be- 
come unfashionable. Volume I contains the funda- 
mentals of sets, groups, rings, ideals, and fields, fol- 
lowed by Galois theory and the theory of commuta- 
tive fields. Volume II continues with valuation the- 
ory, group representations, theory of algebraic func- 
tions, and topological algebra. LCL 


Algebra, T(17-18: 2), P, L. The Symmet- 
ric Group: Representations, Combinatorial Algo- 
rithms, and Symmetric Functions. Bruce E. Sagan. 
Wadsworth,. 1991, xv + 197 pp, $49.95. [ISBN: 0- 
534-15540-5] Aims to bring together many of the 
important results and techniques in the study of rep- 
resentations of the symmetric group. Except for 
some knowledge of linear algebra and elementary 
group theory, the book is relatively self-contained. 
It begins with an introduction to group represen- 
tations. The second chapter studies representa- 
tion of S,, approaching it from the viewpoint of 
Specht modules. A combinatorial approach is used 
in Chapter 3. Included are the Frame-Robinson- 
Thrall hook formula, the Frobenius- Young determi- 
nantal formula, the Robinson-Schensted-Knuth al- 
gorithm, and Schtitzenberger’s jeu de taquin. Fi- 
nally, in Chapter 4, the symmetric functions, in- 
cluding the Schur functions, are introduced. The 
Jacobi-Trudi determinants, giving the Schur func- 
tions in terms of the elementary and complete sym- 
metric functions, is proved. The book ends with 
the Littlewood-Richardson rule and the Murnaghan- 
Nakayama rule. LC 


Algebra, P. Cohen-Macaulay Modules over Cohen- 
Macaulay Rings. Yuji Yoshino. London Math. Soc. 
Lect. Note Ser., V. 146. Cambridge Univ Pr, 1990, 
177 pp, $24.95 (P). [ISBN: 0-521-35694-6] Revised 
version of lectures given at Tokyo Metropolitan Uni- 
versity in 1987. Book addresses the following ques- 
tion: How can we classify CM modules over a given 
CM ring, or classify CM rings which have essen- 
tially a finite number of CM modules? The author 
presents a systematic, largely self-contained treat- 
ment of the subject. LCL 


Algebra, P. Lecture Notes in Mathematics-1448: 
Non-Commutative Ring Theory. Eds: S.K. Jain, 
S.R. Lopez-Permouth. Springer-Verlag, 1990, v + 
166 pp, $19 (P). [ISBN: 0-387-53164-5] The pro- 
ceedings (sixteen papers) of the Midwest Conference 
held at Ohio University in 1989. The intention for 
the conference was to emphasize and revitalize the 
interaction between researchers throughout the ring 
theoretic spectrum. LCL 


Algebra, P. The Polynomial Identities and Invari- 
ants of n x n Matrices. Edward Formanek. CBMS 
Reg. Conf. Ser. in Math., No. 78. AMS, 1991, v + 
57 pp, $22 (P). [ISBN: 0-8218-0730-7] Results in 
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theory of polynomial identities related to the ring of 
nm X n matrices over a field or commutative ring. In- 
cluded are Kaplansky’s Theorem, Posner’s Theorem, 
and Artin’s Theorem. LC 


Calculus, T(13: 1-4). Calculus, One and Sev- 
eral Variables. Robert Ellis, Denny Gulick. Har- 
court Brace Jovanovich, 1991, xiv + 1121 pp, $29.95. 
[ISBN: 0-15-505692-1] A ‘black-and-white’ version 
of the Fourth Edition of their calculus text; the idea 
being to offer an inexpensive text ($29.95!). It does, 
however, differ from the previous edition in two ways: 
applications of the integral precedes techniques for 
integration, and a chapter on complex numbers and 
functions is included. (First Edition, TR, August- 
September 1978; Second Edition, TR, October 1986; 
Third Edition, TR, June-July 1988; Fourth Edition, 
TR, November 1990.) KS 


Calculus, S(13), C. Calculus Calculator. David 
Meredith. Prentice Hall, 1991, xii + 205 pp, $15 (P); 
$25 book and disk. [ISBN: 0-13-117441-X] An elec- 
tronic blackboard on an IBM PC disk (inside book 
jacket). Two windows show input and output, the 
latter saved in a stack for subsequent use. Simple 
commands produce graphs in two and three dimen- 
sions. Includes capabilities for complex numbers and 
vector analysis. One chapter in accompanying man- 
ual is devoted to programming and development of 
scripts for instructional use. Manual contains dozens 
of applications, both of the calculator and program- 
ming features. Well-designed user interface is easy 
to learn and use. Requires 512K RAM. LAS 


Complex Analysis, P. Several Complez Vari- 
ables VI: Complex Manifolds. Eds: W. Barth, R. 
Narasimhan. Encylop. of Math. Sci., V. 69. Springer- 
Verlag, 1990, 310 pp, $59. [ISBN: 0-387-52788- 
5] Five longer survey papers, in expository style, 
on aspects of complex function theory and complex 
spaces: proper holomorphic mappings, Hodge the- 
ory, holomorphic transformation groups, surfaces, 
Teichmiller spaces. Each contains an extensive bib- 
liography and a historical review. Published in honor 
of R. Remmert’s sixtieth birthday. PZ 


Complex Analysis, P**. An Introduction to Mul- 
ticompler Spaces and Functions. G. Baley Price. 
Pure & Appl. Math., V. 140. Marcel Dekker, 1991, 
xiii + 402 pp. [ISBN: 0-8247-8345-X] Multicom- 
plex spaces are higher dimensional generalizations of 
the complex numbers, Banach algebras in their own 
right, obtained inductively by appropriately adjoin- 
ing additional roots of -1. Such spaces support a 
theory analogous (in some respects, not in others) 
to classical complex function theory. Here the the- 
ory of the “bicomplex” space (with real dimension 
4) is studied in detail; generalizations to higher di- 
mensions are treated more briefly in a final chapter. 
Exposition is admirable: clear, readable, rich with 
exercises and useful asides to the reader. PZ 


Differential Equations, S, C. Phaseplane, The 
Dynamical Systems Tool, Version 3.0. Bard Er- 
mentrout. Brooks/Cole, 1990, xi + 131 pp, (P). 
[ISBN: 0-534-12894-7] An IBM-PC software pack- 


578 


age for numerical solution and trajectory display of 
ordinary differential and difference equations. Em- 
ploys several methods for numerical integration (Eu- 
ler, Runge-Kutta, Adams, Gear) and for calculation 
of equilibria. Menu-oriented screen display provides 
graphs with numerous options. Manual provides an 
overview of the various commands followed by ten 
tutorial examples based on built-in equations. Ref- 
erence section provides details on file structures. LAS 


Differential Equations, T*(14-15: 1), S, L*. 
Differential Equations: A Dynamical Systems Ap- 
proach, Part I: Ordinary Differential Equations. 
John H. Hubbard, Beverly H. West. Texts in Appl. 
Math., V. 5. Springer-Verlag, 1991, xix + 348 pp, 
$39. [ISBN: 0-387-97286-2] A vivid, visual intro- 
duction to differential equations based less on an- 
alytic technique than on computer-assisted graphs 
to understand functions defined by differential equa- 
tions. Makes effective use of descriptive concepts like 
“funnel,” “fence,” and “antifunnel” to identify im- 
portant behavioral features. Chapters treat qualita- 
tive, analytic, numerical, and iterative methods as 
well as existence and uniqueness theorems. Part I 
comprises first five of seventeen chapters. Although 
intended for use with MacMath, the software package 
developed by the authors for this course, this text can 
be used with any system of similar capabilities. LAS 


Partial Differential Equations, P. Asymptotic 
Distribution of Eigenvalues of Differential Opera- 
tors. Serge Levendorskii. Math. & Its Applic., V. 53. 
Kluwer Academic, 1990, xviii + 279 pp, $112. 
[ISBN:0-7923-0539-6] Using the Weyl-Hormander 
calculus of pseudodifferential operators as a principal 
tool, this book describes ways to compute and ana- 
lyze the asymptotics of eigenvalues of wide classes of 
differential operators. A book for researchers; not a 
text, and not for novices. AWR 


Partial Differential Equations, T(18), S, P, 
L. Variational Methods: Applications to Nonlinear 
Partial Differential Equations and Hamiltonian Sys- 
tems. Michael Struwe. Springer-Verlag, 1990, x1 
+ 244 pp, $39.50. [ISBN: 0-387-52022-8] Carefully 
written state-of-the-art overview. Begins with clas- 
sical direct method and some recent extensions. Fol- 
lowed by chapter on minimax methods, and closes 
with chapter emphasizing limiting cases of the Palais- 
Smale condition. Preface provides nice brief history. 
Extensive bibliography. KS 


Numerical Analysis, S(17). Mathematical Anal- 
ysis and Numerical Methods for Science and Tech- 
nology, Volume 4: Integral Equations and Numeri- 
cal Methods. Robert Dautray, Jacques-Louis Lions. 
Transl: John C. Amson. Springer-Verlag, 1990, x 
+ 465 pp, $89. [ISBN: 0-387-50209-2] The fourth 
in a six-volume series, contributed by well-known ex- 
perts. Treats integral equations, mixed problems and 
Tricomi equations; comprehensive coverage of finite 
element methods for stationary boundary value prob- 
lems. Studies briefly numerical methods for integral 
equations. LS 


Numerical Analysis, P. Lecture Notes in Mathe- 
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matics-1457: Preconditioned Conjugate Gradient 
Methods. Eds: O. Axelsson, L. Yu. Kolotilina. 
Springer Verlag, 1990, 196 pp, $25 (P). [ISBN: 0-387- 
53515-2] Eleven papers given at a 1989 conference 
at the University of Nijmegen, The Netherlands, are 
collected here in a book uncompromisingly directed 
to experts in the field. No effort in the preface or in 
any article to give an overview of the field, a context 
for the articles, or an indication of how they relate 
to each other. AWR 


Numerical Analysis, T(2). Numerical Meth- 
ods for Conservation Laws. Randall J. LeVeque. 
Lect. in Math. Birkhauser Boston, 1990, 214 pp, 
$24.50 (P). [ISBN: 3-8176-2464-3] Lecture notes for 
a mathematics course on conservation laws. Part I 
deals with theory of structure and solutions of the 
corresponding nonlinear systems. Part II deals with 
numerical methods, with emphasis on general tools 
rather than specific methods in detail. LS 


Operator Theory, T(18), S, P. Unbounded Op- 
erator Algebras and Representation Theory. Konrad 
Schmiidgen. Operator Theory: Adv. & Applic., V. 
37. Birkhauser Boston, 1990, 380 pp, $79.50. [ISBN: 
0-8176-2321-3] The graph topology, topologies of 
O*-algebras, the generalized Calkin algebra, com- 
mutants of O*-algebras and *-representations, com- 
pletely strongly positive mappings and the decom- 
position theory of states and *-representations are 
studied in detail. KS 


Functional Analysis, S(18), P. Lecture Notes in 
Mathematics-1449: Minimal Projections in Banach 
Spaces. Whodzimierz Odyniec, Grzegorz Lewicki. 
Springer-Verlag, 1990, viii + 168 pp, $19 (P). (ISBN: 
0-387-53197-1] Mainly devoted to the solution of 
uniqueness problems for minimal projections in Ba- 
nach spaces. Also exhibits relevance of this prob- 
lem to the uniqueness problem in mathematical pro- 
gramming and to the problem of characterization of 
Hilbert spaces. KS 


Analysis, P. Lecture Notes in Mathematics-1453: 
Global Analysis—Studies and Applications IV. Eds: 
Yu. G. Borisovich, Yu. E. Gliklikh. Springer Ver- 
lag, 1990, 320 pp, $35 (P). (ISBN: 0-387-53407-5] 
Translation from Russian of nineteen articles from 
1988-1989 in global analysis and nonlinear equations 
and in topological and geometric methods of analy- 
sis. Also includes a tribute to Kolmogorov. OJ 


Analysis, P. Representation of Lie Groups and Re- 
lated Topics. Eds: A.M. Vershik, D.P. Zhelobenko. 
Adv. Stud. in Contemp. Math., V. 7. Gordon and 
Breach, 1990, xiv + 557 pp, $294. [ISBN: 2-88124- 
678-8] Collection of articles on recent work in the 
Soviet Union on representations of “big” groups— 
ones that act like infinite dimensional Lie groups. 
Includes work on current groups (groups of sections 
of fiberings with a locally compact fibre), infinite lin- 
ear groups (“classical” groups of operators on Hilbert 
spaces), and automorphism groups of manifolds. RM 
Analysis, P. Lecture Notes in Mathematics-1452: 
Number-Theoretic Analysis. Eds: H. Hlawka, R.F. 
Tichy. Springer-Verlag, 1990, v + 220 pp, $24 
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(P). [ISBN: 0-387-53408-3] Continuation of seminar 
proceedings appearing in volumes 1114 and 1262. KS 


Algebraic Geometry, P. The Curves Seminar at 
Queen’s, Volume VII. Anthony V. Geramita. Pa- 
pers in Pure & Appl. Math., No. 85. Queen’s U, 
1990, 307 pp, (P). Contains two expository arti- 
cles by Geramita (The non-singular cubic surface in 
P3 — II) and by Robbiano (Hilbert functions), and 
nine research papers. SG 


Algebraic Geometry, P. Degeneration of Abelian 
Varieties. Gerd Faltings, Ching-Li Chai. Ser. of 
Modern Surveys in Math., Band 22. Springer-Verlag, 
1990, xii + 316 pp, $39.80. [ISBN: 0-387-52015- 
5] Study of degenerations (given by maps G — S, 
S an irreducible scheme, G a group variety with 
abelian variety as generic fibre), and applications to 
constructions of moduli spaces, diophantine geome- 
try. RM 


Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1454: p-adic Analysis. Eds: F. Baldassarri, 
S. Bosch, B. Dwork. Springer-Verlag, 1990, v + 382 
pp, $41 (P). [ISBN: 0-387-53477-6] A collection of 
twenty-plus papers touching on such topics as Drin- 
feld modules, rigid analytic spaces, zeta functions, 
approximations, and others. SG 


Differential Geometry, T?(18), P. Minimal Sur- 
faces, Stratified Multivarifolds, and the Plateau Prob- 
lem. Dao Trong Thi, A.T. Fomenko. Transl. of 
Math. Mono., V. 84. AMS, 1991, ix + 404 pp, 
$152. (ISBN: 0-8218-4536-5] Good overview of cur- 
rent approaches and results for Plateau’s problem. 
Presents solutions by Fomenko in the class of spectra 
of manifolds with fixed boundary and by Dao Trong 
Thi in the homotopy class of multivarifolds with a 
given boundary. Designed for advanced graduate 
students and researchers. Assumes basic knowledge 
in differential geometry, differential equations, alge- 
braic topology, and the calculus of variations. The 
introduction on the one-dimensional Plateau prob- 
lem and the first chapter on the history of Plateau’s 
problem are accessible to a wider audience. Exten- 
sive bibliography. OJ 

Geometry, S*(7-13). Similarity. Tom M. Apos- 
tol. Project Mathematics! California Institute of 
Technology (Distr: MAA), 1990, 28 minute video- 
tape. Program Guide and Workbook, 30 pp, (P). One 
of a series of motivational and instructional video- 
tapes intended primarily for students in grades 7-9. 
Production is consistent with normal TV fare: an- 
imation, stock footage, voice overlays, background 
music, etc. Focuses on scale models (moon lander) 
and scaling (maps) as an ubiquitous application of 
similarity—a “great triumph of Euclidean geome- 
try.” Uses computer graphics with gimmicks (ray 
guns for enlargement and reduction) to illustrate ra- 
tios, perimeters, areas, and volumes. Divided into 
“slates” coded with small on-screen numbers that 
are coordinated with the accompanying Workbook. 
Tapes are available from state Departments of Edu- 
cation, from the MAA, and from the Cal Tech book- 
store. LAS 
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Topology, T(15-17), S, L. Elementary Topology, 
Second Edition. Michael C. Gemignani. Dover, 1990, 
xi + 270 pp, $6.95 (P). [ISBN: 0-486-66522-4] An 
unabridged and corrected republication of the 1972 
Second Edition of the work originally published by 
Addison-Wesley in 1967 (TR, January 1968; Ex- 
tended Review, January 1969; 1972 edition, TR, Oc- 
tober 1973). An introductory text in which abstract 
concepts are introduced within the concrete context 
from which they developed (e.g., geometry, analy- 
sis, etc.). Topics include fundamentals of metric 
spaces, topologies, convergence, compactness, con- 
nectedness, homotopy theory, paracompactness, and 
complete regularity. A good buy. LCL 


Operations Research, T(17-18: 1, 2), P. Dis- 
crete Location Theory. Eds: Pitu B. Mirchan- 
dani, Richard L. Francis. Wiley, 1990, xiv + 555 
pp, $64.95. [ISBN: 0-471-89233-5] A collection of 
twelve articles written by distinguished researchers. 
Provides an overview of the theory and its applica- 
tion in real-world discrete location problems. The 
articles discuss the p-median problem, the uncapaci- 
tated facility location problem, the multi-period ca- 
pacitated location problem, decomposition methods, 
covering problems, the quadratic assignment prob- 
lem, competitive location problems, and the location 
of mobile units in a stochastic environment. Exer- 
cises and bibliographic references are included at the 
end of each chapter. AO 


Optimization, S(17), P, L. Introduction to Min- 
imaz. V.F. Dem’yanov, V.N. Malozemov. Transl: 
D. Louvish. Dover, 1990, vii + 307 pp, $8.95 (P). 
(ISBN: 0-486-66423-6] This translation nicely pulls 
together in one place topics commonly spread out 
in American schools over approximation theory, the 
calculus of several variables, game theory, and lin- 
ear programming. Chapter 1 treats best polynomial 
approximations (in the Chebyshev norm) in the dis- 
crete case, Chapter 2 in the continuous case. Chap- 
ter 3 discusses the linear minimax problem without 
constraints, and the same problems (still the discrete 
case) with convex constraints are taken up in Chap- 
ters 4and 5. Chapter 6 concludes the matter by tak- 
ing up the continuous case. The price, coupled with 
the quality of printing, makes this book a real bar- 
gain. (1974 Halsted Press edition, TR, March 1975.) 
AWR 


Optimization, T*(15-17), L. Introduction to 
Mathematical Programming: Applications and Algo- 
rithms. Wayne L. Winston. PWS-Kent, 1991, xi + 
794 pp. [ISBN: 0-534-92520-0] Aimed at under- 
graduate majors in business and operations research 
as well as MBA students, this text aims at being 
mathematically sound but not demanding in terms 
of preparation: no calculus is required until Chapter 
12; the basic linear algebra needed is presented in 
Chapter 2. The use and interpretation of the output 
of LINDO and GINO are included, but set in sepa- 
rate sections that can be skipped by those who wish 
to do so. An attractive entry into this market. AWR 


Optimization, P. Parametric Optimization: Sin- 
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gularities, Pathfollowing and Jumps. J. Guddat, F. 
Guerra Vazquez, H. Th. Jongen. Wiley, 1990, viii 
+ 191 pp, $85. [ISBN: 0-471-92807-0] The subject 
matter is optimization problems in which some or all 
the individual data depend on just one parameter. 
The main tools for a solution algorithm are path- 
following methods, but these are shown not to be 
successful in every case, which leads to the subject 
of jumps in the set of local minimizers. The authors 
report on work centered at Kepler University (Linz) 
and Humboldt University (Berlin). Note the price 
per page. AWR 


Dynamical Systems, S*(12-15). Chaos, Frac- 
tals, and Dynamical Systems. Robert Devaney. Sci- 
ence Television (Box 2498, Times Square Station, 
New York, NY 10108), 1989. 63 minute videotape, 
$59. (ISBN: 1-878310-00-3] A collage of materials 
on dynamical systems and Julia sets woven together 
by a lecture (a “talking head” in the upper right 
corner of the screen) that begins with generalities 
(weather, planets, stock markets) and very elemen- 
tary exercises, but moves quickly to relatively sophis- 
ticated exploration of dynamics in the plane. Incor- 
porates several of Devaney’s short films on Julia sets 
of various sine and cosine functions in the complex 
plane. Illustrates various types of critical behavior 
both with static visuals (slide-like graphs) and with 
evolving Julia set films. Although the learning slope 
from beginning to end is rather steep, the film can 
serve to motivate and illustrate dynamical systems 
at many different levels. LAS 


Modelling, S(15-17), P, L. The Art of Modeling 
Dynamic Systems: Forecasting for Chaos, Random- 
ness, and Determinism. Foster Morrison. Wiley, 
1991, xvii + 387 pp, $49.95. (ISBN: 0-471-52004-7] 
A non-technical survey, or field guide, of the meth- 
ods of mathematical modelling with expository com- 
ments to indicate what each can do and cannot do. 
Techniques are illustrated with examples that can be 
read by anyone with an undergraduate education in 
science or mathematics. The purpose is to help the 
reader use and appreciate new developments in mod- 
elling techniques, including their limitations. LCL 


Control Theory, P. Lecture Notes in Control and 
Information Sciences-146: Minimum Entropy Ho 
Control. D. Mustafa, K. Glover. Springer-Verlag, 
1990, ix + 144 pp, $30 (P). [ISBN: 0-387-52947- 
0} An investigation of the entropy minimization 
problem. The approach taken here exploits recent 
progress in the state-space solution to the H. con- 
trol problem and explicit formulae can be obtained 
for the solution. LCL 


Control Theory, P. Lecture Notes in Mathematics- 
1459: Optimal Control of Nonsmooth Distributed 
Parameter Systems. Dan Tiba. Springer-Verlag, 
1990, vii + 159 pp, $20 (P). [ISBN: 0-387-53524-1] 
Overview and recent research on distributed control 
problems governed by nonlinear evolution equations 
(parabolic and hyperbolic) in absence of differentia- 
bility. Includes nonlinear delay differential equations, 
boundary control, elliptical problems, and optimal 
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Control Theory, P. Distributed Parameter Control 
Systems: New Trends and Applications. Eds: Goong 
Chen, et al. Lect. Notes in Pure & Appl. Math., V. 
128. Marcel Dekker, 1991, x + 516 pp, (P). [ISBN: 0- 
8247-8444-8] A collection of seventeen papers based 
on invited presentations at a workshop held August 
4-11, 1989 at the University of Minnesota. AO 


Probability, T(16-18), L. Probability for Appli- 
cations. Paul E. Pfeiffer. Texts in Stat. Springer- 
Verlag, 1990, xix + 679 pp, $59.95. (ISBN: 0-387- 
97138-6] An introduction to advanced probability 
theory aimed at teaching theory while understanding 
basic examples. Chock full of understandable, illus- 
trative examples and exercises. Topics are standard: 
probability measures, set theory, independence, con- 
ditional probability, random variables and distribu- 
tions, expectation, Radon-Nikodym and conditional 
expectation, stochastic processes. MK 


Probability, P. Lecture Notes in Mathematics- 
1427: Ecole d’Eté de Probabilités de Saint-Flour 
X VITI—1988. A. Ancona, D. Geman, N. Ikeda. 
Springer-Verlag, 1990, vii + 330 pp, $35 (P). (ISBN: 
0-387-53508-X] Lecture notes from three courses 
given at l’Ecole d’Eté de Calcul des Probabilités 
de Saint-Flour, August 21-September 7, 1988. An- 
cona: “Théorie du potentiel sur les graphes et les 
variétés.” Geman: “Random fields and inverse prob- 
lems in imaging.” Ikeda: “Probabilistic methods in 
the study of asymptotics.” LC 


Stochastic Processes, S(18), P. Introduction to 
Stochastic Integration, Second Edition. K.L. Chung, 
R.J. Williams. Prob. & Its Applic. Birkhauser 
Boston, 1990, xv + 276 pp, $34.50. [ISBN: 0-8176- 
3386-3] Presents a modern version of the theory of 
stochastic integration, in particular integration with 
respect to a local martingale with continuous paths. 
This edition includes discussion of stochastic differ- 
ential equations, Cameron-Martin-Girsanov formula, 
the Feynman-Kac-Schrodinger development, and re- 
flected Brownian motions. Proofs are carefully de- 
tailed, but motivation and exposition is left for the 
reader to discover. (First Edition, TR, December 
1984.) MK 


Stochastic Processes, S(18), P. Empirical Pro- 
cesses: Theory and Applications. David Pollard. 
NSF-CBMS Reg. Conf. Ser. in Prob. & Stat., V. 2. 
Institute of Math Statistics, 1990, villi + 86 pp, 
(P). [ISBN: 0-940600-16-1] A monograph based on 
lectures given at the University of Iowa in July 
1988. Results concern sums of independent stochas- 
tic processes allowing generalizations to nonidenti- 
cally distributed random variables. Makes use of 
Vapnik-Cervonenkis classes to generalize and extend 
Glivenko-Cantelli and Donsker’s theorems for empir- 
ical distribution functions, as well as empirical cen- 
tral limit theorems, uniform laws of large numbers, 
least absolute deviation estimators, estimation for 
censored data, and more. MK 


Stochastic Processes, T(18). Continuous Mar- 
tingales and Brownian Motion. Daniel Revuz, Marc 
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Yor. Grund. der math. Wissenschaften, B. 293. 
Springer Verlag, 1991, ix + 533 pp, $89. (ISBN: 0- 
387-52167-4] Develops theory for continuous Mar- 
tingales, Markov processes, and Gaussian processes 
by focusing on the theory of Brownian motion. Im- 
portant applications include stochastic calculus and 
excursion theory. Lots of exercises ranging from 
straightforward to very difficult. MK 


Elementary Statistics, S(13-16), L. How to Tell 
the Liars from the Statisticians. Robert Hooke. Mar- 
cel Dekker, 1983, xv + 173 pp. [ISBN: 0-8247-1817-8] 
Nice collection of brief accounts showing that statis- 
tical reasoning affects much of our lives—from drug 
testing to mass production. Well-written for those 
with little or no mathematical or statistical back- 
ground. Good supplementary text in introductory- 
level courses. MK 


Statistics, P. Theory of Random Determinants. 
V.L. Girko. Math. & Its Applic., V. 45. Kluwer 
Academic, 1990, xxv + 677 pp, $245. [ISBN: 0- 
7923-0233-8] An encyclopedic coverage, expanded 
by 50% from the original Russian edition that dates 
from 1980, including original results which have not 
been published previously. The theory of determi- 
nants of random matrices (random determinants) in- 
volves the theory of probability and statistics and its 
applications. Note price! LCL 


Statistics, T(17), S, P. Conditioning Diagnostics: 
Collinearity and Weak Data in Regression. David 
A. Belsley. Prob. & Math. Stat. Wiley, 1991, xx 
+ 396 pp, $59.95. [ISBN: 0-471-52889-7] Exposes 
state-of-the-art regression diagnostic methodology. 
Updates and revises material from Regression Di- 
agnostics: Identifying Influential Data and Sources 
of Collinearity, Belsley, Kuh, and Welsch (TR, 
March 1981). Includes new data sets, extensions 
to nonlinear models, and simultaneous-equation re- 
gression. Topics include collinearity; weak data; 
ill-conditioning diagnostics; geometry of mean cen- 
tering, correlation, and least squares; condition in- 
dices; variance-decomposition proportions; interpret- 
ing collinearity diagnostics with applications; and 
discussion of influential observations. Also indicates 
open research problems. MK 


Statistics, S(13). Calculator Enhancement for In- 
troductory Statistics, Preliminary Edition. Clemson 
University Calculator Project. Eds: Iris Fetta, D.R. 
LaTorre. Harcourt Brace Jovanovich, 1991,.x + 160 
pp, $6 (P). [ISBN: 0-15-505678-6] A collection of 
applications of the programmable, graphing ealcu- 
lator (in this case, the Sharp EL-5200) to problems 
and methods of elementary probability and statistics: 
e.g., histograms, simulation, various distributions, 
sampling, hypothesis testing, linear regression, etc. 
Not a textbook or study guide: application of tech- 
nology, not mathematical content, is emphasized. 
Although tied tightly to the Sharp (e.g., programs 
are displayed as keystroke sequences), the ideas con- 
tained are readily transferable to other machines. 
Diligent use of the supplement seems, as claimed, 
likely to “enhance” understanding, both of technol- 
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ogy and of mathematics. PZ 


Elementary Computer Science, T(13-14). Ad- 
vanced LOGO: A Language for Learning. Michael 
Friendly. Lawrence Erlbaum Assoc, 1988, xiv + 
661 pp, $24.95 (P); $79.95. [ISBN: 0-89859-933-4] 
Intended for high school and college students with 
some experience in LOGO. After a quick review of 
basic LOGO, topics such as procedures, lists, recur- 
sion, trees, graphics, and dynamics are treated in 
depth. SG 


Programming, T(14-16: 1), L. Object-Oriented 
Database Programming. Suad Alagic. Texts & 
Mono. in Comput. Sci. Springer-Verlag, 1989, xv 
+ 320 pp, $49.50. (ISBN: 0-387-96754-0] A text 
on programming methodology and associated con- 
cepts, techniques, and language notation devoted to 
database applications, using an extension of Modula- 
2. “Modern programming methodology is just as 
important” in application-oriented problems as in 
system programming or mathematics-oriented algo- 
rithms. Presumes an introductory programming 
course, basic mathematics. Exercises. RB 


Algorithms, P. Uses of Randomness in Algorithms 
and Protocols. Joe Kilian. ACM Distinguished Dis- 
sertations. MIT Pr, 1990, 235 pp, $39.95. [ISBN: 
0-262-11153-5] The author’s MIT dissertation in 
which randomness is used to attack three problems: 
finding a probabilistic algorithm to test primality; 
zero knowledge proofs about committed bits; and 
proof systems for determining if a string is in a lan- 
guage. A lively and well-written book. SG 


Computer Systems, S(16-17), P. OSI Ezplained: 
End-to-End Computer Communication Standards, 
Second Edition. John Henshall, Sandy Shaw. Ser. in 
Comput. & Their Applic. Ellis Horwood (US Distr: 
Prentice Hall), 1990, 249 pp. [ISBN: 0-13-639451-5] 
Relatively accessible explanation of Open Systems 
Interconnection standards and procedures for com- 
puter networks, with special emphasis on the top four 
levels (application, presentation, session, transport) 
which specify how networked systems cooperate to 
support complex applications. RM 


Computer Systems, T(14-15: 1), S, L. A Pro- 
grammer’s Guide to File Processing. Douglas L. 
Cashing. PWS-Kent, 1991, xv + 169 pp, $28. [ISBN: 
0-534-92567-7] Language independent, conceptual 
treatment of file structures and processing, suitable 
for ACM Curriculum ’78 CS 5 course. Progresses 
from non-technical overview through implementation 
issues, including overview of database management 
and sorting. RM 


Computer Systems, P. Advanced Topics in Data- 
Flow Computing. Eds: Jean-Luc Gaudiot, Lubomir 
Bic. Prentice-Hall, 1991, xx + 626 pp. [ISBN: 0- 
13-006503-X] A monograph based on a workshop in 
Eliat, Israel associated with the 1989 Sixteenth In- 
ternational Symposium on Computer Architecture. 
Surveys of the principles and 20-year evolution of 
data-flow computer architectures; 21 research papers 
on architecture issues, data-flow/von Neumann hy- 
brids, high-level languages and compilation issues, 
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scheduling and resource management, special pur- 
pose systems. RB 


Artificial Intelligence, S$(16-17), P. Artificial In- 
telligence: A Tool for Industry and Management. 
Hans W. Gottinger, H. Peter Weimann. Ser. in Ar- 
tif. Intellig. Ellis Horwood (US Distr: Prentice Hall), 
1990, 158 pp. [ISBN: 0-13-048372-9] Overview, in 
the form of a technology assessment, of some ma- 
jor commercial developments and applications of ar- 
tificial intelligence in areas such as expert systems, 
natural language processing and speech recognition, 
robotics, database systems, and decision support sys- 
tems. RM 


Computer Science, $?(16), P, L. The Practice 
of Prolog. Ed: Leon S. Sterling. MIT Pr, 1990, 
312 pp, $37.50. [ISBN: 0-262-19301-9] Collection of 
articles covering fairly large (and interesting) appli- 
cations developed in Prolog: database prototyping, 
VLSI editor, assisting register transfer design, par- 
allelizing Pascal compiler, partial evaluation system, 
natural language explanation, learning, and stream 
data analysis. Good reading for a programmer who 
wants to see how to build real (i.e., not toy) systems 
in Prolog. RM 


Computer Science, P. Communication Complez- 
ity: A New Approach to Circuit Depth. Mauricio 
Karchmer. ACM Doctoral Dissertation Award. MIT 
Pr, 1989, 69 pp, $25. (ISBN: 0-262-11143-8] This 
volume is the author’s doctoral dissertation from 
MIT, which won the award for the outstanding Ph.D. 
dissertation of 1988. It addresses the problem of the 
circuit complexity of Boolean functions. The author 
develops a new approach for analyzing the complex- 
ity, called communicational complexity, which pro- 
vides theoretical bounds on the complexity of com- 
puter circuits. GMS 


Computer Science, S(16-17), L. Readings in 
Object-Oriented Database Systems. Eds: Stanley B. 
Zdonik, David Maier. Ser. in Data Management Sys- 
tems. Morgan Kaufmann, 1990, x + 629 pp, $39.95 
(P). [ISBN: 0-55860-000-0] Good selection of foun- 
dational and recent papers on object-oriented ap- 
proach to database systems, including fundamentals 
of object orientation, semantic models, implementa- 
tion issues, extensions to the relational model, and 
applications. RM 


Computer Science, T(15-16), S. The Craft of 
Prolog. Richard A. O’Keefe. MIT Pr, 1990, xix + 
387 pp, $34.95. (ISBN: 0-262-15039-5] Prolog is a 
programming language based on the principles of the 
first-order predicate logic. This text is an introduc- 
tion to the language as well as a primer on the use of 
logic as a vehicle for problem solving and controlled 
deduction. Prolog has received a great deal of atten- 
tion recently due to its ever-increasing use in the field 
of expert systems and artificial intelligence. GMS 

Computer Science, T(15-16), S. How to Write 
Parallel Programs: A First Course. Nicholas Car- 
riero, David Gelernter. MIT Pr, 1990, 232 pp, $29.95. 
[ISBN: 0-262-03171-X] Parallel processing, the tech- 
nique of problem solving using multiple computers, 
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is becoming an increasingly popular technique for at- 
tacking very, very large computations. The authors 
introduce the basic concept of parallel processing and 
show a number of solutions implemented using the 
C and Linda programming languages, and the pop- 
ular MIMD parallel architecture. These languages 
and architecture are the most widely used and widely 
available. GMS 


Computer Science, P, L. Beauty Is Our Busi- 
ness: A Birthday Salute to Edsger W. Dijkstra. Eds: 
W.H.J. Feijen, et al. Texts & Mono. in Comput. Sci. 
Springer-Verlag, 1990, xix + 453 pp, $49. [ISBN: 
0-387-97299-4] How to honor the sixtieth birthday 
of a researcher and teacher who has profoundly im- 
pacted the intellectual foundations of computer sci- 
ence? This commemorative volume was prepared 
in secret by dozens of prominent colleagues he has 
influenced, and was presented to him in a surprise 
birthday conference! Fifty-four eclectic contributions 
seeking the best in elegance and style. RB 


Computer Science, P. Parallel Processing for Sct- 
entific Computing. Eds: Jack Dongarra, et al. SIAM, 
1990, 454 pp, $49.50 (P). [ISBN: 0-89871-262-9] 
Proceedings of the fourth SIAM conference on the 
topic, December 1989, in Chicago. Forty-nine pa- 
pers on matrix computations, numerical methods, 
differential equations, massively parallel computa- 
tion, performance, and tools. RB 


Applications, T, S, P, L. Fourier Methods for 
Mathematicians, Scientists and Engineers. Mark 
Cartwright. Math. & Its Applic. Ellis Horwood 
(US Distr: Prentice Hall), 1990, 326 pp, $37.50. 
[ISBN: 0-13-327016-5] In two parts: Fourier The- 
ory (220 pages) and Fourier Applications (88 pages). 
The first 100 pages on series and transforms con- 
tain no surprises and are very readable. The writ- 
ing throughout is lean and clear. Chapters on con- 
volution, filtering, correlation, and transforms fol- 
low a brief chapter on two-dimensions. Part I con- 
cludes with a chapter on the discrete Fourier trans- 
form and its uses. Part II is loaded with many un- 
usual, but admittedly sketchy, applications in physics 
(quantum theory, lasers, scattering, diffraction, tele- 
scopes, interferometry, impedance), chemistry (spec- 
troscopy), life science (vision, hearing, speech analy- 
sis), and earth science (water waves, turbulence, slid- 
ing glaciers, seismology). For upper-class students 
and beginning graduate students. Exercises at chap- 
ter ends in Part I with solutions to some. Excellent 
references, especially to topics in Part II. JDEK 


Applications, P. Computing Methods in Applied 
Sciences and Engineering. Eds: Roland Glowinski, 
Alain Lichnewsky. SIAM, 1990, 446 pp, $49.50 (P). 
[ISBN: 0-89871-264-5] Proceedings of the Ninth In- 
ternational Conference on Computing Methods spon- 
sored by INRIA (1990). Papers on state-of-the-art 
methods for solutions for large linear and nonlin- 
ear systems, optimization, dynamics and turbulence, 
computational chemistry and physics, parallel and 
supercomputing. RM 


Applications (Biological Science), S(18), P, 


1991] 


L. Quantitative Methods in Landscape Ecology: The 
Analysis and Interpretation of Landscape Hetero- 
geneity. Eds: Monica G. Turner, Robert H. Gard- 
ner. Ecological Stud., V. 82. Springer-Verlag, 1991, 
xv + 536 pp, $98. [ISBN: 0-387-97326-5] Compiles 
a list of techniques used to study spatial heterogene- 
ity. Essentially a collection of papers, each with an 
extensive bibliography, divided into four main sec- 
tions: introduction, analysis and interpretation of 
patterns, model development and simulation, and 
synthesis. SM 


Applications (Biological Science), T(16-17), 
P, L. Community Food Webs: Data and Theory. 
Joel E. Cohen, Frédéric Briand, Charles M. New- 
man. Biomath., V. 20. Springer-Verlag, 1990, xi 
+ 308 pp, $79. [ISBN: 0-387-51129-6] Intended for 
community ecologists with a quantitative bent and 
teachers/researchers involved with random graphs. 
An application of combinatorics. Several unsolved 
mathematical problems are given. SM 


Applications (Economics), P. The Economic De- 
sign of Control Charts. Elart von Collani. Skripten 
zur Math. Stochastik. BG Teubner Stuttgart, 1989, 
xii + 171 pp, DM 29 (P). [ISBN: 3-519-02729- 
1] Simple graphical methods are developed to 
determine the economic design of zZ—,c—, and 
p—charts. RWJ 


Applications (Fluid Dynamics), P, L. Annual 
Review of Fluid Mechanics, Volume 23, 1991. Eds: 
John L. Lumley, Milton Van Dyke, Helen L. Reed. 
Annual Reviews, 1991, x + 674 pp, $40. As usual 
with this series,‘this volume contains an interest- 
ing mix of theory and real-world applications. Eigh- 
teen articles ranging from “Theory of Hurricanes” to 
“Symmetry and Symmetry-Breaking Bifurcations in 
Fluid Dynamics.” JO 


Applications (Information Theory), T(18), P. 
Introduction to Shannon Sampling and Interpolation 
Theory. Robert J. Marks II. Texts in Electrical 
Engin. Springer-Verlag, 1991, xiii + 324 pp, $59. 
[ISBN: 0-387-97391-5] Study of generalizations of 
the “cardinal” series (bandlimited signal) of Shan- 
non sampling theory, for sampling and subsequent 
restoration by interpolation of continuous signals and 
images. Analysis of effects of truncation, noise, jit- 
ter, on restoration error, generalizations to higher 
dimensions, optimal sampling strategies. RM 


Applications (Information Theory), T(16-17: 
1), S. Information Theory. Robert B. Ash. Dover, 
1990, xi + 339 pp, $8.95 (P). [ISBN: 0-486-66521-6] 
Relatively self-contained introduction to the theory 
of classical Shannon information theory. Topics in- 
clude analysis of channel models (with and without 
memory), coding systems and coding theorems, sta- 
tistical properties of sources, noiseless coding, error- 


correcting codes, continuous channels. Exercises. 
Reprint of the 1965 Wiley edition. RM 
Applications (Physical Science), P. Model- 


ing One Dimensional Pattern Formation by Anti- 
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diffusion. J.F. Kaashoek. CWI Tract, V. 72. Math- 
ematisch Centrum, 1990, vi + 271 pp, Dfl 67 (P). 
[ISBN: 90-6196-389-3] Introduces the definition of 
pattern formation, providing physical backgrounds 
and examples. Presents models of phase transition, 
reaction-diffusion equations, mass conservation evo- 
lution equations, etc. Investigates theoretical prop- 
erties of solutions. Reports numerical simulations for 
the models. LS 


Applications (Physics), S(18), P. Wavelets: 
Time-Frequency Methods and Phase Space, Second 
Edition. Eds: J.M. Combes, A. Grossmann, Ph. 
Tchamitchian. Springer-Verlag, 1990, ix + 331 pp, 
$59 (P). [ISBN: 0-387-53014-2] Proceedings of an 
International Conference held in Marseille, France, 
December 14-18, 1987. MU 


Applications (Physics), P, L*. Selected Pa- 
pers, S. Chandrasekhar, Volume 6: The Mathemat- 
ical Theory of Black Holes and of Colliding Plane 
Waves. S. Chandrasekhar. Univ of Chicago Pr, 
1991, xix + 739 pp, $35 (P); $79.95. [ISBN: 0-226- 
10101-0; 0-226-10100-2] Final volume in these Se- 
lected Papers. Thirty-seven papers, mostly dating 
from the years 1975-85, in which Chandrasekhar re- 
turned to a subject he pioneered forty years earlier to 
develop “ab initio” the whole theory of black holes. 
Introduced by an extraordinarily warm and engag- 
ing personal retrospective on this phase of Chandra’s 
career written by Chandra’s colleague and frequent 
co-author Basilis Xanthopoulos, who was murdered 
in Crete on November 27, 1990 while giving a semi- 
nar lecture. (Chandra describes their association as 
“the most binding in all my years in science.” ) LAS 


Applications (Physics), P. Spectral and Scatter- 
ing Theory for Wave Propagation in the Perturbed 
Stratified Media. Ricardo Weder. Appl. Math. Sci., 
V. 87. Springer-Verlag, 1991, 188 pp, $39. [ISBN: 
0-387-97357-5] A stratified medium is one whose 
physical properties depend only upon a single coordi- 
nate, whereas if perturbed, then locally the physical 
properties depend on all the coordinates. An un- 
derstanding of wave propagation in these media is 
needed for areas such as ocean acoustics, integrated 
optics, and wave guides. This self-contained text in- 
cludes many new results and methods published for 
the first time. MPR 


Reviewers 


RB: Richard Brown, St. Olaf; JNC: Judith N. Cederberg, 
St. Olaf; LC: Laura Chihara, St. Olaf; AD: Amy Davidow, 
Macalester; SG: Steven Galovich, Carleton; JPH: Joan P. 
Hutchinson, Macalester; OJ: Ockle Johnson, St. Olaf; RWJ: 
Roger W. Johnson, Carleton; MK: Michael Kahn, St. Olaf; 
JDEK: Joseph D.E. Konhauser, Macalester; LCL: Loren C. 
Larson, St. Olaf; SM: Steve McKelvey, St. Olaf; RM: Richard 
Molnar, Macalester; AO: Arnold Ostebee, St. Olaf; MPR: 
Matthew P. Richey, St. Olaf; AWR: A. Wayne Roberts, 
Macalester; KS: Karen Saxe, St. Olaf; GMS: G. Michael 
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Calculus 
Reform 


PRIMING THE CALCULUS PUMP: 
INNOVATIONS AND RESOURCES 


Prepared by the CUPM Subcommittee 
on Calculus Reform and the 
First Two Years 


Thomas W. Tucker, Editor 


Among our many educational crimes, 
the one that bears the greatest respon- 
sibility for the shortage of young Ameri- 
cans choosing mathematics, physics or 
engineering aS careers may very well 
be the wretched way we teach calcu- 
lus. 


Peter D. Lax 


In the four years since the Tulane confer- 
ence issued a call for a “lean and lively 
calculus,” there have been countless ar- 
ticles, contributed paper sessions, talks, 
and panels. There have also been many 
individuals and institutions who have re- 
sponded with energy and imagination in get- 
ting grants, setting up laboratories, devel- 
oping software, assigning students projects, 
and rethinking from top to bottom what 
should go into a calculus course. They are 
changing not only what they teach, but also, 
what is far more difficult, how they teach. It 
is one thing to ask for reform in conferences 
and articles; it is another thing entirely to do 
it in the classroom, as these people are do- 


ing. 
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ad Resources 
Calculus Reform and the First Two Years 


innovations an 


Thomas W. Tucker. Editor 
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Unfortunately, these activities do not al- 
ways get the publicity they deserve. That 
is the purpose of this book—to dissemi- 
nate rapidly to the mathematical commu- 
nity detailed examples of calculus reform 
at work. The ten featured projects in this 
book, together with abstracts of more than 
sixty other projects and a collection of refer- 
ence materials and resources, are designed 
to give individuals and departments a con- 
crete idea of what they themselves can do, 
as well as information on how to do it, what 
to use, and who to contact. 


328 pp., Paper, 1990 
ISBN 0-88385-067-2 


List: $20.00 
Catalog Number NTE-17 
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Lascaux Graphics 3220 Steuben Ave. 
Bronx, NY 10467 USA (800) 338-0993 


STUDIES IN THE HISTORY OF MATHEMATICS 


STUDIES IN 
THE HISTORY OF MATHEMATICS 


Esther R. Phillips, Editor 


Esther Phillips has brought together a col- 
lection of articles showing the sweep of re- 
cent scholarship in the history of mathemat- 
ics. The material covers a wide range of 
current research topics: algebraic number 
theory, geometry, topology, logic, the réla- 
tionship between mathematics and comput- 
ing, partial differential equations, and alge- 
braic geometry. 

320 pp., 1987, ISBN 0-88385-1 28-8 

List: 36.50 MAA Member: $28.00 


Catalog Number MAS-26 


This is an excellent book! It is a 
very interesting and exciting book to 
read. The author does an extremely 
nice job of bringing together most, if 
not all, the mathematicians that were 
involved in a particular area of mathe- 
matics. The sources listed at the end of 
each section give the reader an oppor- 
tunity to look up other resources per- 
taining to the particular subjects, a fea- 
ture that is definitely lacking in many 
history books. The content of the book 
is choice. The professional mathemati- 
cian would definitely want to have a 
copy of this book. 


Barney Erikson in The Mathematics Teacher 


Order from: The Mathematical Association of America 
om 1529 Eighteenth Street, N. W. 
‘@) Washington, D. C. 20036 


(202) 387-5200 


MATHEMATICS: 


Queen and Servant 
of Science 


by E. T. Bell 


“Bell is a lively, stimulating writer, inoffensively crotchety 
and opinionated, with a good sense of historical circum- 
stance, a fine impatience with humbug, a sound grasp 
of the entire mathematical scene, and a gift for clear and 
orderly explanation.” 

James R. Newman, The World of Mathematics 


“Bell’s writing style makes you feel as if you are a for- 
tunate student spending time with a charming and gifted 
teacher. Not a history of mathematics, but a wide rang- 
ing exposition of ideas in the subject, the book is a classic 
and should be required reading for every teacher of 
mathematics.” 

Harold Jacobs, Grant High School, Van Nuys, California 


ATHEMATICS 


Queen &Servant 
of Science 


ERIC TEMPLE BELL 


“This book deserves a place in today’s market. It is a 
much more popular work than most histories of the sub- 
Ject, and that is exactly what makes it accessible to stu- 
dents as well as to non-mathematicians. It is rewarding 
reading for ... teachers and students of mathematics 
at all levels.” 


xv + 437 pp. Paperbound 
Republished 1987. 
ISBN-0-88385-446-3 

List: $18.50 


Morris Kline, Courant Institute MAA Member: $14.00 


This is an absorbing account of the development 
in pure and applied mathematics from the geom- 
etry of Euclid more than two-thousand years ago 
to some of the more recent developments in math- 

ematical physics. Published 30 years ago, MATHE- 

MATICS: QUEEN AND SERVANT OF SCIENCE 
remains a valuable tool for teachers, students, and 
all those enthusiastic about mathematics. The 
book includes 20 chapters on selected areas in 
pure and applied mathematics, including discus- 
sions on topics such as geometry, algebra, logic, 
probability, infinity, transformations, rings, groups, 
topology, and continuity. Also covered are sub- 
jects such as: Mersenne primes, quantum mechan- 
ics, and three-valued logic. The book is not a 
history of mathematics, but rather an enticing 


sampling of selected topics in mathematics and 
science. 


The book contains fascinating information on the 
personalities and philosophies of the great math- 
ematicians, plus perceptive resumes of their spe- 
cific contributions to the moving force of this great 
science. 
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The Mathematical Association of America 
1529 Eighteenth Street, N.W. 

Washington, D.C. 20036 
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The Joy of BER‘EDING. 


DERIVE,— A Mathematical 
Assistant program. 

PC Magazine says it’s 

“a joy to use” and proclaims it 
‘Editors’ Choice.” PC Week calls 
it “fast and capable.” The 
DERIVE® program is de- 
livered with built-in 
standard equipment 

that delights 
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both 
math lovers and 
math phobics. 


High performance. 


DERIVE does numeric and symbolic 
equation solving, exact and approximate 
arithmetic to thousands of digits, calculus, 
trigonometry and matrices. It displays or- 
mulas in comprehensible 2D format using 
raised exponents and built-up fractions. It 
plots beautiful 2D curves and 3D surfaces on 
monochrome or color monitors. 


Compact and sporty. 


PC Magazine calls ita “smal! wonder.” All it 
takes is 512K of memory and one floppy disk 
drive. DERIVE takes to the road on PC com- 
patibles, and really gets around on laptop 
and even handheld computers! 


DERIVE is a Handcrafted 
registered trademark of Software 
Soft Warehouse, Inc. for the Mind 


Human engineering. 


You don't have to be fluent in computerese to 
use DERIVE. In fact, it’s the friendliest and 
easiest to use of any symbolic math package 
on the road today. 
lts menu-driven 
interface 


and 
on-line help 
make it easy— 
you'll soon be up to 
speed doing math. instead of 
trying to learn how to drive the 
software. 


Freedom to maneuver. 


DERIVE’s automated expertise releases you 
from the drudgery of hand calculations. 
You can do problems you'd never attempt 
otherwise and obtain exact symbolic solu- 
tions, in addition to approximate numerical 
solutions. 


No sticker shock. 


DERIVE’s suggested retail price is $250. And 
of course DERIVE doesn't require an expen- 
sive computer, a math co-processor, or even 
a hard disk drive. 

Order DERIVE through your favorite software 
dealer or mail-order house. For a list of 
dealers, write Soft Warehouse, Inc. at 3615 
Harding Avenue, Suite 505, Honolulu, HI 
96816. Or call (808) 734-5801 after 11 a.m. 
Pacific Standard Time. 


And happy DERIVEing! 
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Revised 
and 
Updated 


THE LAST PROBLEM 


E. T. Bell 
Revised and updated by Underwood Dudley 


What Eric Temple Bell calls the last prob- 
lem is the problem of showing that Pierre 
Fermat was not mistaken when he wrote 
in the margin of a book, almost 350 years 
ago, that 2” + y” = z” has no solution in 
positive integers when n > 3. The orig- 
inal text of THE LAST PROBLEM traced 
the problem from Babylonia in 2000 B.C. 
to seventeenth-century France. Along the 
way we learn quite a bit about history, and 
just as much about mathematics. Under- 
wood Dudley's notes bring us up-to-date on 
recent attempts to solve the problem. 


The book is unique in that it is a biogra- 
phy of a famous problem. ‘ The book fits 
no categories. It is not a book of mathe- 
matics. Pages go by without an equation 
appearing. It is not a history of number the- 
ory because it includes too much about the 
history of the western world, and it is not 
a history of western civilization because its 
focus is on mathematics. It is too entertain- 
ing to be scholarly and contains too much 
mathematics to be widely popular. It is an 
unusual book. 


What T.A.A. Broadbent said about Bell's 
work applies to THE LAST PROBLEM. 
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His style is clear and exuberant, his 
opinions, whether we agree with them 
or not, are expressed forcefully, often 
with humor and a little gentle malice. 
He was no uncritical hero-worshipper, 
being as quick to mark the opportunity 
lost as the ground gained, so that from 
his books we get a vision of mathemat- 
ics as a high activity of the questing 
human mind, often fallible, but always 
pressing on the neverending search for 
mathematical truth. 


This is a rich and varied, wide-ranging book, 
written with force and vigor by someone with 
a distinctive style and point of view. It will 
provide hours of enjoyable reading for any- 
one interested in mathematics. 


328 pp., Paperbound, 1990 
ISBN-0-88385-45 1-1 


List: $17.50 MAA Member: $13.50 
Catalog Number TLP 


ORDER FROM 
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New From 


Steven G. Krantz 


COMPLEX ANALYSIS: 
THE GEOMETRIC VIEWPOINT 


Steven G. Krantz 


Geometric methods have been used in 
complex analysis since the 1930s when 
Lars Ahlfors discovered that they give a nice 
way to look at the Schwarz lemma. Since 
that time they have become a central part of 
the research activities of complex analysis. 
However these important techniques have 
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Education. Although using most features of E.Z. Math involves no more than plugging in the ROM card and following the 
various on-screen menus, the E.Z. Math User’s Manual explains very clearly and precisely in complete detail all features and 
display screens of the . Just as important, 46 pages of the manual are devoted to simple, intuitive explanations of the basic 
concepts involved in graphs, numbers, sets, variables, equations, savings and loans. 
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Number Theory as Gadfly 


B. Mazur, Harvard University 


Dr. Mazur received his Ph.D. from Princeton University in 1959, and was 
a Junior Fellow at Harvard University from 1961-64. Since then he has 
been at Harvard University with frequent visits to I1.H.E.S. in France. He is 
a member of the U.S. National Academy of Sciences and has received the 
Veblen Prize (in geometry) and the Cole Prize (in number theory) from the 
AMS. 


(This is an expository article which evolved from notes written in preparation for a 40-minute talk 
for a general audience at the “Symposium on Number Theory,” held in Washington D.C. on May 4, 
1989 under the auspices of the Board on Mathematical Sciences of the National Research Council. To 
make the text more informative the original version has been supplemented with lots of commentary, a 
section (§4) has been added which may be useful to readers familiar with the classical theory of 
modular forms, and an appendix has been added which is meant for an even more specialized audience. 
I am thankful to P. Diaconis, J. Mazur, K. Ribet and J.-P. Serre, who read early drafts of this paper, 
and whose suggestions were very helpful to me.) 


1. Introduction. When a friend saw the title to my talk he asked if what I had 
in mind was the well-known fact that number theory has an annoying habit: the 
field produces, without effort, innumerable problems which have a sweet, innocent 
air about them, tempting flowers; and yet...the quests for the solutions of these 
problems have been known to lead to the creation (from nothing) of theories which 
spread their light on all of mathematics'!, have been known to goad mathematicians 
on to achieve major unifications of their science*, have been known to entail 
painful exertion in other branches of mathematics to make those branches 
serviceable’. Number theory swarms with bugs, waiting to bite the tempted 
flower-lovers who, once bitten, are inspired to excesses of effort! 

Well, perhaps that summarizes the general aim of my talk—but, to put it more 
gently, I want to spend a few minutes considering one example (a conjecture, in 
fact) which shows how Number Theory can sometimes contrive to be a helpful, and 
possibly inspirational, goad to the rest of the Mathematical Sciences. 

The most celebrated of all deceptively simple (and still unsolved!) problems in 
Number Theory is surely Fermat’s Last Theorem*. Its curious history (whose 
statement first occurs as a marginal commentary on the equation arising from the 
Pythagorean theorem) is so well known, it needn’t be rehearsed here. Professional 
mathematicians, after Fermat, have approached Fermat’s Last Theorem with 


leg. Kummer’s theory of ideals 
*e.2., Grothendieck’s theory of schemes 
5e.g., The theory of group representations, and in particular, the “Langlands program” 


*For a detailed account of the recent work on this see Oesterlé’s Bourbaki report [O] listed in the 
References for §4. 
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mixed feelings. Kummer, for example, called it “more of a joke than a pinnacle of 
science,” and he came as close as anyone has come to proving it! Gauss wouldn’t 
work on it, even when urged by his friends to do so in order to get the cash reward 
which had been offered for its solution. 

But Fermat’s Last Theorem has always been the darling of the amateur 
mathematicians, and as things have progressed, it seems that they are right to be 
enamored of it: Despite the fact that it resists solution, it has inspired a prodigious 
amount of first-rate mathematics. Despite the fact that its truth hasn’t a single 
direct application (even within number theory!) it has, nevertheless, an interesting 
oblique contribution to make to number theory: its truth would follow from some 
of the most vital and central conjectures in the field. Although others are to be 
found, Fermat’s Last Theorem presents an unusually interesting ‘‘test’’ for these 
conjectures. 

My aim is to describe, in some detail, one of these grand conjectures (due to 
Shimura, Taniyama and Weil?) which, even though still unproved, plays a struc- 
tural and deeply influential role in much of our thinking and our expectations in 
Arithmetic. Thanks to recent work of Frey, Serre, and Ribet®, it has a large 
number of applications, Fermat’s Last Theorem among them. As I shall not have 
time to make clear, but hope, at least, to make believable, the conjecture of 
Shimura-Taniyama-Weil is a profoundly unifying conjecture—its very statement 
hints that we may have to look to diverse mathematical fields for insights or tools 
that might lead to its resolution’. 

As we Shall see, the conjecture of Shimura-Taniyama-Weil would imply a 
strange and important connection between the elliptic curves that arise in Arith- 
metic (we’ll get to that shortly!) and the Hyperbolic Plane. 

I mentioned above that the conjecture of Shimura-Taniyama-Weil has as one of 
its consequences, Fermat’s Last Theorem. As everyone knows, Fermat’s Last 
Theorem is an assertion about the family of (Diophantine®) equations 


XN+YN = ZN 


The fact that this conjecture has also been referred to as the Weil conjecture, the Taniyama-Weil 
conjecture, and the Taniyama conjecture points to the difficulty in assigning to it a clear attribution. It 
was originally formulated as a problem by Taniyama in a conference in 1955 and was published in 
Japanese, in Sigaku 7 (1956) p. 269. A more precise formulation corresponding to the modern form of 
the conjecture—involving important information concerning the conductor—was implicitly suggested 
by subsequent work of Weil which had the effect of bringing the problem to the attention of a large 
audience. The most precise version of this conjecture to date, which brings in the crucial issue of fields 
of definition, incorporates work of Shimura, Eichler, and others (see footnote 15 below, and the 
technical appendix at the end). For a moving evocation of the life and times of Taniyama (as well as an 
English translation of the original statement of Taniyama’s problems) see the article “Yutaka Taniyama 
and his time, very personal recollections,” Goro Shimura, Bull. London Math. Soc. 21 (1989) 186-196. 


See [S], [Fr 1, Fr 2], and [R] listed in the references at the end of §4. 


"It does not seem unnatural to look to differential geometry for progress with this conjecture, or to 
partial differential equations and the study of the eigenvalue problem for elliptic operators, or to the 
representation theory of reductive groups... . It would be no surprise if ideas from the classical theory 
of one complex variable and the Mellin transform were relevant, or of Algebraic Geometry... . But 
perhaps one should also look in the direction of Kac-Moody algebras, loop groups, or Y-modules, 
perhaps to ideas that have been, or will be, imported from Physics... . 


SThe adjective ‘““Diophantine” is in honor of the Alexandrian mathematician Diophantos (perhaps 
A.D. 250) and signals vaguely a type of equation not dissimilar from those Diophantos considered. 
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for N = 3,4,5,..., or with little loss of generality, for odd prime exponents N. 
But despite the elegance, and evident symmetries of the above family of equations, 
there is no denying that, after all, it is merely one family of Diophantine equations. 
What is so excellent about these particular equations? 

Now, the urge to put any single instance in an appropriate general context 
before dealing with it mathematically is strong. Consider, for example, the way 
Francois Viéte, the modern inventor of algebra, expressed that urge at the end of 
his treatise (c. 1591) by saying that algebra appropriated for itself “the proud 
problem of problems: which is to leave no problem unsolved.” More to the point, 
consider the most celebrated of recent Diophantine results, valid in a truly general 
context: the theorem of Faltings (conjectured originally by Mordell) which asserts 
that any algebraic equation in two variables, and of genus’ greater than or equal to 
2, has only a finite number of rational solutions. 

But what is an appropriate general context in which to place the Fermat family 
of Diophantine equations, and what is the appropriate Diophantine question to 
ask? Despite the fact that the Fermat problem has been with us for three centuries 
I don’t believe that we have any thoroughly comfortable answer, even to this 
modest question. One may always take recourse (in cases where it is not clear how 
to “correctly” generalize a problem) in the reliable method of kicking the problem 
a bit, to get a “nearby” one... . A relatively conservative move in this direction, in 
the case of Fermat’s equation, is to allow general coefficients in the equation, say 
one coefficient for starters—for example, fix a nonzero integer A, and consider 
the family 


A:XN+YN=ZN, 


and then ask: Is there an exponent N, such that for exponents N (or for prime 
exponents N) greater than N, there is no triple of integers CX, Y, Z), none zero, 
solving the above equation? In this slight perturbation of Fermat’s original prob- 
lem a few minutes of reflection will convince one to be circumspect in framing 
precise conjectures... (e.g., consider A = 2). Nevertheless the conjecture of 
Shimura-Taniyama-Weil has an impressive power of prediction concerning the 


About Diophantos’ personal history little is known, save what can be gleaned from the following 
problem which occurs in a collection, the Palatine Anthology, compiled, scholars believe, no more than 
a century after his death: 

Here you see the tomb containing the remains of Diophantos, it is remarkable: artfully it tells 
the measures of his life. The sixth part of his life God granted him for his youth. After a twelfth 
more his cheeks were bearded. After an additional seventh he kindled the light of marriage, and 
in the fifth year he accepted a son. Elas, a dear but unfortunate child, half of his father he was 
and this was also the span a cruel fate granted it. He consoled his grief in the remaining four 
years of his life. By this devise of numbers, tell us the extent of his life. 


The genus g of an algebraic curve is a nonnegative integer which was originally introduced by 
Riemann and defined by “topological means.” It also has an “algebraic” definition, and as such is an 
intrinsic invariant of the field of algebraic functions on the curve. If the curve is given as the locus of 
zeroes of a homogeneous form of degree d in three variables in the projective plane then g < (d — 
1)(d — 2)/2 with equality holding if and only if the curve has no singularities. In contrast to the genus, 
however, the degree is not given by the field of functions of the curve alone: it is defined in terms of the 
representation of the curve in projective space. For this reason it is more natural to look to the genus 
rather than to the degree as an invariant which determines “‘Diophantine behavior.” 
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nature of nontrivial integer solutions for these families. For example,!® using work 
of Frey, Serre, and Ribet, the Shimura-Taniyama-Weil conjecture would imply that 
the above equation has no such solutions for prime exponents N > 7 if A is any 
power of 3, or of 5, or of 7 (or of 11, 13, 17, 19, 23, 29, 53 or 59, for that matter", 
provided that N doesn’t divide A), and it would guarantee the existence of an N, 
such that for prime exponents N > N, there are no such solutions if A is any 
power of any odd prime not of the form 2” + 1 (i.e., if A is neither a power of a 
Mersenne nor of a Fermat prime). 

In summary, the conjecture of Shimura-Taniyama-Weil seems to be getting into 
the thorny thicket of these Diophantine issues—seems to be giving reasons why 
some (but not all!) of these equations cannot have solutions—seems to be 
beginning to put such Diophantine problems in a “‘context.”’ 

It also relates them to the extraordinary geometric questions to which we shall 
now turn. 


2. Euclidean and non-Euclidean covering mappings. One of the mysteries of 
the Shimura-Taniyama-Weil conjecture, and its constellation of equivalent para- 
phrases, is that although it is undeniably a conjecture “about arithmetic,” it can be 
phrased variously, so that: in one of its guises, one thinks of it as being also deeply 
“about” integral transforms in the theory of one complex variable; in another as 
being also “about” geometry”. 

The more striking of these two formulations is the geometric one. To explain it 
we need to review a few basics of geometry: symmetries, orbits, orbit spaces, 
covering mappings, and the interesting concept of “uniformization” ... . We'll build 
things up slowly by first considering these notions in a relatively simple context (on 
the Real Line), and then in the two contexts (Euclidean and Non-Euclidean) 
necessary for the actual “geometric” statement of the conjecture. 


T, =“shift to right >> by A units” 


(RealLine) oo gp gg lg “ng Sp pp — 


Fic. 1. The “lattice” A = {0, + A, + 2A,...} is the orbit of 0 under the translation 7}. 


(I). On the real line. 


Let A be a positive real number and let 7, denote the operation on the real line 
R consisting of “shifting all points in R to the right” by A (synonymously: 
translation by A; in equations: 7,(x) =x + A). We view T, as a symmetry of R. 
The iterated translates of 0 by 7, and by its inverse T_, give us a discrete evenly 
spaced “‘lattice” A in R consisting of all integral multiples of A. 


See thme. 2 and subsequent remarks in section 4.3 of J.-P. Serre’s [S] listed in the references at 
the end of §4. 


‘Iand with more work, one could surely produce more such consequences of Shimura-Taniyama- 
Weil. 


'-The equivalence of these two formulations is due to Weil, following upon work of Hecke. 
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The orbit of a real number x with respect to the lattice A, is the collection of 
real numbers obtained from x by translating x by iterates of 7, and by its inverse 
T_, (that is, the orbit is the collection of real numbers x + N~-A where N runs 
through all integers). The lattice A itself is the orbit containing the origin. Given a 
fixed real number A, any orbit for A has a unique representative in the interval 
A<x<AtaA. 


9 ——i———— 0 —- 4 —— 6 ——f-———. 0 — 3 0 —_ 3 0 —_-—— 0 
—3A x —2A —2X 0 +A +22 +3A 


Fic. 2. The black squares mark the “orbit” of some general real number x. 


The orbit space of R with respect to the symmetry 7, (or one might also say: 
with respect to the lattice A) is simply the collection of orbits (with respect to A). 
Call the orbit space R/A. One can think of the orbit space as obtained from R by 
“identifying” any point x in R with any iterated translate of x by 7, and T_,, Le., 
with all points y € R such that y = x + NA for some integer N. The easiest way to 
visualize this is by wrapping the real line (viewed as a helix in the figure below) 
around a circle (say a unit circle in the complex plane) in such a manner that 
points on R of distance A apart map to the sample point of the circle: 


zP e2Tiz/a (*) 


The circle, then, may be taken to be the orbit space. The displayed mapping (*) 
can be called a covering mapping and has the property that the pre-image with 
respect to (*) of any point on the circle consists in precisely one orbit with respect 
to A. 


(Real line, viewed 


RR 
a as a helix) 
Covering 


mapping 
oo \ (Orbit space, viewed 
as a circle) 


Fic. 3 
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and the various symmetries obtainable by iterations of them (x > N,-A, + Ny° A>, 
for integers N, and N,). Here A,, A, are two complex numbers such that 0, A,, and 
A, are not collinear (and therefore “generate” a lattice A as in FiGureE 5). 

An orbit with respect to A (that is, the set of points in the complex plane which 
can be obtained by the iterated application of the translations Toth, and their 
inverses to a single point in the complex plane) is simply a “displaced lattice” as in 
Ficure 6 below. Any such orbit has exactly one representative in the (half-closed) 
parallelogram as in the next figure, whose vertices are 0, A,, A,, and A, + A5. 


Fic. 6. The @’s mark an orbit with respect to the lattice. 


As in the example of R given previously, by the orbit space with respect to the 
two symmetries 7, and 7), (or, we may Say, with respect to the lattice A) we mean 
the collection of orbits with respect to A. We may visualize this orbit space as the 
parallelogram with vertices 0,A,,A,, and A, +A, “wrapped up” as shown in 
FiGuRE 7 below. Topologically it is a “torus,” i.e., the surface of a doughnut. Call 


Fic. 7 
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(Euclidean) a 


EZZE__ 


Euclidean 
Uniformization 


Elliptic curve 


Fic. 8. The “covering mapping” which brings the complex plane C to the orbit space C/A may also be 
visualized as a two-stage process, where in the first stage the plane is wrapped around a cylinder, and in 
the second stage the cylinder is wrapped around a torus. 


the orbit space C/A. The mapping which sends each point in the complex plane to 
the orbit which contains it is our covering mapping C — C/A: 

We wish to think of the orbit space C/A as inheriting a “conformal geometry” 
(and an orientation) from the standard Euclidean geometry of the complex plane C 
via this natural mapping. A conformal geometry on a smooth surface is a “geome- 


Fic. 9. In conformal geometry, there is no invariant notion of “length” of an arc, but there is a notion 
of angle. 
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try” where there is no intrinsic notion of “length of arc,” but if we are given two, 
real, smooth pieces of arc traced on the surface which intersect at a point (as in 
FIGURE 9) it makes sense to talk about their angle of intersection. 

To endow a smooth surface with an orientation and a (locally Euclidean) 
conformal geometry is the same as to consider it as a Riemann surface.’* {Here is a 
sketch of the modern formulation of this notion, originating essentially with 
Herman Weyl: to give a smooth surface the structure of Riemann surface has to 
specify at each point of the surface a preferred class of complex-valued functions 
defined locally about the point, called analytic functions, where this specification 1s 
done in the following manner: Among the analytic functions about each point 
there is at least one, call it z, which identifies some neighborhood of the point with 
an open set U of the complex plane so that the specified class of analytic functions 
on that neighborhood 1s identified with the class of standard analytic functions on 
U. Moreover, the condition of being an ‘analytic function’ is a local condition in 
the sense that a function defined on an open subset of the surface is ‘analytic’ if 
and only if it is ‘analytic’ on each subset of any open cover of that open subset. 
Given, then, a Riemann surface we may say what it means for a complex-valued 
function on it to be ‘analytic,’ or ‘meromorphic’ (a ratio of ‘analytic functions’). 

A function which plays the role required of the function denoted z above, 
which is defined locally about a point of the surface and, moreover, which vanishes 
at that point is sometimes called a local uniformizer at the point in question. The 
local isomorphism between an open subset of a Riemann surface and U C C given 
by a ‘local uniformizer’ z identifies the ‘‘conformal geometry” on that open subset 
with the Euclidean conformal geometry on U}. 

With Weierstrass let us ask for (doubly) periodic'* functions on the complex 
plane: functions which are periodic with respect to both 7,, and 7, ,. These doubly 
periodic functions are precisely the meromorphic functions on the quotient space 
C/A, meromorphic in the sense of its Riemann surface structure. As Weierstrass 
showed, the problem of constructing all such periodic functions can be elegantly 
settled by the construction of one special periodic function, the so-called g-func- 
tion attached to A,g,(z) (call it @ for short)'? which satisfies a nonlinear 
differential equation of the form 


(9) = 4-92 +A-ot+B, 
where A and B are complex numbers, and gz) = dgy(z)/dz. This turns out to be 
quite an adventitious mathematical construction! An unexpected number of prob- 
lems are solved in a single stroke: First, the stated problem finds its solution in this 
construction, for any doubly periodic (with respect to A) meromorphic function can 
be obtained as a rational function in g, and g',. But it is also the case that any 
equation 


Y?=4-X3+4-X+B (A,BEC) (*) 


See Felix Klein’s classic expository account: On Riemann’s Theory of Algebraic Functions and 
Their Integrals (A Supplement to the Usual Treatises). Dover. 


14(meromorphic) 


given by: oz) = 1/z? plus the infinite sum of the terms {1/(z — A)? — 1/7} where A ranges 
through the nonzero elements of the lattice A. 
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[such that the cubic polynomial 4 - X¥° + A - X + B has no multiple roots] arises as 
the polynomial relating @, and gy’, where g, is the Weierstrass g-function 
attached to some lattice A. The algebraic curve in the (X,Y)-plane defined by 
such an equation (or rather, the completion of this curve in the projective plane) is 
called an elliptic curve. There is a single point at © on this curve; this point will be 
called the origin of the curve. It is an easy exercise in algebra to see that any 
smooth algebraic curve of degree 3 in the projective plane can be given, after any 
choice of an origin and an appropriate rational change of variables, a defining 
equation of the above form (called a Weierstrass equation). This means that if we 
are interested in the algebraic problem of finding all complex solutions of smooth 
homogeneous forms in three variables of degree 3 (or equivalently, of finding 
solutions of a “Weierstrass equation”) we are led to the analytic construction of a 
specific lattice A in C, of its associated g-function g,(z), and then the non-origin 
points on E are given by (X,Y) = (»,(z), 9(z)) where z ranges through the 
points of C which are not in A. Let us call the mapping 


C-A _*, elliptic curve — origin 
z> (X,Y) = (9(2),0"(2)) 


a (Euclidean) uniformization of our elliptic curve. One can, if one wishes, complete 
the picture to get a mapping 


C —— elliptic curve 


which identifies the complex points of our elliptic curve (including its origin) with 
the orbit space with respect to A, i.e., the Riemann surface'® C/A, the mapping 
being the covering mapping. 

The word “uniformization” is meant to carry the full load of its nuances here. 

We have uniformly parametrized the complex points of any elliptic curve by the 
points of C (more exactly, if you wish, by the orbits of these points under 
translation via the elements of a lattice A determined by the elliptic curve). 

The uniformization Y also uniformizes the conformal geometry of any particu- 
lar elliptic curve, in the sense that W identifies the conformal geometry of the 
Riemann surface, locally, with Euclidean conformal geometry (its inverse, locally 
defined, gives local uniformizers in the sense described above)”. 

So far, our equation (*) has complex coefficients A,B. Number theory will 
properly enter our picture when we consider elliptic curves Y* =4-X? + 
A+X +B where the coefficients A, B are algebraic numbers, or more specifically, 
rational numbers. To focus on the latter case which will be our eventual particular 
interest, let us refer to an elliptic curve whose coefficients A, B lie in Q as simply 


Tt has mystified generations of students that (algebraic) curves can be viewed as (Riemann) 
surfaces. The clash in terminology indicates that, as its name implies, Algebraic Geometry is at the 
meeting-ground between algebra and geometry: one thinks, for example, of C as the affine line G.e., a 
curve) if one is thinking algebraically and as the complex plane (i.e., a surface) if one is thinking 
geometrically. 


The uniformization Y has the further “uniformity” property, usually incorporated in the technical 
definition of a covering mapping, that any small enough disc on the elliptic curve has the property that 
its pre-image under @Y consists in a disjoint union of components, each of which maps isomorphically 
onto the given disc. 
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an arithmetic elliptic curve. But before we deal with arithmetic elliptic curves we 
have some hyperbolic geometry to do. 


(III). Periodicity on the (non-Euclidean) hyperbolic plane—the setting for the 
classical theory of modular functions. 


Let us turn now to hyperbolic geometry, the (independent) discovery of Bolyai, 
Gauss, and Lobachevsky. 

Hyperbolic geometry is a homogeneous geometry satisfying all the Euclidean 
axioms except for the fifth postulate, and possessing many lines through a given 
point, parallel to a given line; it now has a number of equivalent concretizations. 
The model particularly useful to us is the upper half-plane model. 

Here the points of the geometry are the points z = x + ty in the upper half of 
the complex plane H, i.e., x can be any real number and y any positive real. The 
lines are either vertical straight lines {a + iy} for a fixed real number a and all 
positive reals y, or else they are semi-circles abutting on the real axis. The upper 
half-plane model inherits a Riemann surface structure, and hence also a conformal 
geometry by virtue of its being an open subset in C. 


Upper half-plane 
model of hyperbolic geometry 


y / 


Two “lines” through a point 
P and “parallel” to another 
“line.” 


The translations T,:z+>z-+b for any real number Db are symmetries of 
hyperbolic geometry, but there are many more symmetries (in fact two other 
continuous parameters of them'®), perhaps the most important single one being 
inversion with respect to the unit circle, S:z +> —1/z. 


18 : ; ; ; 
Consider matrices of real numbers of determinant equal to 1, 1.e., 


en 
c,d 


with ad — bc = 1. Then z > az + b/cz +d is an orientation-preserving transformation of the upper 
half plane which is a symmetry of its hyperbolic geometry, and any orientation-preserving symmetry 1s 
given by such a matrix. 
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Fic. 11 


FIGURE 11 is meant to illustrate the tiling of the hyperbolic plane that is gotten 
by systematically applying composites of iterates of the unit translation, T,:z > 
z +1, and of the inversion S (and of their inverses), to the ‘“‘basic tile,’ which is 
the shaded region in the figure. Let [ be the group of symmetries of the 
hyperbolic plane gotten from such compositions of 7, and S. It is a fact that [ 
consists in all transformations of the form z+ az + b/cz + d where the coeffi- 
cients a,b,c,d are all integers and ad — bc = 1. There are a number of striking 
differences between I’ acting on the hyperbolic plane and a lattice A, generated by 
translations T,, and T).,, acting on the complex plane. First, the two translations of 
the complex plane Ty, and T,, commute with one another, which is not the case 
for translation and inversion of the hyperbolic plane, i.e., [ is a more interesting, 
noncommutative, group. And second, there is a natural way of identifying A with 
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an orbit in the complex plane (the orbit containing 0). There is no such identifica- 
tion for ['’. In Figure 12, the ‘“‘dots” (one in each tile) mark a single orbit with 
respect to I. 

If & is an elliptic curve given with its conformal geometry, it has, of course, its 
Euclidean uniformization coming from Weierstrass’s theory as described in II 
above. By a hyperbolic uniformization of E is meant a mapping 


H — { finite union of V-orbits} —,E- { finite set of points}, 


which is a covering mapping of the domain onto the range”’, which preserves 
orientation, is a local isomorphism of conformal geometries”', and which is 
periodic with respect to a subgroup of finite index in IT. This last condition just 
means that there is a subgroup I’ CT of finite index such that if z > y(z) is a 
transformation in I’, then 


V(y(z)) = Y(z). 
In other terms, % factors through the orbit space of the upper-half plane under 
the action of [’. 

For any such I’, there are only a finite number of distinct elliptic curves 
admitting a hyperbolic uniformization periodic with respect to I’. Moreover, it is a 
consequence of a theorem of Bely that an elliptic curve admits a hyperbolic 
uniformization if and only if it has a Weierstrass equation with coefficients A, B 
which are algebraic numbers, i.e., 4, B € Q. 


3. The conjecture of Shimura-Taniyama-Weil. A hyperbolic uniformization of E 
was defined to be a covering mapping periodic with respect to any subgroup of 
finite index in I’. There is, however, a specific class of subgroups of finite index in 
I’ which plays a dominant role in arithmetic. To explain why this class should play 
the role it does (and there are interesting geometric reasons for this) would lead us 
far afield. But the definition of the class is simple enough: For a positive integer N, 
let TCV) denote the group of matrices 


Oa) 
c,d}? 
where a, b,c, d are integers, ad — bc = 1, and which are congruent to the identity 


matrix (| 4 modulo N. 


The groups [(N) act as groups of symmetries of the hyperbolic plane (the 
above matrix acting by the rule z+az+b/cz+d), these groups all being 
subgroups of finite index in [’. Any subgroup of [T containing one of the [CV )’s is 
called a congruence subgroup. The terminology here is quite straightforward: a 
congruence subgroup of VT is simply one that is definable by congruence conditions 
on the coefficients of the matrices representing its elements. Not all subgroups of 
finite index in [ are congruence subgroups. (It is even true that, at least according 


‘There are two orbits which are slightly more degenerate than the rest. 
20, : 
As in footnote 17. 


*l Equivalently: is an analytic (unramified) covering mapping of Riemann surfaces. 


2 The uniformization Y extends to an analytic mapping of the upper half-plane onto the comple- 
ment of a finite set of points in E. 
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to one natural way of counting them, relatively few subgroups of finite index are 
congruence subgroups.) But this gives rise to the key. 


DeFINITION. Let E be an elliptic curve. A hyperbolic uniformization (of E) of 
arithmetic type is a hyperbolic uniformization of the elliptic curve E which is periodic 
with respect to a congruence subgroup 1" CT. 


Although (by Weierstrass) any elliptic curve admits a Euclidean uniformization 
(and, in fact with respect to a lattice A Cc C unique up to complex scalar change), 
and (by Bely) an elliptic curve admits a hyperbolic uniformization if and only if it 
can be defined by a Weierstrass equation with coefficients A, B which are algebraic 
numbers, the Shimura-Taniyama-Weil conjecture asserts, further, that 


Any arithmetic elliptic curve (i.e., any elliptic curve whose defining equation can 
be taken with coefficients in Q) admits a hyperbolic uniformization of arithmetic 
type.23:24 


Euclidean 
compex 


GEESE) 
LEZ LL 


F en 
LEE 
SE 
a == 
Euclidean 
. . \ 
uniformizer 
constructed by Is there also a 


Weierstrass hyperbolic uniformization 


of arithmetic type? 


Arithmetic elliptic curve 


Fic. 13 


*3The formulation we have just given of the conjecture would make it seem “unfalsifiable.” But in 
fact, there are more precise versions of the conjecture which predict, given an arithmetic elliptic curve 
E, exactly which [T'(N) would be involved in a hyperbolic uniformization of arithmetic type for 
E—these precise versions are known (by the work of Hecke, Eichler, Shimura, Weil, Deligne, Carayol, 
Faltings, and others) to be equivalent to the one given here. A technical point relevant to this 
equivalence is treated briefly in an appendix to this expository article. There are also stronger 
conjectures by Langlands (concerning automorphic representations of reductive groups) and by Serre 
(concerning 2-dimensional representations of Galois groups over @) which imply the conjecture of 
Shimura-Taniyama-Weil. 


*4as Serre remarked, it might be illuminating to formulate a precise conjectural characterization of 
the class of elliptic curves (necessarily definable over @) which admit hyperbolic uniformizations of 
arithmetic type. The conjecture of Shimura-Taniyama-Weil asserts, of course, that any elliptic curve 
definable over @ admits such a uniformization. Among the elliptic curves definable over quadratic 
number fields, for example, a necessary condition for them to have such a uniformization is that they be 
C-isogenous to their conjugate (cf. Goro Shimura, Class fields over real quadratic fields and Hecke 
operators, 95 (1972) 130-190, where the case of real quadratic fields is analyzed and examples are 
given). 
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It is the confluence of two uniformizations, the Euclidean one, and the 
(conjectural) hyperbolic one of arithmetic type, that puts an exceedingly rich 
geometric structure on an arithmetic elliptic curve, and that carries deep implica- 
tions for arithmetic questions. 


4. Why does the Shimura-Taniyama-Weil conjecture imply Fermat’s Last Theo- 
rem? This is not the place to go into the reams of evidence in favor of the 
conjecture, or of the overarching Langlands Program of which this conjecture is a 
minute part. But those readers who have borne patiently with me up to this point 
deserve some hints, however brief, of the connection between the conjecture which 
has been the focus of our talk and Fermat’s Last Theorem. 

Suppose, then, that we are given a counterexample to Fermat’s Last Theorem, 
1.e€., we are given a particular solution to a Fermat equation, a? + b” = c?, where 
a,b,c are nonzero integers and the exponent p is an odd prime number. We 
might safely suppose that p > 150,000 since Fermat’s last theorem has been 
established for odd exponents smaller than that bound. The subsequent argument, 
however, works for all primes p > 5, which is all that we assume at this point. We 
can always arrange the equation, by permuting (a, b,c) with appropriate changes 
of sign, if necessary, to get b to be even and a to be of the form 4k — 1. 
Hellegouarch [H] and, more recently, Frey [Fr 1, 2] consider the arithmetic elliptic 
curve given by the cubic equation 


y*=x(x —a”)(x +5?) (* *) 


which at first sight may not seem remarkable, but...(as is suggested in [Fr 2], 
formulated and set up in [S], and concluded in [R]) 


THeoreM. The arithmetic elliptic curve (* *) does not admit a hyperbolic uni- 
formization of arithmetic type. 


CoROLLARY. The conjecture of Shimura-Taniyama-Weil implies that the arith- 
metic elliptic curve (* *) does not exist, i.e., the conjecture of Shimura-Taniyama-Weil 
implies Fermat’s Last Theorem. 


For a sketch of the proof of the above theorem, the reader might consult the 
Bourbaki report of Oesterlé [O]. The proof makes essential use of the classical and 
also the more modern arithmetic theory of modular forms. One supposes that 
(« «)} does admit a hyperbolic uniformization of arithmetic type. Then, if w is a 
regular differential 1-form on the elliptic curve, pulling w back to the upper 
half-plane via the hyperbolic uniformization one gets a differential 1-form on the 
upper half-plane which after suitable normalization can be written as f(z) dz 
where f(z) is a cuspidal modular form f of weight 2 (the form f is modular for 
one of Hecke’s groups and is an eigenform of the Hecke operators) with integral 
Fourier coefficients: 


f(z) =1-e?"* +a,- 687 +0a,°0°T7 +00: (a, € Z). 


Using an approach suggested by a conjecture of Serre, and using the specific 
form of the discriminant and the conductor of (* «)”, Ribet has shown that the 


*°The discriminant of the cubic equation (* *) is a perfect pth power times a power of 2 (it is 
precisely (a?b?c? /2°)*) and the conductor of (* *) is an even square-free number. 
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Fourier coefficients of f,a@, = 1,a@,a3,°:° are congruent modulo p to the 
Fourier coefficients of a modular form g of weight two on Hecke’s group I,,(2). 
Ribet’s argument is startling in its originality and makes use, among many other 
things, of the very mysterious “Drinfeld switch” which occurs in the description of 
the “bad fibers” of Shimura curves. 

But by our good fortune there is only one modular form of weight two on I,(2), 
the Eisenstein series. That is, we know exactly what is, and therefore we know 
the Fourier coefficients of f, a, modulo p. 

Now, the theorem of Eichler-Shimura links the Fourier coefficients a, modulo 
p to questions of Q-rational p-torsion on the arithmetic elliptic curve (* *). From 
our explicit knowledge of a; mod p, we deduce that there exists a rational point of 
order p on (* *)*°. 

But any arithmetic elliptic curve, which like (* *) has all four points of order 2 
rational over Q, cannot have any rational p-torsion for p > 5 (see [M]). Contradic- 
tion! 

I hope, in these few minutes to have given some sense of the eclectic spirit of 
this conjecture, of how broadly it reaches out towards realms of mathematics that 
one might, at first, believe to be remote from Arithmetic, and yet how it gets to the 
heart of Arithmetic matters. 
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Technical appendix: questions of fields of definition. These questions have not 
been dwelt upon, in the expository account above, because one of the points of my 
exposition was to emphasize the purely “geometric” nature of the Shimura- 
Taniyama-Weil conjecture. Nevertheless, the more standard expression of this 
conjecture (and, in fact, the form of the conjecture that has direct arithmetic 
consequences) is that given any elliptic curve EF over Q, there is a nonconstant 
mapping defined over © from a modular curve X,(N) onto E. Here X,(N) is 
viewed as an algebraic curve over @ (given its “‘canonical model” structure over Q, 
as initiated in [Sh 1]). 

The point of this appendix is to make clear that we have not “‘cheated”’ in the 
statement we gave in the body of our expository account. That is, we will show that 
the “Shimura-Taniyama-Weil conjecture” as given in our expository text is equiva- 
lent to the statement in the preceding paragraph. 


*°Or at least on an arithmetic elliptic curve isogenous to (* *) via an isogeny of degree p rational 
over Q. 


1991] NUMBER THEORY AS GADFLY 609 


Specifically, we shall sketch the proof that if an elliptic curve defined over Q 
admits a nonconstant mapping from X(N) defined over C, for some N, then it 
admits a nonconstant mapping from X,(N) defined over Q as well (but possibly 
for a different value of N). Now this fact is surely a “folk-theorem” either known 
to, or else easily derivable by, any of the experts. We record some brief indications 
of its proof here simply because it doesn’t seem to be anywhere in the literature 
(but see [Ri] for very closely allied results). 

Let K be a field (characteristic 0 is all we shall need) and consider the category 
LL, Whose objects are abelian varieties over K, and whose morphisms are 
()-vector spaces obtained as follows: If A, B are abelian varieties over K, and 
Hom ,(A, B) is the abelian group of K-homomorphisms of abelian varieties, then 
the Q-vector space of morphisms from A to B in &%, is Hom,(A, B) ® Q. Thus, 
two abelian varieties are isomorphic in 7%, if and only if they are K-isogenous. 
Let A be an abelian variety over K, and let M be a finite-dimensional Q-vector 
space with continuous Gal(K/K)-action (i.e., the action factors through a finite 
quotient), where K is an algebraic closure of K. Then A ® M is the unique object 
of &% representing the contravariant functor B > {Hom;(B, A) ® M}JSM™4/®) 
where the superscript Gal(K /K) refers to the elements invariant under the 
(diagonal) action of Gal(K/K) on Hom;(B, A) ® M. 

Let N be a positive integer, and X,(N)_q the canonical model over Q of the 
modular curve attached to the classical group ['(N), as in [Sh 3]. Let J,CN) ,q be 
the jacobian of X(N) /Q which we view as object of the category 27. Let E be 
an elliptic curve over @, and let L/Q be a field extension. 


DEFINITION. The elliptic curve E is called modular over L if there is an integer N 
and a nonconstant mapping X(N ) yi E 1, (rational over L). 


Proposition. The elliptic curve E is modular over L if and only if E is modular 
over ( and more specifically, if and only if there is a nonconstant mapping 
X(N) q > E,@ (rational over Q). 


Sketch of proof. We may assume that E does not have complex multiplication 
over C, for if it did, then it would be modular over Q [Sh 4]. 

If E is modular over a given field, then it is also modular over any extension 
field, and if E is modular over C then it is modular over Q, so we need to prove 
that if E is modular over Q, then it is also modular over Q. Recall that there is a 
nonconstant mapping X(N’) — X(N) defined over Q, so (after a possible change 
of level N, and reduction from field of definition @ to a suitable finite Galois 
extension L./Q) we may assume that E /q is an elliptic curve without complex 
multiplications, admitting an L-rational nontrivial mapping X(N) > E. Let G = 
Gal(L/Q). 

By consideration of the Weil restriction from L to Q of this homomorphism of 
abelian varieties, and using the fact that J,(N),q is isogenous (over @) to a 
product of abelian varieties A, for newforms f (Prop. 2.3 of [Ril; for more about 
the varieties A See [Sh 2, 3]) we see that there is an irreducible G-module M and 
a newform f with Fourier coefficients in C such that A, is isomorphic in the 
category , to E ®M (ie., these abelian varieties are isogenous over Q). Let 
F CC be the coefficient field of the newform f (compare [Ri]). Then [F : Q] = 
dim A, = dimg(M). We also have that F = End,,(A,), by (Cor. 4.2 of [Ri)). 
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HENRY S. WARREN, JR. 
IBM Thomas J. Watson Research Center, P.O. Box 704, Yorktown Heights, NY 10598 


Here is the answer; what is the question? 


f(y) = cost Vy + Vy(1—y). (1) 


In this note we give three quite natural questions that have this same unlikely 
answer. 


Question 1. Two bodies, of masses m and M, are attracted by gravity. Find the 
distance y between them at time ¢. Assume that the bodies are initially at rest and 
separated by one unit of distance, that M >> m, and that GM = 1, where G is the 
universal gravitational constant. 


Solution. Since M >> m, we may assume that M does not accelerate. By 
Newton’s second law (F = ma) and his law of gravity, 


GmM 
F=ma=my" = - 5 
y 
Since GM = 1, this becomes 
1 
y" — ——. 
y2 
Hence y’y" = —y’/y’, or 1/2)((y')”)' = (yy, i.e., we have the “energy integral” 


(yy? = = 
—(y’)” — — = const = —1. 
2 y 


Integrating once more yields, after some calculation, 


v2t=f(y), 


where f is given by (1). 


Question 2. A wheel of unit diameter rolls to the right on the x-axis, without 
slipping. Find the locus of a point P on the rim that is initially at (0, 1). 


P(x, y) 
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Solution. The figure above shows the wheel after rotating an angle ¢. It has 
moved along the x-axis an amount rd, so the center has moved this amount. From 
the figure, 


x=rod+rsingd, (2a) 
y=r+rcos¢. (2b) 
After some calculation, and taking r = 1/2, we have 
x= f(y), 


where f is given by (1). 

Question 3. Find the perimeter P of the segment ABC of FiGuRE 2a below, in 
terms of the distance y of the chord AC from point D on the opposite side of the 
circle. The circle has unit diameter. 


Solution. Let O be the center of the circle, and draw auxiliary lines as shown in 
FiGuRE 2b. Let ¢ denote the equal angles AOB and COB, and let r be the radius 
of the circle. From FiGure 2b, 

P=2rd + 2rsind, 


y=r+rcos®¢. 


B B 
A C A Q/ C 
D D 
Fic. 2a Fic. 2b 


These equations are the same as equations (2) of question 2, with P/2 in place of 
x. Hence we have immediately 

P/2= f(y), 
where f is given by (1). A similar problem is to find the area a of the region 
bounded by arc ABC and by lines AD and CD in the figure below. 


Fic. 3 
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The result is 


2a =f(y), 
where again f is given by (1). 


An Occurrence of the Ballot Numbers in Operator Theory 


LAWRENCE H. RIDDLE 
Department of Mathematics, Agnes Scott College, Decatur, GA 30030 


Let S denote the (unilateral) shift operator on the Hilbert space /, that maps 
the sequence (xX, x5,%3,...) to the sequence (0, x1, x5, %3,...). The adjoint 
operator S* is a left shift, that is, S*(x,, x5, %3,...) = (%5, %3,...). Halmos has 
called the shift operator “the most important single operator, which plays a vital 
role in all parts of Hilbert space theory” [6]. It has many curious algebraic and 
analytic properties. It is an isometry of /, whose range is not the entire space. It 
has no eigenvalues while the eigenvalues of S* comprise the open unit disk. In 
fact, the adjoint operator is universal in the sense that every operator whose 
spectrum is included in the interior of the unit disk is similar to a restriction of the 
adjoint operator to an invariant subspace. In this note we show how iterating 
commutators of the shift operator gives rise to the class of sequences known as the 
ballot numbers. 

The ballot numbers, [ B, ,];°;.), which play a role in a wide range of combinato- 
rial arrangement problems [3], are defined by 


Byy = 1 
B; itm = 0 mM 2 i (1) 


By = B1j-1 + Bi ja. 12 1, 


where by convention B, _, = 0. One can interpret the integer B; ; as the number 
of ways to sequentially count votes given that you get i votes and your opponent 
gets i —j votes and you are never behind in the tally. TABLE 1 gives the first 
several rows of the ballot numbers considered as an infinite lower triangular 
matrix. 


TABLE 1: THE BALLOT NUMBERS 
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The result is 


2a =f(y), 
where again f is given by (1). 
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Let S denote the (unilateral) shift operator on the Hilbert space /, that maps 
the sequence (%,, %5,%3,...) to the sequence (0, x1, x5, %3,...). The adjoint 
operator S* is a left shift, that is, S*(x,, x5, %3,...) = (%5, X%3,...). Halmos has 
called the shift operator “the most important single operator, which plays a vital 
role in all parts of Hilbert space theory” [6]. It has many curious algebraic and 
analytic properties. It is an isometry of /, whose range is not the entire space. It 
has no eigenvalues while the eigenvalues of S* comprise the open unit disk. In 
fact, the adjoint operator is universal in the sense that every operator whose 
spectrum is included in the interior of the unit disk is similar to a restriction of the 
adjoint operator to an invariant subspace. In this note we show how iterating 
commutators of the shift operator gives rise to the class of sequences known as the 
ballot numbers. 

The ballot numbers, [ B; ,];°;-9, which play a role in a wide range of combinato- 
rial arrangement problems [3], are defined by 


Boy = 1 
B; itm = 0 m>1 (1) 
B; , = Bi-1,j-1 + Bj41 1 > 1, 


where by convention B, _, = 0. One can interpret the integer B; ,; as the number 
of ways to sequentially count votes given that you get i votes and your opponent 
gets i —j votes and you are never behind in the tally. TABLE 1 gives the first 
several rows of the ballot numbers considered as an infinite lower triangular 
matrix. 


TABLE 1: THE BALLOT NUMBERS 
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TABLE 2 


TABLE 3 


diagonal element of C, and let [C;,5], denote the jth off diagonal element of 
[C;, S]. Then we have 


PROPOSITION. For all i, j > 0, 
(a) C5 = (-1)’Biaj,, 
(b) [C;, S$], — (~1)' Bis j41,oj41- 


Proof. The proof for part (a) is by induction on i and makes use of the 
recurrence relations in equations (1) and (2). The case i = 0,1 is true by direct 
verification. Now assume that (a) holds for an integer i > 1 and for all j. For j = 0 
we have 


Ci41,0 = Cio 7 Gia = Bio + Bis i.2 = Bisi1 = Bisi0- 
Now for j > 1, 
C41, = ~C; j-4 + 2C; Ci j+1 


4 4] 
-(-1)' B;45-1,2j;-2 + 2( ~1)'B; 5,9; i (-1)’* Bi 4541,2j4+2 
(—1)’[(Bisj—1,2)-2 + Bisj,2)) + (Bisj2) + Bi 4j+1,2)+2)| 


= (—1)"[ Bisj,2j-1 + By 4) +1,2)+1] = (—1)'B, 4 )41,2)5 


which completes the proof for part (a). 
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The proof for part (b) is almost identical once we realize that 
[C;, S$]; = —[C,_4,S]j-1 + 2[C;_1, 5]; _ [Cy-1, SJ p41. 


Now use the same induction argument. @ 


In [11] it is shown that the ballot numbers are given by 


_ <~—(2n — k) 
n+ 1 nA 


Bk 


for 0 <k <n. From the proposition we thus obtain (after some simplification) 
that 


(zt 


a = (7 +2) 
Qit1\i-J 


_ _ J 
Ci ij ( 1) [C, §]; ( 1) i+] i-j 
In conclusion, we briefly mention two interpretations for the numbers in TABLE 
2 and TABLE 3. The first concerns the enumeration of all possible pairings of 2n 
points on a circle by the number of crossings of the chords. Riordan [9] tabulates a 
table of coefficients for the enumerating generating functions for this problem, 
which, ignoring minus signs, is exactly TABLE 2. We refer the reader to [9] for more 
details. For the second interpretation, define a path to be a finite sequence 
of pairs v, = (a,,b,) of nonnegative integers such that (i) v, = (0,0); and 
(ii) if v, = (a,,b,), then v,,,=(+a,,b,) or 0,4, = (a,,1+ b,). A path 
(U9, U1,---,U,) is of length n. The distance between two paths of length n is the 
absolute difference between the first coordinates of the last pair in each path. 
Finally, two paths of length n intersect if at least one set of corresponding pairs 
(other than (0, 0)) from each path are equal. Let P, , denote the number of pairs of 
nonintersecting paths of length i and distance /. In [10] Shapiro exhibits a table of 
values for P; ,, which he calls “a Catalan triangle,” and studies their properties. 
Again ignoring minus signs, we have that TasLe 3 is exactly Shapiro’s Catalan 
triangle. 


Acknowledgement. The author would like to thank John D’Angelo for stimulating his initial interest 
in this problem, and the referees for suggestions that improved the content of the note. 
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Derangements, Permanents, and Christmas Presents 


STEPHEN G. PENRICE! 
Department of Mathematics, Arizona State University, Tempe, AZ 85287 


The derangement problem, also known as the hat-check problem, the matching 
problem, and “le probleme des recontres,” was first solved by Montmort in 1713. 
There seem to be as many ways of describing the problem as there are names for 
it, and each description suggests some generalization, as several recent articles 
demonstrate (see [2], [3], and [7]). This note views the problem from a perspective 
that reaches back to my childhood. Because of the size of my family, we have 
traditionally held a drawing each year so that each child would be randomly 
assigned one brother or sister for whom he would buy a Christmas gift. Obviously, 
in order for such a drawing to be considered successful, no child can draw his own 
name. In mathematical language, we say that a derangement must occur. In any 
standard treatment of this problem, such as [6] or [11], it is shown that as the 
number of children gets large, the probability that a successful drawing occurs 
tends to e'. The solution is a straightforward application of the inclusion-exclu- 
sion principle. 

As with nearly everything else in life, however, the situation becomes more 
complicated as the children grow older. In our family, spouses of the children are 
included in the drawing, but this narrows the definition of a “‘successful drawing.” 
In addition to having no one draw his own name, we want no one to draw his 
spouse’s name, since presumably they exchange gifts anyway. Computing the 
probability of a successful drawing in this situation is difficult. In the case where all 
the children are married, we obtain bounds on this probability using two inequali- 
ties involving permanents of matrices, and these bounds are used to show that as 
the number of couples participating gets large the probability of a successful 
drawing goes to e~”. We show this by proving a more general result. 

Let k,n be integers, and S be a set of size kn. Let T,,7,,...,7,, be a partition 
of S into n sets, each of size k. Let P,(n) be the probability that a random 
permutation o will have the property that for all x in S and for all i, 1 <i <n, if 
x is in T;, o(x) is not in T;. The standard derangement problem is the case k = 1, 
and the ‘‘married” version of the ‘“‘Christmas Gift Problem” is the case k = 2, with 
the couples involved constituting the partition. Our claim is that for k fixed, 
lim P,(n) = e~* as n goes to infinity. This result is implicit in example 7 on p. 177 
of [11], but it does not seem well known. The permanent inequalities mentioned 
above, known as Minc’s conjecture and Van der Waerden’s conjecture, allow us to 
give a simpler proof. 


‘Research supported in part by NSF grant DMS 87-01475 and ONR contract N00014-85K-0494 at 
the University of South Carolina. 
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First we define the permanent and show how it is related to our problem. 


DeFiniTion 1. Let A = (a,;) be an m X m matrix. Define the permanent of A 
by 


per( A) = ye ty 4am) us Ginn(my? 
TT 


where > runs over all permutations 7 of {1,..., mb}. 

While the permanent seems similar to the determinant, it is much harder to 
evaluate since it is not invariant under row and column operations. However, it is 
easy to verify that the permanent is multilinear, and this fact is all we need for our 
results. 

Our plan now is to specify a (0, 1)-matrix A, ,, such that per(A,, ,,) is exactly the 
number of permutations of the set S described above which map each set in the 
partition outside itself. Let A, , = (a,;) where 1 <i, j < kn and where a,, = 0 if 
and only if g=q', where i=qk +r and j=qk+r',q,q,r,r'€Z, 0<r, 
r'<k,0 <q. A, , is the kn X kn matrix with k X k blocks of 0’s along the main 
diagonal and 1’s everywhere else. For example, A, ; is 


0 0 1 1 1 1 
1 1 1 1 
1 1 O 0 1 =4J 
1 1 0 0 1 41 
1 1 1 1 0 =O 
1 1 1 1 0 =O 
To see that per(A, ,,) counts the permutations we seek, let S = {1,2,..., nk} 


and let 7, = {1,2,...,k}, T, =tk+1,k 4+ 2,...,2k},...,7, ={n — Dk +1, 
(n — 1)k + 2,...,nk}. It is then clear that the non-zero terms of per(A,, ,,) 
correspond precisely to permutations which, for every i, map every x in T; to an 
integer not in T.. 

As we noted earlier, permanents are difficult to compute in general. In fact 
there is a formula for per(A,, ,,): 


nk . 
per( A, ,) = d (—1)'R,(nk — i)}, 


where 
" {k 
R= STF |: 


with the sum running over all i; +i, + --: +i, =i,0 <i, <k and (k), denoting 
the falling product k(k — 1)-::-(kK —i+ 1). We leave the derivation to the 
reader. (Hint: R, counts the number of ways of selecting i zeroes from A, ,, with 
no two in the same row or column. These R,’s are coefficients of a rook 
polynomial. For details on rook polynomials, consult chapter 7 of [11].) This 
formula does not seem to lend itself to straightforward analysis. However, Minc’s 
Conjecture and Van der Waerden’s Conjecture give bounds on per(A,, ,,) which 
are good enough to establish our result without analyzing this expression. We need 
two definitions to state these results. 
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DEFINITION 2. Let A = (a;,;) be an m X m matrix. The ith row sum of A is 
defined 


DEFINITION 3. Let A = (a,,) be an m X m matrix. Then A is doubly stochastic 


if 

(i) O<a,,<lforl <i,j<m 
(ii) r,=1forl<i<m 
(ili) c;=1forl <j<m. 


Van der Waerden made his conjecture in [13], and the result was proved in [4] 
and [5]. Expositions of Egoritchev’s proof can be found in [8] and [14]. 


THEOREM 4 (Van der Waerden’s Conjecture). Let A be a doubly stochastic 
m Xm matrix. Then 
m! 
A) >-—, 
per( A) fi 
with equality if and only if a,;, = 1/m for all 1 <i, j <m. 


Minc’s Conjecture was proposed in [10] and first proved in [1]. A simpler proof 
appears in [12]. 


THEOREM 5 (Minc’s Conjecture). Let A be an m X m (0,1) matrix with nonzero 
row SUMS 1,,...51,,. Then 


per(A) < (7!) 00 (7, !)7™. 


The next two lemmas are applications of these theorems. They are similar to 
results in [9] which give bounds on the number of perfect matchings in a regular 
bipartite graph. 


Lemma 6. Let A, , be as defined above. Then 
per( A, ,) = (An) !(1 - L/ny*". 


Proof. Applying the multilinearity of permanents and Van der Waerden’s 
Conjecture gives 


per(A,,,) = (kn —k)™ per Go Aa 
un (An)! 
> (kn -—k in 
( ) (kn) 


= (Xn) - ‘y" 
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It 


Lemma 7. per(A, ,,) < (kn — k)yi/en— ken, 


Proof. Apply Minc’s Conjecture with r; = kn — k for all 1 <i < kn. 
We are now in a position to prove the result. 


THEoreM 8. Let k be fixed. lim P,(n) = e~* as n approaches infinity. 


Proof. By the above remarks P,(n) = per(A, ,,)/(kn)!. By lemmas 6 and 7 


1 kn ((kn _ ky y/er—' kn 
[5] srt «SO 


is easy to see that the lower bound approaches e~“, and a straightforward 


calculation using Stirling’s formula shows the upper bound is asymptotic to the 
lower bound. 
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A distance function for a set X is a nonnegative, symmetric real-valued function 
X x X > R such that d(p,q) = 0 iff p = gq. A metric, or Euclidean distance 


function, is a distance function that satisfies the triangle inequality: d(p,q) < 
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It 
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1 kn ((kn _ ky yer" kn 
fi -—] <P(n) s KO 
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A distance function for a set X is a nonnegative, symmetric real-valued function 
X X X > R such that d(p,qg) = 0 iff p =q. A metric, or Euclidean distance 


function, is a distance function that satisfies the triangle inequality: d(p,q) < 
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d(p, x) + d(x, q) for any x € X. Topologically, distance functions are interesting 
because they provide a convenient way of describing a topology and because their 
geometric nature often facilitates the understanding of this topological structure. It 
then becomes a question of how a distance function should generate a topology. 

Intrinsically, a distance function describes the closeness of points in a set. Thus, 
for both a topology and a distance function, there is a concept of “neighborhood” 
of a point p. Consequently, it would seem that the topology determined by the 
distance function d is one for which the neighborhoods defined by d are the same 
as the neighborhoods defined by the topology. To this end, we let B( p, a) denote 
{x € X|d(p, x) < a} and call this the open ball about p of radius @ so that we 
have 


DEFINITION 1. A distance function d “determines” the topology Y provided 
that, for any point p, the set of open balls about p is a neighborhood base for p in 
the topological setting being described by 7; i.e, each open ball about p is a 
neighborhood of p and, when p is in any open set G in 7, then some open ball 
about p is also contained in G. 

Historically, when a distance function determines a topology in this manner, it 
has been called a semimetric; see [4]. In this context we shall say that d determines 
the semimetric topology 7. 

There is good reason to believe that this is the appropriate relationship between 
distance functions and topologies. In particular, note that, if d determines the 
semimetric topology .7, then a sequence x, converges to a point p in the 
topological sense described by 7 if and only if the distance d(x, p) converges to 
0; i.e., the intrinsic convergence given by the distance function is the topological 
convergence. Conversely, if 7 is a first countable topology of Y such that a 
Sequence converges with respect to 7 iff it converges relative to a distance 
function d, then it follows that d is an appropriate distance function for determin- 
ing the semimetric topology .7. 

There is an alternative topological perspective for the way in which a distance 
function d determines a semimetric topology Y. For any distance function d, the 
d-closure of a subset A of X is {x € X|d(x, A] = 0}, where d(x, A] = 
inf{(d(x, a)|la € A}, the distance of x from A. One can easily show that d 
determines the semimetric topology 7 iff the d-closure of any subset A of X is 
identical to the topological closure of A with respect to . Finally, let us note 
that, if d is a distance function for X, then d determines some semimetric 
topology iff its d-closure is a topological closure operator (as defined 
in [3]). 

Now that a relationship between a distance function and its topology has been 
established, we expect to develop certain properties (based implicitly on our 
Euclidean geometric experience), but there are some surprises in store. 


EXAMPLE 1. There is a distance function d which determines a semimetric 
topology Y and yet there are open balls generated by d that are not open. 

Let X = {a,|n = 1,2,...} U {0} be the subset of R with its usual topology 7, 
where a, = 1/3”. Then the distance function d for X such that 


d(x,y) =1, for x #y and xy #0; 
d(x,y) =|x — yl, otherwise, 


is a semimetric that determines .7. (It is a first countable space with the right 
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DEFINITION 2. A distance function d determines the symmetric topology 
provided that a subset A of X isin 7 iff, when a point p is in A, then there is an 
open ball B,(p,¢) about p contained in A. 


It follows immediately that any distance function d will determine a unique 
symmetric topology .Y in which every open set is the union of open balls, even 
though the open balls about p need not be neighborhoods of p in the context of 
J.In the case of Arhangel’skii’s example, note that d determines a Hausdorff 
symmetric topology Y such that a sequence x, converges to p in 7 iff d(x,, p) 
converges to 0. Moreover, when the closure determined by d is a topological 
closure, then the symmetric topology is one for which the topological closure of a 
set coincides with its d-closure. Unfortunately, it seems that the desired conver- 
gence property has also been lost. 


ExAmpLe 4. There is a distance function d which determines a symmetric 
topology Y in which the topological convergence is not the same as the distance 
function convergence. 

Let X = A U B, as defined in Example 2. There is a distance function d for X 
such that d(b,,a,,) = 1/3”, d(b,,0) = 1, and otherwise, d(x, y) = |x — y|. Note 
that the symmetric topology determined by d is one in which the sequence b, 
converges to every point in A, including 0. However, d(b,,0) does not converge 
to 0. 

The failure to achieve identical convergence structures (for the topology and the 
distance function) in this example lies not so much in the connection between the 
distance function and the topology, but rather, in the pathological properties of 
the topological convergence itself. The sequence b, is converging to almost 
anything! | 


Remark. If d determines a symmetric topology .Y in which convergent se- 
quences have unique limits, then a sequence x, converges to p iff d(x,, p) 
converges to 0. 


Proof. Suppose that d is such a distance function. It is immediate that, if 
d(x,,p) converges to 0, then x, converges to p in the symmetric topology 7 
determined by d. Conversely, suppose that a, converges to p, but d(a,, p) does 
not converge to 0. We may assume that there is a positive number e such that 
d(a,, p) = € for every n. We claim that for each point q that is not in A = {a,]| 
n = 1,2,...} there is an open ball about g that is disjoint from A. (This is obvious 
by our assumption, if g = p; otherwise, there is a subsequence of a, that con- 
verges to g which is impossible if convergent sequences have unique limits.) Thus, 
A is closed and p is not in A, even though a, converges to p. This contradiction 
establishes the converse. 

Thus, it seems that we have finally arrived at the intrinsic connection (even for 
metrics) between a distance function and its topology. 
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Completely metrizable topological spaces and locally compact Hausdorff spaces 
are two important classes of spaces that are different in many ways. However, each 
of these classes is contained in the class of Baire spaces, and this fact is one of the 
major motivations for the study of Baire spaces. 

The term Baire space was introduced by Bourbaki to describe topological 
spaces in which every nonempty open subset is non-meager (that is, not the union 
of a countable family of nowhere dense subsets). The result that every complete 
metric space is a Baire space is known as the Baire Category Theorem. Early 
versions of it were proved independently by Osgood [4] in 1897 and Baire [1] in 
1899. This theorem is one of the main ways that the notion of completeness is 
applied in analysis. Several of the important theorems of analysis, for example the 
Open Mapping Theorem, the Closed Graph Theorem and the Uniform Bounded- 
ness Theorem, require only that the space under consideration be a Baire space. 

The purpose of this note is to present and discuss a characterization of Baire 
Spaces which seems to be acknowledged in the literature only for the special case 
of complete metric spaces. 

The fact that the set Q of rationals is not a G,-set (.e., the countable 
intersection of open sets) in the real line R is well known and follows from the 
Baire Category Theorem for R. Most students find that this application of the 
Baire Category Theorem is not trivial. Moreover, since © is obviously meager in R, 
it leads to the following natural question: What is the status of meager G;,-sets in 
Baire topological spaces? The answer to this question is hardly surprising, but 
again is not obvious, namely, a topological space is a Baire space if and only if its 
meager G;-sets are nowhere dense. This observation is established in Kuratowski’s 
book ({3], Theorem 2, p. 417) in the special case of complete metric spaces, but is 
missing in many standard textbooks on General Topology as well as in Haworth 
and McCoy’s excellent monograph on Baire spaces [2], where some of its conse- 
quences are established. We believe the result deserves to be included among the 
standard characterizations of Baire spaces. The proof that we offer is very simple, 
and different from the proof of the special case in [3]. In order to present it let us 
recall a few concepts. 

Let X be a topological space and let A CX. We denote the closure (or 
interior) of A in X by A (orint A). Recall that a set A CX is nowhere dense if 
int A = @. A subset of X is meager if it is a countable union of nowhere dense 
sets. As already mentioned we shall call a topological space XY a Baire space if no 
nonempty open set in X is meager, 1.e., meager subsets of X have empty interiors. 
Recall that a subset A of X is an F.-set if it can be represented as a countable 
union of closed subsets of X, that is A is a G;-set if and only if X —A is an 
F -set. 
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THEOREM. A space X is a Baire space if and only if meager G;-sets in X are 
nowhere dense. 


Proof. For sufficiency, note that any meager nonempty open set would be a 
meager G,-set with nonempty interior, contradicting our condition. To show the 
necessity, let A be a meager G;-set in X. Then X — A is an F.-se* and hence 
A-A=AN(X — A) is also an F,-set being the intersection of a closed set and 
an F.-set. Let_A —A = UF, bea representation of A —A as a countable union 
of closed sets. Since A — A has empty interior (this is true for any set A in X!), 
each F, has empty interior and as a closed set with empty interior, F, is nowhere 
dense. Therefore, A — A is meager and since the union of two meager sets is 
meager, 4 = AU(A — A) is meager. Since X is a Baire space, int A = @ and 
hence A is nowhere dense. O 


A topological space X is defined to be a 7, space if whenever x and y are 
distinct points in X there is an open set containing x but not y. Aspace X is T, if 
and only if each singleton subset of X is closed. Recall that a point p in a space X 
is called isolated if the set {p} is open in X. 


Coro.iary | ((2], Proposition 1.29). Let X be a Baire T, space with no isolated 
points, and let G be a countable G;-set in X. Then G is nowhere dense in X. 


Proof. Simply notice that countable sets in a T, space without isolated points 
are meager and apply the theorem. O 


CoROLLARY 2. © is not a G;-set in R. Nonempty open sets in IK are uncountable. 


CoroLiary 3 ((2], Proposition 1.30). If X is a countable Baire T, space then the 
set of isolated points of X is dense in X. 


Proof. Observe that the set of non-isolated points of X is a meager G,-set and 
apply the theorem. O 


The interested reader may check that Propositions 1.31 and 1.32 in [2] can also 
be easily established by use of the theorem above. 
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Visual display of a (finite) group as a Cayley table is usually introduced fairly 
early in an elementary course on abstract algebra. It is instructive in that several 
basic properties can be read off from the pattern of such tables. In particular, a 
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Cayley table is a latin square, i.e., the rows and columns are permutations of one 
another. However, the associative law is not easy to discern by the naked eye. A 
natural question is: which latin squares are Cayley tables? 

We shall use 1,2,..., N to denote the entries in a latin square and a(i, J) will 
denote the entry at the ith row and jth column. Below are two latin squares. Can 
you tell which one is a Cayley table (or not)? 


12 3 4 5 12 3 4 5 
2 5 4 1 3 2 4 1 5 3 
3 1 2 5 4 3 1 5 2 4 
43 5 2 1 4 5 2 3 1 
5 4 1 3 2 5 3 4 1 2 


Exhaustive checking involves N° pairs of triplets. M. Frolov [4] observed that a 
Cayley table possesses a necessary property, which turns out to be sufficient to 
guarantee that a latin square is a Cayley table. This is known as the “Quadrangle 
Criterion” [3, p. 18], viz. if a, j) = ai, j,), ak, —) = alky, jy), ak, 1D) = alk, l,), 
then ati, /) = a(i,,1,). H. Brandt [2] showed that it suffices to assume a(i, J) is the 
identity element. J. Aczél referred to the “Quadrangle Criterion” as Reidemeister 
Criterion and deduced from it one version of Brandt’s criterion [1]. H. Zassenhaus 
[6, p. 4] and A. Hammel [5] phrased Brandt’s criterion as a “Rectangular Rule.” 

In this note I wish to explain another means of checking which seems more 
convenient and which is based on the idea of the celebrated theorem of Cayley 
(1878) which says that every (finite) group is isomorphic to a group of permuta- 
tions. Another reason for its pedagogical interest lies in this connection with 
Cayley’s Theorem, which, basic though it is, is seldom used again in an elementary 
course after it has been stated and proved. Students see it, probably as an 
illustration in working with notions such as permutation, cancellation, isomor- 
phism, but will not see it again (unless they study much later regular representa- 
tion of a group in an advanced course). After all, it is no easy job to ferret out a 
subgroup of order N in the full symmetry group S,, with many more elements! 
Thus, it is comforting to be able to apply this idea in a situation which students can 
appreciate in an elementary course. (Towards the very end of [5], A. Hammel 
employs this same technique, but the explanation given is too brief and omits a 
main step of argument, especially in view of the pedagogical interest mentioned 
above.) 


THEOREM. A Latin square is a Cayley table if the composite of two rows is some 
row in the table. 


As an example, consider the two latin squares at the beginning. The one on the 
left is not a Cayley table, for the 2nd and 3rd rows yield a composite row (4253 1) 
which is not a row of the square. The one on the right is seen to be a Cayley table 
after checking all composite rows. For instance, the 2nd and 4th rows yield a 
composite row (53412), which is the 5th row of the square. 

To prove the theorem, it remains to check if associative law holds in the set 
G = {1,..., N} with binary operation * defined by the given latin square. We shall 
assume 1 is the identity element. Define ¢ from G into A(G), which is the set of 
all bijections on G, by g(ax) = a* x for all x in G. The left cancellation law 
guarantees that g(a) is indeed in A(G). Although G is a set for all we know, A(G) 
is a group under composition. Associative law in G amounts to the relation 
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g(a)o o(b) = g(a*b) for all a,b in G. If that is true, then g(G) will be a 
subgroup of A(G). Conversely, if e(G) is a subgroup of A(G), then the associative 
law holds, for we claim that g(a)° e(b), which is g(c) for some c in G, is indeed 
g(a*b). To see this, we shall show that c must be a*b. Towards this end, 
consider g(a)o o(b)(1) and g(c)(1). The former is equal to e(a)(b *1) = p(a\(b) 
= a * b, while the latter is equal to c *1 = c. Hence, it reduces to check if @(G) is 
a subgroup of A(G). Since A(G) is a finite group we need only check if the 
composite of any two permutations in g(G) is still a permutation in ¢(G). This can 
be achieved by checking the rows of the latin square. 
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An Overview of the Calculus Curriculum Reform Effort: Issues for 
Learning, Teaching, and Curriculum Development 
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1. Introduction to the calculus reform effort. Substantial discussion is underway 
in the mathematics education and mathematics communities concerning calculus 
pedagogy and curriculum [6, 9, 28] and there is considerable momentum for a 
national reform of the college calculus curriculum. Prominent mathematicians 
have elected to become involved in this process, and are aware of the immensity 
and importance of the enterprise. Young [34] comments: ‘Calculus is our most 
important course... The future of our subject depends upon improving it.” A brief 
summary of national data alone provides compelling documentation of the need 
for change. Annually, about 600,000 students enroll in calculus in four-year 
colleges and universities. About half of these students are enrolled in mainstream 
“engineering” calculus; of these 300,000, only 140,000 finish the year with a grade 
of D or higher. The average section size is 34 students; only 7% of calculus courses 
use computers; and exam questions require students to “solve, sketch, find, graph, 
evaluate, determine, and calculate” in a straightforward fashion. Surveys suggest 
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g(a)o o(b) = p(a*b) for all a,b in G. If that is true, then g(G) will be a 
subgroup of A(G). Conversely, if e(G) is a subgroup of A(G), then the associative 
law holds, for we claim that g(a)o e(b), which is g(c) for some c in G, is indeed 
g(a*b). To see this, we shall show that c must be a*b. Towards this end, 
consider g(a)o o(b)(1) and g(c)(1). The former is equal to e(a)(b *1) = p(a\(b) 
= a* b, while the latter is equal to c «1 = c. Hence, it reduces to check if e(G) is 
a subgroup of A(G). Since A(G) is a finite group we need only check if the 
composite of any two permutations in (G) is still a permutation in @(G). This can 
be achieved by checking the rows of the latin square. 
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that students rarely are asked to deal with “higher level” problems, and that 
‘“‘nencil and paper algebraic manipulations are the order of the day” [32]. It is no 
small wonder that reform follows. 

Davis observes [8, p. 36]: ‘““When one sees large numbers of inexperienced 
teaching assistants presenting a ‘notational’ calculus to vast numbers of freshmen 
who may not be prepared to think about mathematics, it can seem that only the 
existence of non-conceptual ‘manipulative’ notational alternatives allows the whole 
enterprise to continue to operate... These students show every evidence of having 
suffered from their pursuit of a meaningless, ritualistic manipulation of symbols.” 

The National Science Foundation has funded a number of projects that are 
intended to “explore opportunities for revitalizing instruction in calculus” [21, 
p. 2], and to support development of calculus curricula for the twenty-first century. 
Description of several of these efforts, and others not funded by NSF, is provided 
in the CRAFTY document, Priming the Calculus Pump [6]. Project emphases 
include: mathematical considerations, technological possibilities, and in some 
cases, beliefs or findings about student learning. Mathematical themes include 
optimization, estimation and approximation, differential equations, functions of 
several variables, dynamical systems theory, and numerical methods. Changes in 
the order of presentation of calculus topics, changes in emphasis within a given 
topic (generally toward a more conceptual emphasis), and increased attention to 
applications are appearing. 

Several projects are based primarily on the availability of hardware and software 
that allow new possibilities for the teaching of calculus, including MAPLE, 
Mathematica, graphics packages, and symbolic/graphical calculators. Several pro- 
jects display great enthusiasm for the potential of technology to ameliorate the 
calculus problems.: Specifically, technology is proposed as a means of freeing 
students from algebraic manipulation, reducing the drudgery of calculation, sup- 
porting the learning of fundamental ideas, and allowing exploration of concepts. 

We have identified a set of central issues worthy of careful consideration in this 
reform effort. We propose that those involved consider: potential orientations 
toward curriculum and curriculum development; the body of research available on 
student learning of calculus-related topics; the nature of the role of technology; 
and the role of teachers and instructors in the development and dissemination 
process. Each of these areas is discussed more fully below. Finally, we identify 
several research topics that are arising as part of this reform process. 


2. Curriculum development and research on student learning. Much of the 
calculus reform discussion has centered on streamlining the traditional syllabus to 
create a “lean and lively” course [10]. Work available in mathematics education 
could be useful in this process [17]. For example, Romberg [25] characterizes 
curriculum as an “operational plan detailing what content is to be taught to 
students, how students are to acquire and use that content, and what teachers are 
to do in carrying out that curriculum.” Traditional curriculum development is 
driven by content features, including organization of content into broad categories, 
segmenting of the categories, sequencing of topics, and specific lessons by topic. 
Romberg and Tufte [26] have argued for an approach that considers students’ 
conceptions of mathematics, claiming that the prevailing conception is of mathe- 
matics as a static collection of concepts and skills to be mastered one by one. They 
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propose that “information about how individuals personally construct knowledge 
and store it in memory should be the basis of curriculum engineering.” 

Driver and Oldham’s constructivist model [12] for curriculum development is 
very useful in considering the current reform. This model specifies four compo- 
nents that influence the design of curriculum: decisions on content, information 
about students’ prior ideas, perspectives on the learning process, and teachers’ 
practical knowledge. These components influence curriculum design, which con- 
sists of the design of learning strategies and materials. Implementation and 
evaluation follow, producing feedback which informs the original four components 
in an ongoing system of development. 

Existing research on student learning in calculus has potential for strong impact 
on the development of calculus curriculum. A predominant theme in this area of 
research is that the learning of mathematics is a constructive process. This process 
occurs in the classroom context, with the learner attempting to make sense out of 
information by evaluating, connecting, and organizing it relative to prior experi- 
ences and existing knowledge structure. This line of research has challenged 
traditional assumptions about learning, such as the view of the student as a passive 
recipient of knowledge, or the notion that a misconception can be eradicated by 
the “perfect explanation” or by simply reorganizing the content. Research within 
this perspective suggests that knowledge about the variety and strength of student 
understandings must be taken into account when developing curriculum that seeks 
to facilitate learning and improve understanding. 

Studies that have investigated student understanding of mathematical concepts 
use techniques such as the clinical interview, the teaching experiment, and the 
analysis of student errors. The use of qualitative techniques allows the researcher 
to generate a more viable model of the student’s learning process than the more 
traditional quantitative methods [33]. A large portion of the research on student 
understanding has been conducted in the areas of early arithmetic, algebra, 
geometry, and physics [1, 2, 7, 18, 24, 29]. 

In contrast, most studies in calculus have been large-scale, quantitative attempts 
to model how attitudinal and experiential variables affect overall performance 
[9, 13, 14, 15, 20]. Although several of these studies have provided valuable 
information, most lack a consistent perspective of student learning and have had 
minimal influence on understanding or performance. 

A summary of the research in the area of student understanding of fundamental 
calculus concepts will be presented in the remainder of this section. The summary 
is divided into four areas: function, limits and continuity, the derivative, and the 
integral. In addition, several curriculum considerations suggested by this research 
will be discussed. 


2.1 Function. Several researchers have investigated student understanding of 
the concept of function. This is one of the most fundamental concepts underlying 
the calculus. Despite its importance, students come to calculus with a primitive 
understanding of the concept and hold deeply rooted and firm misconceptions [7]. 
Students understand a function to be a formula; a graphical representation with no 
formula has no meaning for most entering calculus students [11]. In addition, their 
concept of function is static, in that they consider the function by thinking about 
one point at a time [19]. This is at odds with the traditional early expectation in 
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calculus that students understand secant lines “moving”? on a curve to approach 
the tangent line, or are able to decide if “f(x) is approaching some L as x 
approaches a.” 

Dreyfus and Eisenberg [11] suggest some additional characteristics of student 
understanding of function based on their interviews with algebra students. First, 
they observed a strong subservience to symbolism. Second, students have a strong 
attraction to linearity in that they will treat general functions as linear functions. 
Third, there is a notion that a relation is a function only when it can be 
represented by a single formula. Finally, students tend to view algebraic data and 
graphical data as being independent. 

In our study involving clinical interviews with first-semester calculus students 
[16], we found that the students’ understanding exhibited similar characteristics. 
Many of the students interviewed were not able to provide any type of general 
definition of function but readily gave examples of functions by writing formulas. 
There is little evidence that the students see functions as objects of study in 
mathematics; rather, when a function is given, in equation form, usually one is 
expected to do something to it, such as substitute in a value. This part of studying 
functions (plugging in values) seems to be firmly established and becomes their 
way of working with other calculus concepts such as limit. 


2.2 Limits and continuity. Confrey [4] argues that students are likely to begin 
their study of calculus in one of three ways: with a discrete understanding of 
continuous ideas (time is a series of instants, motion a series of positions, etc.); 
with an independent transition to the idea of continuity; or with an algorithmic 
approach. Tall and Schwarzenberger [30] have investigated conflicts in the learning 
of limits and explored student responses to questions similar to those used by 
Confrey. They state that some of the difficulty arises from an information transla- 
tion of very sophisticated ideas and use of the example of the definition of a 
sequence (s,) of real numbers tending to a limit s: “Given any positive real 
number e > 0, there exists N (which may depend on e) such that |s, — s| < € for 
all n > N.” A typical informal translation is, ““We can make s, as close to s as we 
please by making n sufficiently large” [30, p. 44]. Tall et al. state that student 
difficulty may be inherent in the colloquial meaning of the terms used in this 
informal translation. For example, to many students the word “close” means near 
but not coincident. The student may reason from the informal translation that s,, 
can be close to, but not equal to, s. Other research in this area [16] suggests that 
students are quite successful at evaluating limits of the form lim f(x) as x > a 
where f is continuous at a. When given a simple limit problem in this format, with 
the appropriate graph displayed, most of the students in the study were able to 
solve the problem but showed very little geometric understanding. One of the 
students interviewed claimed that “the graph can’t help me find an answer.” 
Further probing revealed that the notion of “approaching” was not part of her 
understanding of limit. She saw limit problems as functions to be evaluated, and 
wasn’t sure about all the “extra” notation (the arrow, the word “‘lim’’). 


2.3 The derivative. Orton [22] conducted a large-scale qualitative study to 
investigate student understanding of differentiation and integration. Interviews 
were conducted with 110 students, 55 males and 55 females, between 16 and 22 
years of age. The students were majoring in mathematics and had recently “sat 
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for” an examination that included elementary calculus. Orton found that the 
routine aspects of differentiation were well understood. All but four students were 
able to differentiate quadratic polynomials. A higher frequency of errors occurred 
when students were asked to differentiate a function of the form y = a/x. Most 
of the errors involved calculating the derivative of the quotient as the quotient of 
derivatives. 

Answers on more conceptual tasks were far from satisfactory and the results 
suggest little “intuitive” understanding of the derivative, as well as some funda- 
mental misconceptions. 

Areas of student difficulty related to the tangent as the limit of a set of secants 
and to the ideas of rate of change of a straight line versus rate of change of a curve 
and rate of change at a point versus rate of change over an interval. According to 
Orton, the fundamental concept underlying rate of change is that of proportional- 
ity, which most of the students should have met in their study of graphs in 
elementary algebra. The results of this study suggest that the concept of propor- 
tionality may not be that elementary for a large number of students. 

Orton suggests several implications for the curriculum and teaching based on 
these results. First, the basic concepts of calculus need to be revisited and 
reinforced at various times during the students’ mathematics education. This in 
turn might mean stronger collaboration among secondary schools, colleges, and 
universities. Second, an initial approach to calculus concepts should be “informal” 
and involve numerical and graphical explorations. Orton states that real-life data 
should be employed, followed by a more algebraic development of the fundamen- 
tal ideas. In addition, the role of technology in these explorations needs to be 
considered. 


2.4 The Integral. Orton’s study [23] also investigated student understanding of 
integration. Results suggest that students are able to apply, with some facility, the 
basic techniques of integration. However, further probing indicates that they 
possess fundamental misunderstandings about the underlying concepts. Orton 
found that the procedure of dissecting an area or volume, making use of the limit 
process, and the reasons why such a method works, were not part of most of the 
students’ understanding of the integral. Activities need to be developed that 
explore the idea of limit in an intuitive way at all levels of the mathematics 
curriculum. Such explorations might involve exploring patterns, such as sets of 
polygons with an increasing number of edges, elementary number series, the 
Fibonacci sequence, and iterative solutions to equations [23]. Throughout these 
explorations the nature of the approximation process should be emphasized. 
Calculators and computers can serve as tools that aid such investigations. Orton 
comments, “the calculator does provide us with the opportunity of numerical 
approaches to calculus, and better understanding of the arithmetic may lead to 
better understanding of the algebraically equivalent procedures” [23, p. 11]. A 
curriculum is suggested that begins with an informal approach and becomes more 
formal with each new look. 

In summary, research has found student understanding of central calculus 
concepts to be exceptionally primitive. Students demonstrate virtually no intuition 
about the concepts and processes of calculus. They diligently mimic examples and 
crank out homework problems that are predictably identical to the examples in the 
text. Misconceptions exist as a result of student attempts to adapt prior knowledge 
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having trouble. Order of presentation, emphasis of topics, and development of 
concepts are all more widely variable in the freshman calculus context than they 
are at earlier levels. Thus the problem of “national impact” is not trivial in any 
way. 

In searching for a guide for thinking about this important issue for implementa- 
tion, the experience of Cobb et al. [3] is especially helpful. The curriculum 
materials used in their research were developed in the context of the actual 
classroom, and were based on ongoing observations of children’s mathematical 
activity in the classroom. This way of viewing the development of curriculum fully 
acknowledges the teacher’s role in the implementation of curriculum, and could 
possibly be adapted as part of the calculus effort. 

For any of the materials produced in these projects to become used in any 
widespread fashion, the dissemination process is indeed challenging. Whether the 
materials are organized with a novel mathematical perspective, as many are, or a 
new educational perspective, the dissemination should help convert people to the 
perspective. Given that materials may be used in a piecemeal fashion, when parts 
are appealing, the potential impact of this process will be to change people’s way 
of thinking about the teaching and learning of calculus. 


5. Research directions. A number of mathematics education research questions 
arise in conjunction with the calculus effort. Several questions relate to the scope 
and sequence of the mathematics content. Examples include: How does one decide 
which parts of the traditional curriculum can best be omitted? Is it more helpful to 
students to introduce the idea of limit before the idea of derivative? What are the 
effects of introducing substantial physical applications within the calculus course? 

Many questions arise that relate directly to student learning and background 
matters. Examples of relatively broad questions include: Does lack of algebraic 
facility truly hinder calculus learning? How does the experience of secondary 
school calculus relate to the experience of college calculus? How does student 
“intuition” develop? Do physical examples help in the learning of concepts? How 
can underrepresented groups best be served in this revision process? How does 
procedural proficiency relate to conceptual understanding? Is a strong conceptual 
base adequate foundation for applying the ideas of calculus in new contexts within 
the sciences and engineering? There are also some examples of very specific 
learning questions which are rather intriguing, especially when raised by practicing 
mathematicians: Does a pointwise view of function interfere with the understand- 
ing of continuity on an interval? How natural and convincing is the traditional 
means of introducing the tangent line through the use of secant lines? 

A third category of questions relates to pedagogical strategies and their interac- 
tion with mathematics content as well as student learning. Examples include: Will 
students be able to understand calculus concepts without extended manipulative 
practice? How does the use of writing enhance calculus learning? What are the 
relative benefits and disadvantages of large lectures? Can students learn calculus 
by working in pairs or groups? How can technology be helpful/detrimental with 
respect to student misconceptions? How do students use calculus textbooks, and 
what helpful features should be incorporated into textbooks? 

Finally, there are questions related to teachers. How do teachers use textbooks 
and supplementary materials at the level of calculus? Will there be “teacher 
resistance” to widespread use of technology? 
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6. Conclusion. The calculus reform effort may prove to be an exciting phe- 
nomenon for those interested in mathematics education at this level. There clearly 
are opportunities for exciting work in curriculum, student learning, use of technol- 
ogy, and teacher development to inform this process in a productive fashion. In 
addition, possibilities for ongoing research that might both guide and result from 
this process seem rich and interesting. 
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1. Introduction. One of today’s most exciting areas of mathematics is the study 
of dynamical systems. There are numerous unsolved problems and the field is 
extremely active. Not only mathematicians, but also ecologists, chemists, 
economists, and physicists have become involved in the field. The theory of 
dynamical systems is used in computer graphics, population models, and meteorol- 
ogy, to name a few. Many mathematicians feel that some knowledge of the subject 
is imperative. It is not difficult to accomplish this, since certain sophisticated topics 
can be learned rather easily at a very elementary level. In fact, they can be 
incorporated into the calculus sequence. Simple examples of dynamical systems 
may be shown to the student when introducing the composition of functions. 
Continuity and the Intermediate Value Theorem are the only tools necessary to 
prove a special case of a very beautiful theorem of Sarkovskii, thus making it 
accessible to a calculus student. The shift map, which is extremely important in the 
study of dynamics, can be used as an illustration of continuity on a more abstract 
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level. I shall begin with a simple example of a dynamical system suitable for an 
elementary calculus student. 

Suppose f is a function defined everywhere. Let f” denote the composition of 
f with itself n times, that is, f* =f°f, f? =fe(fe f), and so on. 

The set of points {xy, f(x), f (9), f*(%o),...} is called the orbit of x. under 
f. The point x, is called a period-n if f"(x)) =x, and vn is the smallest such 
number. 

An important area in the field of dynamical systems is precisely the study of 
orbits of points under different functions. 


Examp_e. Let f(x) = x? 

Graphical analysis, as shown in FiGurRE 1, may be used to “determine” the orbit 
of a point. Observe that f(x) = x? has 2 fixed points: 0 and 1. The points in the 
interval (0, 1) all go to 0, while the points in (1, ©) all go to infinity. A student may 
try to look at the orbits of other elementary functions such as 2x + 4, x°, sin x, e”. 


(x9) f(xo) Xo 
f(x) = x? 
Fic. 1. 


Now, consider a more general quadratic function: f(x) = a(x +h)* + k. It is 
interesting to see how the orbits of points are affected by changes in the 
parameters. One can ask a student to analyze graphically the orbits under f with 
specific values of a, h, and k. This turns out to be a nontrivial problem. 

A well-known quadratic function, often used as a population model, is F(x) = 
wx — x) [1, pp. 31-38]. The dynamics of F,, a seemingly simple function, turn 
out to be quite variegated. It can be easily verified that for uw > 1, F, has two fixed 
points, 0 and p, = (wu — 1)/u, where 0 <p, <1, and that F(x) > —o as 
n — © for any x <0 or x > 1. Thus, all the interesting dynamics of F, occur in 
the unit interval. If 1 < w < 3, graphical analysis shows that lim, _,., F(x) = Di, 
for all x in (0,1). The dynamics of F, become increasingly complicated as u 
passes through 3. See Figures 2—5, where a solid line represents F?. 

Once pu > 4, certain points leave the unit interval after one iteration, since the 
maximum value of F,m@ > 4, is larger than 1. See Figure 6. 
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0 0.1 0.210.310.4 0.5 0.6 0.7 0.80.9 1 Vi 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 
x x 
Fic. 2. Seven iterations of 3x( — x). Fic. 3. Seven iterations of 3.3x(1 — x). 


perereny, 


O 0.1 0.2 0.3 0.4 0.5'0.6 0.7 0.8 0.9 ' 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8'0.9 1 
x Xx 


Fic. 4. Seven iterations of 3.5x(1 — x). Fic. 5. Seven iterations of 3.7x(1 — x). 


1f(xo) x 


F, = pxQ — x), p> 4. 


Fic. 6. F, = wx(1 — x), p> 4. 
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Note that the only fixed points of ao are 0000... and 1111.... Also observe 
that 010101... and 101010... are the only period-2 points. What are the other 
periodic points of a? Any period-n point has a repetitive “block” of n entries. 

A simple example where the shift map is very useful involves a map of the 
circle. Let S' be the circle of radius one. We measure the angles in turns. Let 
f: S’ — S’ be the map that doubles the angle from the x-axis. We can label the 
upper half of the circle by 0 and the lower half by 1. 


s} 0 fx) 


f7(x) 
1 x © 001... 


Fic. 7. f: S' > S!, an angle doubling map. 


Now, to every point on S! we may associate a sequence in D analogous to the 
way such sequences are associated to numbers in [0, 1] via base 2 expansions. We 
define a function H: S' > D by H(x) = 59515) °:* , where s; = 0 if f'(x) is in 
the upper half of the circle, and s, = 1 if f’(x) is in the lower half of the circle. 
Note that H is not a continuous function. Two points x, and x, may be arbitrarily 
close to each other, yet on the opposite sides of the half circle, with the result that 
d(H(x,), H(x,)) > 1. Nonetheless the map f can be viewed as the shift map. That 
is, the following diagram commutes. 


gi fog 


| |x 
D—> D 


It is much more difficult to show topological conjugacy of F,, where pp > 2 + V5, 
and the shift map. The proof of this conjugacy is shown in [1, pp. 44-47]. 


3. Sarkovskii’s theorem. In studying the dynamics of any function it is quite 
important to know its periodic points. There is a striking theorem of Sarkovskii 
dealing with periodic points of continuous real functions. A special case of 
Sarkovskii’s Theorem, dealing with a period-3 point, was discovered independently 
by Li and Yorke [3] and later reproved by Straffin [4]. The general theorem, using 
Sarkovskii’s ordering of positive integers, as well as the special case, are discussed 
in [1, pp. 60-68] and in [2]. Depending on the ability of the class, the special case 
of this theorem may be presented either in a strong intermediate or an advanced 
calculus course. The proof shown below is at a level suitable for an undergraduate. 
This early introduction to dynamics can spruce up the calculus sequence for faculty 
as well as for students. 


THEOREM. Suppose f is a continuous function whose domain and range is the set 
of real numbers. If f has a period-3 point, then it has period-n points forn = 1,2,... . 


Proof. Let a be a period-3 point of f. Let b = f(a), c = f(b). Without loss of 
generality we can assume that a < b <c. (If not, let g(x) = —f(—x).) 
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(a, b) 


a = (c, a) 


Fic. 8. A function with a period-3 point a. 


It is geometrically clear that a continuous curve joining the points (b,c) and 
(c,a), a <b <c, must intersect the line y = x. This implies that there is at least 
one fixed point x of f in the interval [b,c]. More formally, the existence of the 
fixed point in [b, c] follows from the Intermediate Value Theorem, since the image 
of the interval [b,c] under f contains [b,c]. Thus we know that f has a period-1 
point. 

Now, [a, b] < [a,c] < f*({a, b]). By a similar argument as above one concludes 
that there is at least one fixed point, x,, of f* in[a, b]. Do we know, however, that 
there is at least one fixed point of f* which is not also a fixed point of f? By 
continuity and the facts that f(b) = c, and c > b, we can make a simple argument 
that there indeed is a genuine period-2 point in [a, b] as follows. If there are fixed 
points of f in [a,b] then there is a maximal fixed point z, <b. That is, if 
x € [a, b] is a fixed point, then x <z, <b, since f(b) =c and f is continuous. 
Furthermore, by continuity of f we can choose ¢ > 0 small enough so that 
if z, =z, +e then a < f(z,) <b. There are no fixed points of f in [z,, bd]. 
Let f(z,)=a,. Then f([z,,b]) Dla,,c] > [b,c] and f({b,c]) > [a,b]; thus 
f°€z,,b)) > a,c] > [z,,b]. Therefore f* has a fixed point in [z,,b]; that is, 
there exists x, € [a,b] such that f7(x,) =x, and f(x,) # x. 

Next let H) = [a,b] and H, = [b,c]. To prove that there are period-n points 
for n > 3, we construct a sequence of intervals {J,}*_, having the following two 
properties: 


lL A,y=Khol5h lh}... 
2. fid,) = Ih. 


This construction is possible since the image of H, under f contains H,, and thus 
there is a subinterval of H, whose image is exactly H,. Proceeding inductively we 
get the desired nested sequence of intervals. 

Now we shall construct another sequence of intervals, {J,}?_,. By the second 
property f(,) = Ip, so f?U,) = fU,) = fC) > H. Since f*U,) > Hp, there is a 
subset of I,, say J, such that f*(J,) = H,, while f(J,) C H,. Similarly, there exists 
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J, CI,, such that f?(J,) = Hy while f*(J,) CH, and f(J,) C H,. We form a 
sequence of intervals {J,}?_, such that: 

1. J, cI, for alli > 0 

2. fi(J,_,) = Hy while fiJ,_,) CH, for all j <i,i> 1. 
Observe that f"(J,_,) = f(H)) > H, DJ,_,. Therefore, f” has a fixed point, p, 
in J,_,; that is, f"(p) =p. It is indeed a period-n point of f, since p, 
f(p), f?*(p),... f" *(p) € H,, f" (p) Ee Hy and fp) =p EH, again. If 
f”~*(p) is in the interior of H,, then the proof is complete. If f”~'(p) is on the 
boundary, f”~'(p) = a or b, which implies that p = f"(p) = b or c. This, in turn, 
implies that f(p) € H, or f*(p) € H, which indicates that n = 2 or 3, contradict- 
ing n > 3. O 


It should be noted that the theorem has a very weak hypothesis while the 
conclusion is rather amazing. The existence of a period-3 point implies the 
existence of all other periods. The general theorem of Sarkovskii is even more 
powerful. This theorem involves a special ordering of the natural numbers, defined 
as follows: 


3P5P7> +--+ b2-3P2-5>--- P22-3p27-5D-:: 
D2°>-3p23-5p -+- p23 p2*p2P 1. 


SARKOVSKII’s THEOREM. Suppose f: RK — R is continuous and has a period-k 
point. If k > r, then f also has a period-r point. 


The function F(x) = wx(1 — x) provides an example where both Sarkovskii’s 
Theorem and a shift map is used. It can be shown by elementary computations that 
F235 has a period-3 point. By Sarkovskii’s Theorem F;.,, must have periodic 
points of all other periods. Computations, however, do not reveal any other 
periodic points. A special type of a shift map is used to locate the other periodic 
points (see [1, §1.13]). 

I would like to thank Professor David Knee for his helpful comments on this 
paper. I would also like to express my appreciation to the referees for several 
suggestions and to my student, Robert Ryan, for providing some of the graphs. 


REFERENCES 


1. Robert L. Devaney, An Introduction To Chaotic Dynamical Systems, second edition, Addison- 
Wesley, Redwood City, California, 1989. 

2. W. de Melo and S. J. van Strien, Lectures on One-Dimensional Dynamical Systems, to appear. 

3. T. Y. Li and J. A. Yorke, Period three implies chaos, this MONTHLY, 82 (1975) 986-988. 

4. Philip D. Straffin, Jr. Periodic points of continuous functions, Math. Mag., 51 (1978) 99-105. 


A Simple Interpretation of the Complex Contour Integral 


ALAN GLUCHOFF 
Department of Mathematical Sciences, Villanova University, Villanova, PA 19085 


Is there a simple interpretation for the familiar complex contour integral 
{-f(z)dz, one as accessible as those for the integrals of real-valued functions? My 
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working examples by the formula /’f(z(t))z'(t) dt can be a baffling experience; 
what sense is a beginning student to make of the results 


F Ll TT 
i Re zdz = Ti or —dz = —? 
|z| =1 1 Z 2 

I would like to point out a way of looking at /{. f(z) dz which relies only on the 
geometry and algebra of complex numbers and can be introduced early in the 
discussion of integration. Using this idea, several examples will be worked out to 
show how values for some elementary integrals can be found. 

Suppose a function f is defined on O, an open subset of C and C is a contour 
from A to B inside O. We will assume that /{.f(z)dz has been given a rough 
definition as lim L7_, f(zp(z, — z,_,), where {z,}/_» are subdivision points of C, 
with z) = A and z, = B;z; is a point on C between z,_, and z,, but will not 
worry about the details of convergence. Now if L(C) is the length of C and 
|Z, — Z,_,| is the distance from z,_, to z, in the complex plane, suppose {z,}?_, 


are chosen so that |z, — z,_,| = L(C)/n, and that zf = z,_,; then we have 
“ “ (2, — 2-1) 
DL F(R) (e = Ze) = Me |Z_ Ze 
k=1 k=1 “kK ~ 4k-1 


L(C) 2 (2, — 2-1) 
= Z _ TT 5 
n XE k 1) Iz, ~z,_]| 
thus 
| I (2% — Z%-1) 
—— z)dz = lim — Z,—1) 
HO [f® fim =X f¢ TE oe 
Now as n> ~,(z, —z,_,)/|z, —Z,_,| is nearly equal to the “unit tangent 


vector” to C at z,_,, that is, the complex number of modulus one whose vector 
representation is tangent to C at z,_,. So, U/L(C))/- f(z) dz can be approxi- 
mated by the average value of the product of f at z, with the unit tangent vector 
at z,, for {z,} a subdivision of C. Passing to the limit, one obtains 
(1/L(C)) {[ef(z) dz = average for z on C of [f(z)- unit tangent at z], and if we 
write T(z) = unit tangent to C at z, we have the formula 


1 
Cy LAC) de = ae LITA 


This is the proposed interpretation; we make a few comments before passing on to 
examples. 

First, the average appearing above can be viewed as a vector average in the 
plane. The idea of the integral as an average should be familiar to students from 
earlier calculus courses, but I have not seen it introduced into the complex setting 
in this way. One can of course arrive at this interpretation merely by considering 
the formula {?f(z(t))z'(t) dt, where z(t) is a parametrization of C, but ideally our 
argument would be introduced immediately after a general discussion of the 
integral as a limit of Riemann sums. This parallels the discussion of integrals of 
real-valued functions and forms a counterpart to the “conformality” interpretation 
of differentiability at a point of nonvanishing derivative, which also relies only on 
algebraic and geometric properties of complex numbers. 
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From this result it is a short step to showing that, by approximation of an arbitrary 
contour by polygonal line segments, 


B* — A’ 
[zaz = —___—_—— 
C 2 


for any path connecting A to B, thus exhibiting path independence. 


In some cases an exact answer is not obtainable by this approach, but an 
approximation can be provided, giving some properties of the answer (for example, 
that the answer is purely real or imaginary). 


Example 5. [.1/zdz where C is the straight line from 1 +i to 2 + 21: Now 
1 1 1 1 il+i 
V2 Is a. Kc Zz v2 |? 

so if z=t+uec, 1<t<2, 1/z=1/t1 +1); thus the average is 

QV1 << [1 /tv2 |. Although it is not clear what the average is, we can see from this 

that {.1/z dz is positive and between 1/2 and 1. [Actual value = log2 = 0.693...]. 


Example 6. {- log zdz,log z is principal value, C is the contour |z| = 1 be- 
tween ~—i and i, counterclockwise. We have 1/(7/2)/- log z dz = av, < liz log z] 
= av,-cl—z Arg z]. Considerations of the image of C under —z Arg z imply an 
estimate of si for this average, —7/2 <s < 0, thus we can estimate the value of 
the original integral as yi, —77/4 < y < 0. [Actual value is —2i)]. 


Example 7. {-1 dz,C any simple closed path. We have 


1 
—_fidz= wT 
ne I. z= au TZ), 


and it is intuitively clear that this average is 0; thus the integral has a value of 0. 


It is hoped that the estimation of integral values using this approach gives the 
student a feeling for the roles played by the function and contour in determining 
{-f(z) dz, can act as a check on results obtained by the more standard computa- 
tions, and generally transforms the integral into an object of substance. Of course, 
the simplicity of the functions and contours chosen makes estimation easy; for 
more complicated examples computer plots of the vector fields formed by f(z)T(z) 
for each z on C might yield graphical information which could be used for an 
estimate. This follows the ideas in [1] and [2], where an alternative approach to 
interpreting the integral involving the separation of {.f(z) dz into [.W-Tds + 
i [oW - Nds, where W = (Re f, —Im f), and computer-aided analysis of the two 
real line integrals is discussed. 
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ELEMENTARY PROBLEMS 


E 3451. Proposed by M. A. Khan, Lucknow, India. 


Prove that for any positive integers m,n, N we have 
min(n, N) (N N-j)" min(m, N) (N _ i)” 


2X S(1, J) (Nj)! = au S(m, i) ——— (No! 


where S(n, j) is the Stirling number of the second kind given by 


E 3452. Proposed by C. A. Nicol and J. L. Selfridge, University of South 
Carolina, Columbia, SC. 


If m is an odd integer greater than 3 and ¢ is the Euler function, prove that 
there exists a prime p such that p|(2°” — 1) but p tn. 


E 3453. Proposed by Allen J. Schwenk, Western Michigan University, Kalamazoo, 
MI. 


Suppose n is a positive integer greater than 2. Determine the smallest positive 
number c, with the following property: Given any n distinct real numbers. there 
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exist two of them, say x and y, which satisfy 
x —Yy 


O< < 
1 + xy 


Ch: 


E 3454. Proposed by John A. Baker, University of Waterloo, Ontario, Canada, 
and motivated by the work of Hans G. Kellerer, Munich, Germany. 


Suppose f: R — R is a function such that 


2f(x +1) = f(x) + f(2x) 
for all real x. 
Prove that if f is twice continuously differentiable, then f must be constant. 


E 3455. Proposed by D. G. Rogers, University of Aberdeen, Scotland, and 
Howard University, Washington, DC. 


It is known that if n = 0 (mod 4) or n = 1 (mod 4), then there exist permuta- 
tions (x1, X5,...,x,,) of (1,2,...,) such that the differences |x, — kl, 1<k <n, 
are all distinct. (Cf. E3269 [1988, 554; 1989, 843].) Prove that the number of such 
permutations is a multiple of 4. 


E 3456. Proposed by A. S. Cavaretta, Kent State University, Kent, OH. 


Suppose 0 = my < m, < -++ <™m, are integers such that m, = i (mod 2). 
(i) Prove that a real polynomial 


Cot Cex + res +e,x"%, with coc, # 0 


has at most n real zeros, each zero being counted according to its multiplicity. 
(ii) Prove that the generalized Vandermonde determinant 


xo! x1 eee xn 
m 
xg" xy xr 
is non-zero if Xo, X,,..., x, are any nm + 1 distinct real numbers. 


SOLUTIONS OF ELEMENTARY PROBLEMS 


Feast or Famine of Friday the Thirteenths 


E 3292 [1988, 873]. Proposed by Solomon W. Golomb, University of Southern 
California, Los Angeles, CA. 


(i) What is the maximum number of consecutive days with no occurrences of 
Friday the thirteenth? 

Gi) What is the minimum number of consecutive days with four occurrences of 
Friday the thirteenth? 
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1 1 rae 1 
Xo! xf) wee xn 
m m ee @ m 
xg" xy xr 
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To conclude the argument for (ii) we examine each of the 21 possible ordered 
pairs of consecutive calendar years. Such an examination reveals that no string of 
14 consecutive months contains four Friday the 13ths and that strings of 15 
consecutive months containing four Friday the 13ths occur only for the pairs Cd 
and Ga. These cases give the following 428-day periods containing four Friday the 
13ths: 


Case I. From September 13 of a year of type C through November 13 of the 
following year of type d, 

Case II. From May 13 of a year of type G through July 13 of the following year 
of type a. 


Case I occurs 14 times and Case II occurs 15 times during a 400-year period. 

In each instance of Case I of (ii) the 428-day stretch containing four Friday the 
13ths is immediately preceded by a 426-day stretch containing no Friday the 13ths, 
according to Case 1 of (i). This phenomenon occurs in the years 


1714-1716, 1742-1744, 1770-1772, 1810-1812, 1838-1840, 1866-1868, 1894-1896, 
1906-1908, 1934-1936, 1962-1964, 1990-1992, 2018-2020, 2046-2048, 2074-2076, 


and in years differing from these by multiples of 400. 

Similarly in each instance of Case II of Gi) the 428-day stretch containing four 
Friday the 13ths is immediately followed by a 426-day period containing no Friday 
the 13ths, according to Case 2 of (i). This phenomenon occurs in the years 


1707-1709, 1735-1737, 1763-1765, 1791-1793, 1803-1805, 1831-1833, 1859-1861, 
1887-1889, 1927-1929, 1955-1957, 1983-1985, 2011-2013, 2039-2041, 2067-2069, 
2095-2097, 


and in years differing from these by multiples of 400. 
Finally we remark that Case 1 of (i) occurs without an immediately following 
instance of (ii) in the years 


1725-1726, 1753-1754, 1781-1782, 1798-1799, 1821-1822, 1849-1850, 1877-1878, 
1900-1901, 1917-1918, 1945-1946, 1973-1974, 2001-2002, 2029-2030, 2057-2058, 
2085-2086, 


and in years differing from these by multiples of 400; also Case 3 of (i) occurs in 
the years 1723-1724, 1751-1752, 1779-1780, 1819-1820, 1847-1848, 1875-1876, 
1915-1916, 1943-1944, 1971-1972, 1999-2000, 2027-2028, 2055-2056, 2083-2084, 
and in years differing from these by multiples of 400. 


Editorial comment. For n < 16 Harvey Lipman calculated the minimum num- 
ber f(n) of consecutive days with n occurrences of Friday the 13th and the 
maximum number F(n) of consecutive days with n occurrences of Friday the 13th. 
In particular he obtained the following values: 


n 0 1 2 3 4 5 6 


f(r) 1 29 183 428 547 792 
F(n) 426 699 944 1126 1399 1644 1735 


The given problem asks for F(0) and f(4). 
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There have been at least two earlier Monthly problems concerned with Friday 
the 13th, namely E 36 [1933, 295; 1933, 607] and E 1541 [1962, 919; 1963, 759]. 
E 36 was devoted to B. H. Brown’s famous remark that the 13th of the month is 
more likely to occur on Friday than on any other day of the week. 

E 1541 asked for the maximum and minimum number of “Friday the 13ths” 
that can occur in a year. If a calendar year is meant, Bankoff’s above tables 
provide the answers three and one respectively. If ‘“‘a year’? may be any 12-month 
period, the present problem provides the answers three and zero respectively; 
unfortunately the editorial comment after the solution of E 1541 erroneously 
stated that it is possible for a 12-month period to contain four Friday the 13ths. 


Solved also by C. Aschbacher, W. D. Banks, C. W. Dodge, M. P. Eisner, B. S. Garbow, D. L. Grant, 
R. K. Guy, W. Janous, J. Knapp, S. Kravitz, H. Lipman, M. J. Mackinnon, A. Nijenhuis, V. Schindler, 
J. H. Steelman, R. Stong, M. Vowe, J. T. Ward, C. H. Webster, and S. Wolf. 


The Expected Number of Selectable Subsets 


E 3344 [1989, 734]. Proposed by Richard Stanley, Massachusetts Institute of 
Technology, and J. Michael Steele, Princeton University. 


Suppose 7 points p,, P>,..-, Pp, are independently chosen at random in the unit 
square {(x, y):0 <x < 1,0 <y < 1}. Find the expected number E(n) of subsets § 
of N = {p,, P2,---, P,} for which there exists an increasing function y = f(x) such 
that the elements of S lie below the graph of f while the elements of N — S lie 
above the graph. 


Solution by Richard Stong, University of California at Los Angeles. The expecta- 
tion is rio(7) /k!. Call such a set S a selectable set. With probability 1, we may 
assume the y coordinates of the points are all distinct. Hence we can modify the 
requirement for selectability so that the selected set lies on or below the function. 
If S=t{p; = (x%,, y,} is selectable, then S$ is selectable by the step-function 
fs(x) = supty,: x; <x}. Let T=7(S) be the minimal subset of S for which 
fr = fs; this is unique and consists of the points of S that are “upper leftmost’’ in 
the region of the square on and below f,;. Note that 7 forms a chain in the usual 
chain-product partial ordering on the points in the unit square. 

Conversely, for any subset 7 C N that forms a chain in this ordering, we can 
define S = o(T) to be the set of points in N on or below f, and observe that f-, 
selects this set. Furthermore, S = a(7r(S)), so o and 7 establish a bijection 
between the selectable subsets and the subsets that form chains in the coordinate 
ordering. Therefore, the expectation can be rewritten as the sum over all subsets of 
indices M c {1,...,n} of the probability that the points indexed by M form a 
chain in the coordinate ordering. This probability is simply 1 /|M|!. 


Editorial comment. Richard K. Guy noted that some related results are given in 
Louis W. Shapiro, A lattice path Jemma and an application in enzyme kinetics, 
J. Statistical Planning and Inference 14 (1986), 115-122, and in Roger B. Eggleton 
and Richard K. Guy, Catalan strikes again! How likely is a function to be convex?, 
Math. Magazine 61 (1988), 211-219. 


Solved also by M. Bazant (student), D. Callan, C. Schoen, A. J. Schwenk, and the proposers. 
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Suppose f and g are nonnegative continuous functions compactly supported on 
the real line. Hdlder’s inequality gives 


sup f f(x — y)g(y) dy <IIfllpllalla 


1 


for positive numbers p, g with p~! + q~! = 1. Prove or disprove that 


[sup {f(2 —y)g(y)} dx = Ilfllpllelle. 


Solution I by Philip Spencer and Richard Stong, Harvard University, Cambridge, 
MA. The stated inequality does hold. To see this, let y) be any point at which g 
attains its maximum ||g||... Then sup,{f(x — y)g(y)} > f(x — yoJllgll. and thus 
f sup,{f(x — y)g(y)} dx > Ilfllillgll.. Similarly, f sup,{f(x — y)g(y)} dx > 
I fll gl. Combining these two bounds gives 


[sup {f(x —y)ga(y)} de > If lly lleleP fll Mell”? 


But 
1/p 


iP aee = JUPME* fC) ax] >| [raya] = If 


1/p 


and similarly |lgll1/“Ilello/? > llgll,, from which the desired inequality follows. 


Solution II by Andrew Kresch, student, Yale University, New Haven, CT. The 
assertion is true. Applying Hélder’s inequality to the functions g?~' and f?7! 
yields 


le Mlle? a = [F(x 9)? “(a(v))** ay 


for any x. This may be rewritten as 


Wflle sll > [fe 9)? "(aQy) ay, 
from which it follows that 


sup { f(x —y)g(y)}IFIp ‘gla 


2 [fG ~y)g(y)(f(x—-y))? “(e(y))* ay. 


Integrating both sides with respect to x and making the substitution z = x — y on 
the right yields 


Ile Tels? f sup {f(x —y)g(y)} dx 


> [(F(z))" dz f(e(v))" ay = IF ellgllé, 


from which the desired inequality follows directly. 
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Editorial note. 


A stronger result is proved by L. Leindler (On a certain 
converse of Hodlder’s inequality, Acta Szeged, 33 (1972) 217-223), who showed that 


[sup {f(x —y)g(y)} de > p'/?q'/4If Ilpllell, 


for any positive p and q with p-'+q7! 


= 1. Leindler’s result is best possible in 
the sense that equality holds if f is the characteristic function of [0,1/p] and g is 
the characteristic function of [0, 1/q]. 


Solved also by A. Riese and the proposer. 


A Sum of Permutation Averages 


E 3371 [1990, 151]. Proposed by A. M. Garsia and B. A. Sethuraman, University 
of California at San Diego, La Jolla, CA. 


Let S,, denote the symmetric group on the n symbols 1, 2,...,”. Given o in S,, 
let 


M(oa) = {i: l<i<n,ie o(i)}. 
Put 


P(c) =|M(c)| Ys {i + o(a)}. 


iEM(a) 
Prove that 


» P(o) =(n4+ 1). 


oes, 


Solution by Ron Adin, The Hebrew University, Jerusalem, Israel. For each 
a0 © S,, define o* € S, by o*(i) =) if and only if om + 1 —j) =n + 1-1. The 
map o — o* is an involution on S,. Since i >j if and only ifn+1—-jo>n+ 
1 — i, we obtain |M(o*)| = |M(o)\|, and hence 
P(o) + P(o*) =|M(o)' YY [i+o(i) + (n+1-o(i))+n+1-i] 
ieM(o) 
= 2(n + 1). 
Therefore, 20 P(o) = LI P(e) + P(o*)] = 0 2(n + 1) = 2n + 1)! 
Solved also by P. Bloomsburg, P. Brosnan (student), D. Callan, R. J. Chapman, T. H. Crocker, 
B. M. M. de Weger (Netherlands), K. L. Dove, J. L. Drost, A. A. Jagers (Netherlands), A. Kresch 


(student), R. Lipsett, J. H. Nieto (Venezuela), J. H. Steelman, R. Stong, D. Wolfe, Central Michigan 
University Problem Group, National Security Agency Problems Group, and the proposers. 


The Sum of Arctangents of Reciprocal Squares 


E 3375 [1990, 239]. Proposed by R. J. Chapman, Merton College, Oxford, U.K. 
Express 


an 3 arotan(1/n’) 
n=1 
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Editorial note. A stronger result is proved by L. Leindler (On a certain 
converse of Hodlder’s inequality, Acta Szeged, 33 (1972) 217-223), who showed that 
[sup {f(x —y)g(y)} de > p'/?q'/Ilf llollglla 

y 
for any positive p and g with p-!'+q7! 


= 1. Leindler’s result is best possible in 
the sense that equality holds if f is the characteristic function of [0,1/p] and g is 
the characteristic function of [0, 1/q]. 


Solved also by A. Riese and the proposer. 


A Sum of Permutation Averages 


E 3371 [1990, 151]. Proposed by A. M. Garsia and B. A. Sethuraman, University 
of California at San Diego, La Jolla, CA. 


Let S,, denote the symmetric group on the n symbols 1, 2,...,. Given o in S,, 
let 
M(o) = {i:1 <i<n,izo(i)}. 
Put 


P(c) =|M(c)| Ys {i + o(a)}. 


iEM(a) 
Prove that 


» P(c) =(n+ I)!. 


oes, 


Solution by Ron Adin, The Hebrew University, Jerusalem, Israel. For each 
a0 © S,, define o* € S, by o*(i) =) if and only if om + 1 —j) =n + 1-—-i. The 
map o — o* is an involution on S,. Since i >j if and only ifn+1—-j2n+ 
1 — i, we obtain |M(o*)| = |M(o)|, and hence 

P(o) + P(o*) =|M(o)|' 
ieM(o) 


= 2(n + 1). 
Therefore, 2° P(o) = LI P(o) + P(o*)] = © 2(n + 1) = An + 1)! 


yy [it o(i) +(n+1-o(i))+n4+1-i] 


Solved also by P. Bloomsburg, P. Brosnan (student), D. Callan, R. J. Chapman, T. H. Crocker, 
B. M. M. de Weger (Netherlands), K. L. Dove, J. L. Drost, A. A. Jagers (Netherlands), A. Kresch 
(student), R. Lipsett, J. H. Nieto (Venezuela), J. H. Steelman, R. Stong, D. Wolfe, Central Michigan 
University Problem Group, National Security Agency Problems Group, and the proposers. 


The Sum of Arctangents of Reciprocal Squares 


E 3375 [1990, 239]. Proposed by R. J. Chapman, Merton College, Oxford, U.K. 
Express 


an 3 arotan(1/n’) 
n=1 
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in closed form (in terms of a finite number of trigonometric and hyperbolic 
functions). 


Solution by Amites Sarkar, Winchester College, Hampshire, Great Britain. 


an x arctan(1/n?) anf » Arg} 1 zal 


ott 178 


Noa 


A {ae (w(1 + i)/V2) 
: m(1 +i)/v2 


tan( arctan tan( (7/ v2) - tanh( 7 / v2) 
tan tan(7/V2) + tanh(/V2 ) + tanh(a/ 

( 

( 


tan(a/y2) — tan(w/v2) — tanh(1/v2 ) a/V2 ) 
tan tan(3/V2) + tanh(7/v2) + tanh(7/y2 ) | 
The numerical value of this quantity is about 6.798. 


Editorial note. The problem and its generalizations have made frequent earlier 
appearances dating back to J. W. L. Glaisher in 1878 and most recently as Problem 
399 of The College Mathematics Journal. Readers cited previous appearances in 
works of S. L. Loney, S. Ramanujan, T. J. I. Bromwich, E. W. Hobson, L. B. W. 
Jolley, M. S. Klamkin & M. L. Glasser, and N. Lord. For references and a more 
complete history, see The College Mathematics Journal, vol. 21, No. 3, May 1990, 
pp. 253-4. 


Solved also by 38 others and the proposer. 


Large Graphs with No 4-cycle 


E 3387 [1990, 428]. Proposed by Douglas B. West, University of Illinois, Urbana, 
IL. 


Determine the maximum possible number of edges in a simple graph on ten 
vertices that contains no 4-cycle, and find all graphs achieving this maximum. 


Solution by Arie Bialostocki, University of Idaho, Moscow, Idaho, and 
J. Schonheim, Tel Aviv University, Israel, essentially as given in reference [1]. The 
maximum number of edges is 16, achieved by the two graphs drawn below. 
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Suppose G is a 10-vertex graph with no 4-cycle. Let v be a vertex of maximum 
degree, S its set of neighbors, and T its set of non-neighbors. Suppose |S| = k, so 
that |7| = 9 — k. No vertex of S or T can have two neighbors in S, so there are at 
most [k/2] edges within S and at most 9—k edges between S and T. The 
remaining edges are within 7. For k = 4,5,6,7,8,9, the maximum number of 
edges in T with no 4-cycle is 6, 4,3, 1,0,0, respectively. Including edges incident 
with v, the number of edges in G is bounded by 17,15, 15,13, 13,13 for k = 
4,5,6,7,8,9, respectively. Since maximum degree 3 implies G has at most 15 
edges, G must have maximum degree exactly 4 to exceed 15 edges. 

Consider k = 4. We claim that G cannot have 17 edges, because this requires 6 
edges in 7, achievable only by a vertex of degree 4 and two non-incident edges, 
which prevents placing 5 edges between S and J. Moreover, 16 edges are 
achievable only by 6 edges in 7, 2 in S, and 4 between S and T (the first graph 
above), or by 5 edges in T, 2 in S, and 5 between S and T (the second graph 
above). For the first graph, placing one edge between S and T readily forces the 
others. For the second graph, the only way to place 5 edges in T and 5 between S 
and T is for the vertices in T to induce a 5-cycle, after which the remaining edges 
are again readily forced. 


Editorial comment. Let f(n) be the maximum number of edges in an n-vertex 
graph with no 4-cycle. The asymptotic behavior of f has been well studied. This is 
related to the problem of Zarankiewicz, and a thorough discussion can be found in 
[2]; for large n we have (n/2)(n'/* — n'/9) < f(n) < (n/2)(n — 3/4)” + 1/2). 
The constructions, provided in 1966 by Erd6s-Renyi-Sés [5] and by W. G. Brown 
[3], use projective planes, available when n = g* + q+ 1, where q is a prime 
power. Recently, Zoltan Fiiredi [6] has proved that these are best possible for 

= q’ +q+1and qa prime power exceeding 13, where f(n) = $q(q + 1)’; he 
also characterized the extremal graphs. 

The editors were not aware that the exact answers for small have also been 
determined. Bialostocki and Sch6nheim determined f(10) and f(11). A determina- 
tion of f(n) for n < 21, along with all the extremal graphs, was apparently made 
by William McCuaig in 1984 but never published. These results were also obtained 
by C. R. J. Clapham, A. Flockhart, and J. Sheehan [4]. Letting g(n) be the number 
of extremal graphs, the results are 
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ADVANCED PROBLEMS 


6664. Proposed by Paul Erdos, Hungarian Academy of Sciences, Budapest. 


Let G be a graph whose edges can be covered by n complete subgraphs with n 
vertices each (i.e., G is the union of n copies of K,, with no restriction on shared 
vertices). 

(a) Prove that the chromatic number of G is less than 1 + nyn — 1. 

(b) Prove that this bound is asymptotically best possible, i.e., if f(m) is the 
maximum chromatic number of a graph constructed in this way, then f(n) = 
{1 + o()}n?/””. 


6665. Proposed by José Luis Palacios, New Jersey Institute of Technology, Newark, 
NJ, and Dennis P. Sandell, Swedish University of Agricultural Sciences, Garpenberg , 
Sweden. 


Let S, (m > 0) be a simple symmetric random walk, i.e., Sy = 0 and S, = X, + 
X,+-:: +X, for n > 0, where the X, are independent identically distributed 
random variables with PCX, = 1) = PCY, = —1) = 4. Let N be an arbitrary 
positive integer and let 7 be the first time that the difference between the 
maximum and minimum of the random walk is N, 1.e., let 

T = min{ 7: max S,—- min S, = Nt. 


O0<k<n O<k<n 


Find the expected value of T. 


6666. Proposed by Marcel F. Neuts, University of Arizona, Tucson, AZ. 


Suppose A and B are n by n matrices with positive real entries, suppose A is 
non-singular, and let a denote the largest eigenvalue of BA. 
(a) Prove that there exists an n by n matrix X with positive entries satisfying 


X =A + XBX (*) 


if and only if a < 4. 

(b) If a < ¥, prove that 

(i) there is a solution X, of (*) with positive entries such that if X, is any other 
solution of (*) with positive entries, then all the entries of X, — Xo are non-nega- 
tive; 

(ii) the maximum eigenvalue of BX, is 1 — v1 — 4a )/2; 

Gii) if X, is any solution of (*) with positive entries other than the minimal 
solution X,, then the maximum eigenvalue of BX, is 1 + v1 — 4a@)/2. 
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ADVANCED PROBLEMS 


6664. Proposed by Paul Erdos, Hungarian Academy of Sciences, Budapest. 


Let G be a graph whose edges can be covered by ” complete subgraphs with n 
vertices each (i.e., G is the union of n copies of K,, with no restriction on shared 
vertices). 

(a) Prove that the chromatic number of G is less than 1 + nyn — 1. 

(b) Prove that this bound is asymptotically best possible, i.e., if f(m) is the 
maximum chromatic number of a graph constructed in this way, then f(n) = 
{1 + o(1)}n?/?. 


6665. Proposed by José Luis Palacios, New Jersey Institute of Technology, Newark, 
NJ, and Dennis P. Sandell, Swedish University of Agricultural Sciences, Garpenberg , 
Sweden. 


Let S, (n > 0) be a simple symmetric random walk, i.e., $, = 0 and S, = X, + 
AX, +++: +X, for n > 0, where the X; are independent identically distributed 
random variables with PCX, = 1) = P(X, = —1) = 4. Let N be an arbitrary 
positive integer and let 7 be the first time that the difference between the 
maximum and minimum of the random walk is N, i.e., let 

T = min{ 7: max S,—- min S, = Nt. 


O<k<n O<k<n 


Find the expected value of T. 


6666. Proposed by Marcel F. Neuts, University of Arizona, Tucson, AZ. 


Suppose A and B are n by n matrices with positive real entries, suppose A is 
non-singular, and let a denote the largest eigenvalue of BA. 
(a) Prove that there exists an n by n matrix X with positive entries satisfying 


X =A + XBX (*) 


if and only if a < j. 

(b) If a < 4, prove that 

(i) there is a solution X, of (*) with positive entries such that if X, is any other 
solution of (*) with positive entries, then all the entries of X, — Xp are non-nega- 
tive; 

(ii) the maximum eigenvalue of BX, is 1 — v1 — 4a )/2; 

Gii) if X, is any solution of (*) with positive entries other than the minimal 
solution X,, then the maximum eigenvalue of BX, is (1 + v1 — 4a@)/2. 
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SOLUTIONS OF ADVANCED PROBLEMS 
Inscriptible Polygons 


6571 [1988, 359]. Proposed by Glenn Ierley, Michigan Technological University, 
Houghton, MI. 


(a) Let A(n) be the maximum area of a polygon with n sides of lengths 
1,2,...,n, where n > 4. It is known that the maximum area occurs for a polygon 
inscribed in a circle. (Cf. G. Pélya, Mathematics and Plausible Reasoning, Volume 
1, Princeton, 1954, pp. 174-177.) Let B(n) be the area of a regular polygon with n 
sides and perimeter 1 + 2 + :--: +n. Prove that 

A(n) 7? 
— ~ 3 (n > 00), 
B(n)  3n 

(b) For 1/2 < q < 1 let A(q, n) be the maximum area of a polygon with n sides 
of lengths 1, g,q*,...,q” ‘', respectively, where n is large enough so that g + q’ 
+ +++ +q"~!> 1. Let B(q,n) be the area of a regular polygon with n sides and 
perimeter 1+ q+ q*+-:: +q"~'. Prove that c(q) = lim, _,,, A(q, n)/B(q, n) 
exists and find 


1 —c(q) 
im TTS 
q>1” (1-@) 


Solution by William A. Newcomb, Lawrence Livermore Laboratory, Livermore, 
CA. The limit for (b) is 7*/9. We shall consider the more general problem of 
maximizing the area of an n-gon with sides of prescribed positive lengths /, > lL, 
> +: Sl, satisfying 1, </, + -:: +1,. Steiner’s Theorem, cited by Pélya, states 
that this maximum is achieved by such an n-gon when its vertices lie on a circle. 
We shall first prove that the radius of such a circle is uniquely determined. 

Let F(U,r) = sin~'(//@r)). Observe that a chord of length / divides a circle of 
radius r >//2 into minor and major arcs of angular measure 2F(/,r) and 
27 — 2F(,r). If a polygon is inscribed in a circle, then at most one side, the 
largest, might subtend a major arc. Accordingly, in our case, the sides of length 


l,,...,/, must cut off minor arcs of total angular measure 
2F(r) = ) 2F(L,r) = ) 2sin-'(1,/(2r)). (1) 
jH2 j=2 
Now F(r) <7, and of course r > /,/2. Link chords of length /,,...,/, (Gin any 


order) so that their vertices lie on a circle of radius r. Then the distance between 
the endpoints is 

G(r) = 2rsin F(r), (2) 
and r is a possible radius for a circumscribing circle precisely when G(r) = 1,, 
F(r) <a and r >1,/2. 

Observe from (1) that F is a strictly decreasing function on [/,/2, ©) D [l, /2, ©). 
We define s as follows: if FU,/2) < m then s = 1,/2; if FU,/2) > 7 then s is the 
unique value in [/,/2,) such that F(s) = 7. We now show that G(r) is strictly 
increasing on [s,%) and that the equation G(r) = I, has exactly one solution there. 
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First, some reduction yields 


G'(r) = 2sin F(r) — 2cos(F(r)) tan( F(J;,r)). (3) 
j=2 
If F(r) € [0, 7/2], then the superadditivity of the tangent implies that G’(r) > 0; 
if F(r) € (1/2, 7) the same conclusion is immediate. Finally, it is easy to see that 


lim G(r) =, + °°: +1, 


r— oo 


a sum that exceeds /, by hypothesis. If s =/,/2, then F(s) = FU,/2) < 7, so 
G(s) < 2s =1,. If s > 1,/2, then F(s) = 7, and hence G(s) = 0. In either case 
G(s) <1,, so the equation G(r) = 1, has exactly one solution in the allowable 
range, and so the circle’s radius is uniquely determined. 

We turn now to a more qualitative description. Suppose a polygon with sides /, 
as above is inscribed in a circle of radius r. If the arc subtended by /, is minor, 
then by adding the interior angles we find that 


Qi = y 2F(I,,r). (4) 


Since the right-hand side of (4) is decreasing in r on [/,/2,), it follows that this 
case 1s equivalent to 


I< ¥ 2F(L,,1,/2) =7 +2 y sin-!(1,/1,). (5) 
j=2 


j=l 


In case (5) does not hold, /, subtends a major arc and 
2n = 27 —-2F(l,,r) + ) 2F(L,r). (6) 
j=2 


For the two particular problems posed it will turn out that we need only deal in 
detail with the first case. 


Let us now consider (a). We have /, =n,...,/, = 1 and 
d sin7'(L,/1,) > sin7'(4) + sin7'(4) + sin7'(4) = 5177, (7) 
j=2 


so that by (5) we are in the first case. Let R = R() be the radius of the inscribing 
circle. Then, by (4), 


T= y sin '(j/(2R)). (8) 


Since for 0 < a < 1, we have 


equality (8) implies that C,n? < R(n) < C,n’, so j/(2R) = O(n~'). From the 
Taylor expansion 


sin-'a =a+a?/6+ O(a’), 
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we obtain from (8) the expansion 


n(n+1)  n?(n + 1)° 


™=—GR + ~qoar? 7 OC”), ) 
which easily inverts to 
n(n + 1) T - 
Furthermore, 
ao ayts2 RS 2\ 1/2 
A(n) = Le sAR~ G72} =F Li(t~ G/2R))) 
j= j= 
n 3 “5 
— R y _ a o| = 
2 24 8 R R 
y) 2 
_ ue 1) 7 = n ue 1) + O(1) 
n?(n + 1) 2ar* - 
= eae — 312 + O(n )} (11) 


On the other hand, it is immediate that 


2 


aT 
1 - 3n2 + O(n~*) 3 (12) 


n(nt+1)° aw n(n +1)’ 
= ——_——— cot — = ——___— 
16 n 167 


from which the desired result follows. 

For (b) we prove a stronger result, that A(q) = lim, _,,, A(qg,n) and B(q) = 
lim,,_,. B(g,) each exist. Indeed, B(q,n) is the area of a regular n-gon with 
perimeter (1 — g”)/(1 — q), so B(q) is the area of the circle with circumference 
(1 — q)~', namely (477)~'(1 — q)~?. We shall prove the existence of the first limit 
by using the monotone convergence theorem. 

Suppose P is the maximal (convex) n-gon with sides /, > --- > J, and suppose 
I< 2l,. We claim that the maximal area of the (7 + 1)-gon with sides /,,...,1,, 
/ is greater than the area of P. Indeed, given P, simply attach an isosceles triangle 
with sides /,, /,, and / onto the side of P with length /,. This gives a particular 
(n + 1)-gon with sides /,,...,/,, | with area greater than the area of P; the 
maximal area is even larger. (That some restriction on / is needed for monotonicity 
can be seen by examining the quadrilateral with sides 1, 1, 1, and 2.99.) 

Since gq” <q"! <2q"~', it follows that A(qg,n) < A(g,n + 1). Since the 
sequence {A(q,n)} is uniformly bounded by B(q), we see that lim, _,., A(q, 1) 
exists. 

If D%-} sin” '(q’) < 7/2, then we are in the second case. However, if q is 
sufficiently close to 1, this will not happen (in fact, 

06 og 
y sin! q! > > 
j=1 


for g > .602527...). We now write g = 1 —« where « is sufficiently small. We 
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already know that B(qg) = (47e7)~'. To find A(q) we proceed as before. The 
radius R,, = R(q,n) of the n-gon with sides 1, g,...,q”' is determined by 


n—1l 
r= ¥ sin-'(q//(2R,)) (13) 
j=0 
and (cf. (11)) 
. 1/2 
n—1 R . q?/ 
A(q,n) = Salt -~—_, (14) 
J=0 2 (2R,,)° 
Since everything converges uniformly, define R by 
00 gi 
T= sin{ (15) 
j=0 2R 
so that 
1/2 
R 2 . q?l 
A = — Jt. — ; 16 


The previous estimate of sin~! a, together with (15), shows that R has order 
(1 — gq)! =«7!. Again, by Taylor series estimates, 


1 
—_—__ + —_——_ 
2Re 144R%¢ 
Again this may be inverted, in this case to 


+ O(e°). (17) 


T= 


1 E 
R=—- +—+ O(e7). (18) 
TE 


3j 


A(q) 


——_ ee 1H 0 2 
Ze W6Re(l+qtqey 


: ae O 19 
= — + e 
Ade? 36 (¢) ( ) 


Finally we have 


A(q) 7? 
c(q) = —— =1- —e* + O(e?), 20 
(4) = Fey = 1 Se? + O08) (20) 
so the desired limit is 77/9. 


Editorial comment. Both Newcomb and the proposer also proved that 
_ AGt+Ss) 2 
lim ——=—_ = — 


5-0 868 v7 


Solved also by the proposer. 
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when s #5, for s in a reduced residue system modulo c. Thus, in view of the 
multiplicativity of y, we have 


det M = | 1" «| — 189-7? det _A?, 
k=1 


where N = N. is the number of solutions of x* = 1 (mod c). Now let 6, = —1lifc 
has a primitive root (i.e., if c is either 4, a power of an odd prime, or twice a power 
of an odd prime), and let 6, = 1 otherwise. Then by Gauss’s generalization of 
Wilson’s theorem (Theorem 11.6 or Theorem 12.17 of [2]) we have 


Cc 


[lk =6.. 


k=1 
Also (—1)*/* = 6, (Theorem 11.5 of [2]). Thus 


det M = 6.x(6,)(—1)°”? det A2. 


Accordingly, in order to establish the results claimed above, it suffices to prove the 
following result. 


LEMMA. If c is a squarefree number exceeding 2, then 
(c)/2 _ 
det A2 = (-1) "°°? T] p28 /”) (*) 
ple 
where the product is taken over the primes p dividing c. If c is a positive integer 


divisible by the square of some prime, then det A2 = 0. 


To prove the lemma we note that 


Cc 
(A) = Loe = Ci +k), 
t=1 
which is Ramanujan’s sum and has the evaluation 
C(itk)= YL du(c/d). 
dlc, j+k) 
(Cf. [1] p. 160.) 

Consider first the case when c is squarefree. We proceed by induction on the 
number of prime factors of c. The case when c has just one odd prime factor was 
covered in the solution of part (a). We suppose c = pm, where p is a prime, 
ptm, and m > 2. 

If p = 2, i.e., if c = 2m, with m odd, then $(c) = d(m) and a reduced residue 
system modulo c is also a reduced residue system modulo m. By the multiplicativ- 
ity of Ramanujan’s sum ((1], p. 163) we have 


(AZ) ie = Com(d +k) = CF + K)C, CG +) = (-1)"C,.C +). 
Since j and k are both odd 
(Az) ix =C,(J +k) = (A*,) ik: 
Thus if (*) holds for m, it also holds for c. 


Suppose p is an odd prime, c = pm, p + m, and m > 2. Let S be a reduced 
residue system modulo c. Then ({1]), p. 125) S can be partitioned into p — 1 
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distinct sets S,,...,5,_,, where the elements of S; are congruent to 1 modulo p 
and form a reduced residue system modulo m. We arrange the sets in the order 
S,,..., 8, -;; within each set, we put the elements in increasing order (from 1 to 
m — 1) of congruence class modulo m. With this ordering the elements of S may 
be labeled a,,43,..., Ay). If dj, is the j,k entry of AZ, we have 


; 

dip = Yo eit”) = C.(a,; + ay). 

t=1 

From the multiplicativity of Ramanujan’s sum we have 

dj, = C,(a; + a,) °C,,(4a; + a,). 
Since 

C,(4; + a,) = -1 if a; + a, #0 (mod p) 

and 


Ci(a;+4,) =p-—1 ifa,;+a,=0(mod p), 


we find that A has the form 


A, A a (p ~ 1) 4%, 
Ai, AR (p= DAY, -Ai, 
“Ai, (p= 1A =A -Ay, 

(P14, AR =A ~Ai, 


Performing the same operations as in the solution of (a) except that we now apply 
them to square blocks of size @(m) instead of to numbers, we find that A? has the 
same determinant as 


0 0 vas 0 Az 
0 0 “++ pA? 0 
0 par, -:- 0 0 
pA? 0 vie 0 0 


Now the number of rows of this matrix is (p — 1)d(m) = 0 (mod 4) and clearly 
(p — 1)¢(m)/2 row exchanges bring it to the block-diagonal form 


pA2 O -. 0 0 
0 par +++ 0) 0 
0 0 «+ pA 0 


0 0 aan 0 A? 
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Hence det A2 = p’?~ (det A2,)?~!, so that, if (*) holds for m, we have 


p-l 
det A2 = por 2nen| _ 1) 0/2 [aor 
q\|m 


_ »(p-2 ~2 
— p”? MC/PIT |g IP(C/d) 
q|m 
where qg runs over the prime factors of m. Thus our induction is complete in the 


case c = pm, p odd, p|m, m > 2, and so (*) is proved. 
Next, assume that c = p”, p prime, n > 1. Then 


0 if(c,j +k) <p"! 
C(jtk) = ( —p"! if(c,j +k) =p"! 
p" — pp"! if(c,j +k) =p”. 
In this case the sum of the elements in each row of A? is zero and so det A? = 0. 
Finally, assume that c = p"m, p prime, n > 1, p + m. Since 


C(j +k) =C (ji +k): Ci +), 


we see again that the sum of the elements in each row of A® is zero and 
so det Az = 0. 


REFERENCES 


1. Tom Apostol, Introduction to Analytic Number Theory, Springer-Verlag, 1976. 
2. Oystein Ore, Number Theory and Its History, McGraw-Hill, 1948. 


Solved also by the proposer. 


Homeomorphisms of Compact Metric Spaces 


6612 [1989, 846]. Proposed by Ebrahim Salehi, University of Nevada, Las Vegas. 
Suppose X is a compact metric space with metric d and suppose T: X — X is 
continuous. If 


inf d(T"x,T”y) > 0 
neN 
for each pair x, y of distinct elements of X, prove that T is onto. 


Solution by Eugene Curtin, Southwest Texas State University, San Marcos, TX. 
We establish that T is a homeomorphism of X onto X without using the 
assumption that T is continuous. Let T° denote the identity map on X and define 
d': X X X — Rby 

d'(x,y) = inf d(T"x,T"y). 
NU{0} 
The hypothesis on 7 shows that d'(x, y) = 0 if and only if x = y. The other metric 
Space axioms are trivial to check on (X,d’). For each x,y in X we have 
d(x, y) > d'(x, y), so the identity map T°: CX, d) — CX, da’) is continuous. Since a 
continuous biection f: X — Y of a compact metric space X onto a metric space Y 
is a homeomorphism (see [1, p. 98]), CX, d') is topologically equivalent to CX, d); in 
particular, it is compact. Note that d'(Tx, Ty) > d'(x, y) for all x, y in X. Hence it 
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REFERENCE 
1. E. T. Copson, Metric Spaces, Cambridge University Press, Cambridge, 1968. 


Solved also by J. M. Borwein & A. C. Thompson (Canada), Nathan S. Feldman, Michael Fichter 
(Germany), Ilias Kastanas, Daniel J. Opitz, Harold Steinmacker (Austria) and the proposer. One 
incomplete solution also was received. 


A Problem Related to Engel’s 1 + ab Problem 


6619 [1990, 75]. Proposed by Arthur L. Holshouser, Charlotte, NC, and 
Benjamin G. Klein, Davidson College, NC. 


Suppose that N = 4” (8k + 7) for non-negative integers h and k. 
(a) Show that there exist infinitely many distinct 5-tuples (a,b,c, d,e) of 
positive integers such that 


N = (a* + b* +c* + d* + e*)/(abcde + 1). (*) 
(b) Show that there is no 4-tuple (a, b,c, d) of positive integers such that 
N = (a? +b? +c’? + d*)/(abcd + 1). (* *) 


Solution by R. J. Chapman, University of Exeter, U.K. It is well known that such 
integers N are exactly the nonnegative integers not expressible as sums of three 
integral squares, and that every nonnegative integer is the sum of four integral 
squares (Lagrange’s theorem). We shall also use the trivial fact that each such WN is 
at least 7. 

For (a), suppose that (a,b,c,d,e) is a solution of the given Diophantine 
equation (*) withO <a <b<c<d«<eand b> 0. We claim there is an integer 
a’ > a such that (a’, b,c, d, e) is also a solution. Clearly 


a’ — (Nbcde)a + (b* +c? +d*+e*—N) =0 
is a monic quadratic equation in a whose other root 
a' = Nbcde — a 
is an integer such that (a’, b,c, d, e) is also a solution to (*). If a’ < a, then 


2a > Nbcde > 1b > 2b > 2a, 
a contradiction, so a' > a. Now by Lagrange’s theorem 


N = (0° + b§ +. c6 + dj + e5)/(1 + 0+ bycgdyey), 


where 0 < by < Cy < dy < ey. Hence our procedure has a starting point. It next 
yields a solution (ay, bp, Cy, do, 9) in positive integers. By rearranging and repeat- 
ing the procedure we obtain an infinite family of solutions, sincea +b+ct+d+t+e 
increases at each stage. 

For (b) let (a, b,c, d) be a solution of the given Diophantine equation (* *) in 
positive integers with a <b <c<d. We claim there exists an integer d' with 
0 <d' <d such that (a,b,c, d') is also a solution. This would yield a contradic- 
tion, since by rearranging and repeating we must eventually get d' = 0, 1.e., that N 
is the sum of three squares. Let d’ = Nabc — d. By an argument similar to that of 
(a) we find that (a, b,c, d') also solves (* *). Since 


N(1 + abcd') =a? +b* +c* +d”, 
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A Problem Related to Engel’s 1 + ab Problem 


6619 [1990, 75]. Proposed by Arthur L. Holshouser, Charlotte, NC, and 
Benjamin G. Klein, Davidson College, NC. 


Suppose that N = 4” (8k + 7) for non-negative integers h and k. 
(a) Show that there exist infinitely many distinct 5-tuples (a, b,c,d,e) of 
positive integers such that 


N = (a° + b*? +c* + d* + e*)/(abcde + 1). (*) 
(b) Show that there is no 4-tuple (a, b,c, d) of positive integers such that 
N = (a? +b? +c? + d*)/(abcd + 1). (* *) 


Solution by R. J. Chapman, University of Exeter, U.K. It is well known that such 
integers N are exactly the nonnegative integers not expressible as sums of three 
integral squares, and that every nonnegative integer is the sum of four integral 
squares (Lagrange’s theorem). We shall also use the trivial fact that each such N is 
at least 7. 

For (a), suppose that (a,b,c,d,e) is a solution of the given Diophantine 
equation (*) withhO <a <b<c<d<eand b> 0. We claim there is an integer 
a’ > a such that (a’, b,c, d, e) is also a solution. Clearly 


a’ — (Nbcde)a + (b* +c* +d*+e*—N) =0 
is a monic quadratic equation in a whose other root 
a' = Nbcde — a 
is an integer such that (a’, b,c, d, e) is also a solution to (*). If a’ < a, then 
2a > Nbcde >'1b > 2b > 2a, 
a contradiction, so a’ > a. Now by Lagrange’s theorem 


N = (07 + bp + cf + dj + ef)/(1 + 0° bocodye), 


where 0 < by < Cy < dy < €g. Hence our procedure has a starting point. It next 
yields a solution (dy, by, Cy, dy, Ey) in positive integers. By rearranging and repeat- 
ing the procedure we obtain an infinite family of solutions, sincea +b+ct+dt+e 
increases at each stage. 

For (b) let (a, b,c, d) be a solution of the given Diophantine equation (* *) in 
positive integers with a <b <c<d. We claim there exists an integer d' with 
0 <d' <d such that (a,b,c, d’) is also a solution. This would yield a contradic- 
tion, since by rearranging and repeating we must eventually get d' = 0,1.e., that N 
is the sum of three squares. Let d’ = Nabc — d. By an argument similar to that of 
(a) we find that (a, b,c, d') also solves (* *). Since 


N(1 + abcd') = a* + b* +c? +d”, 
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d' must be nonnegative. If d' > d, then 2d < Nabc or 
2d(1 + abcd) < abc(a* + b? +c? +d’), 
so that 
0 < 2d <abc(a* +b? +c* —d?*). 


From a <b <c we deduce d? < 3c? and 
N + Nabc* < N(1 + abcd) = a? + b*? +c* +d? < 6c’. 
But N > 7, so this is false, Hence d' < d and the result follows. 


Editorial comment. Both R. J. Chapman and Christos Athanasiadis remarked 
on the similarity of this problem to the celebrated “1 + ab problem” of Arthur 
Engel (from the 29th International Mathematical Olympiad). Chapman remarks 
that the above method of proof yields the following result. If a,,...,a, are 
non-negative integers and 

a2 4+ +a? 
N — eee 
1+a,a,°-"'a 


r 


is an integer, then N is the sum of (r — 1) squares of integers. Engel’s problem is 
the case r = 2 and part (b) of the given problem is the case r = 4. 


Solved also by Christos Athanasiadis (student) and the proposers. 


Subsets of the Plane Intersecting Every Straight Line in Exactly Two Points 


6624 [1990, 162]. Proposed by K. A. Ross, University of Oregon, Eugene. 


(i) Show that there is a subset A of the plane R’ that intersects every straight 
line in exactly two points. 

(ii) Can the set A be selected to be Lebesgue measurable? 

(iii)* Can the set A be selected to be a Borel set? 


Partial solution by Robert High, New York City and Fred Galvin, University of 
Kansas. For (i) let « be the least ordinal of cardinality equal to the continuum (we 
do not require the continuum hypothesis). Let the set of all lines in the plane be 
placed in a 1-1 correspondence with the set of ordinals less than x. Now let S$ 
begin as the empty set. For each line /, with A <x we will adjoin at most two 
points to S so that at each stage the cardinality of S (namely, the cardinality of A) 
will be strictly less than «. Consider all lines determined by points already in S. 
Their number is at most the square of the cardinality of A, and this is strictly less 
than the cardinality of «. If /, is among the lines determined by points already in 
S, then no points are added to S for /,. Otherwise, the set G, of points on /, but 
not on any line determined by points already in S is infinite (as the family of such 
lines has cardinality strictly less than x, and each meets /, in at most one point). 
Adjoin one or two points from G, to S, as required to ensure that S has exactly 
two points of /,. Note, by definition of G,, that this action does not add to S any 
points on any previously considered line Le with B < A. Analogously, no future 
action will add points on /, to S. Continue this process transfinitely: the resulting 
S is the desired set A. 
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For (ii), use the same construction restricted to a measure zero subset of the 
plane that meets every line in continuum many points. For example, use (R xX C) 
U(C X R) where C is the usual Cantor set of measure zero. Vertical lines 
intersect R X C in continuum many points. All the other lines intersect C X R in 
continuum many points. Construct A as a subset of this set; this ensures that it, 
too, has Lebesgue measure zero. 


Editorial comment. In his paper “Infinite combinatorics and definability,” An- 
nals of Pure and Applied Logic 41 (1989), 179-203, Arnold W. Miller points out (p. 
193) that (i) is a theorem of S. Mazurkiewicz, C. R. Sc. et Lettres de Varsovie 7 
(1914), 322-383, that (iii) is a recognized open problem, but that in D. G. Larman, 
“A problem of incidence,” J. London Math. Soc. 43 (1968), 407-409, it is shown 
that no “two-point set” can be an F. set. Miller (p. 195) also mentions the 
unpublished result of D. Mauldin that tWo-point sets are totally disconnected. The 
editors thank Fred Galvin for bringing this reference to their attention. 


Solved also by the proposer. 


REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


Beyond the Third Dimension. By Thomas F. Banchoff, Scientific American Library, 
New York, 1990, xii + 210 pp. 


H. S. M. CoxETER 
Department of Mathematics, University of Toronto, Toronto, Ontario, Canada M5S 1A1 


This beautiful book gently introduces the reader to the idea of many dimen- 
sions. Its abundant pictures include portraits of Albert Einstein and Marston 
Morse, Salvador Dali and the author, Immanuel Kant, and K. F. Gauss. On page 2 
there is a facsimile of the title page of the first edition of Flatland [i], “Price 
half-a-crown,” which includes 


O day and night, but this is wondrous strange 
And therefore as a stranger give it welcome. 


Many of the illustrations are elegantly coloured. Those on page 89 are reminis- 
cent of Oliver Byrne [2] which was dedicated “‘to the right honourable the Earl of 
Fitzwilliam, etc. etc. etc.” In contrast, the author reminds the reader (on page 102) 
that Schlafli [6] published ‘a long work on higher-dimensional geometry which 
contained not one picture.’’ Such lack of human consideration was repeated by 
von Staudt [7; 8]. 

Page 78 includes a photograph showing the distribution of orifices on a pollen 
grain, which explains why the problem of packing disks (or “‘caps’”’) on a sphere is 
commonly ascribed to P. M. L. Tammes, a botanist. 

Page 86 introduces the “Greek Geometry Game” by a coloured drawing of Max 
Bill’s polygonal spiral, which consists of m — 1 sides, all having the same length, of 
each regular n-gon for n = 3,4,5,6,7,8. The Greeks used their favourite instru- 
ments, the ruler and compasses, to construct all these polygons except the 
heptagon. The impossibility of so constructing the regular heptagon was finally 
established by Wantzel [9]. 

Among the most striking objects ““beyond the third dimension” are the 4-dimen- 
sional analogues of the five Platonic solids. These are the six convex regular 
polytopes, which the author begins to describe in Chapter 5, paying special 
attention to the 600-cell, whose six hundred facets or cells are regular tetrahedra 
[3, pp. 247-281; 4, p. 42]. The figure on page 100 is a projection into a plane, of 
600-cell’s 120 vertices, 360 of its 720 edges, 120 of its 1200 faces, and 36 of its 72 
“equatorial” decagons (analogous to the three “equatorial” squares in the regular 
octahedron). Those 120 faces appear undistorted as equilateral triangles, of various 
sizes because of different foreshortening [4, p. 48]. The accompanying figure shows 
two antipodal triangles and the three decagons that contain their sides. 

The decagons appear in two shapes, typical of 15 + 15 of the 36. The remaining 
6 appear as undistorted regular decagons. In fact, the author’s figure can be 
derived by copying the reviewer’s figure and rotating it about the center through 
all multiples of 12°. (in this process, some edges will be repeated.) 
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The 600-cell’s remaining 360 edges, which the author omitted, form another set 
of 36 equatorial decagons, quite different because they form no equilateral 
triangles. He returns to the subject of regular polytopes in Chapter 6, where he 
reproduces “‘perspective” views of the 5-cell, 16-cell and 24-cell from Hilbert and 
Cohn-Vossen [5, pp. 150-152], pointing out that these views are analogous to 
Schlegel diagrams. 

Being an expert on computer graphics, the author makes striking use of that 
technique in showing how solids and hypersolids change their apparent shape 
when they are rotated. 

This colorful book, with its three pages of ‘Further readings” and useful index, 
will be appreciated by the thousands of people who are curious about the concept 
of hyperspaces, and by the still greater number of people who enjoy beautiful 
pictures. 
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Global Differential Geometry. Editor, S. S. Chern. MAA Studies in Mathematics, 
Volume 27, The Mathematical Association of America, Washington, DC, 1989, 
li + 254 pp. 
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Department of Mathematics, Brown University, Providence, RI 02912 


This book is an excellent collection of articles for teachers and students who are 
looking for a supplement to a standard geometry undergraduate curriculum, or 
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This book is an excellent collection of articles for teachers and students who are 
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who want to get an idea about the key problems that are informing this important 
subject. Twenty years ago, the first edition of this work appeared, including earlier 
versions of many of the same articles. The best way to appreciate this book is to 
read the introduction to the first edition, which was inexplicably left out of the 
second. Many books do print both versions, with the first giving an overview of the 
Original project and the second keying into the new developments in the field, and 
that approach would clear up the question about why certain articles have been 
included. The original title of the MAA Studies volume was Studies in Global 
Geometry and Analysis and the first entry was Marston Morse’s short expository 
article ““What Is Analysis in the Large?” This was followed by Shiing-Shen Chern’s 
Chauvenet prize-winning essay “Curves and Surfaces in Euclidean Space,” and 
together these articles introduced the two notions of analysis and classical differen- 
tial geometry which came together to form global differential geometry. The new 
title Global Differential Geometry reflects a shift in emphasis and Morse’s essay 
does not serve any longer as the introduction to the subject. Without some 
explanation in the introduction, it seems totally out of place near the end of this 
new volume. 

Beginning readers should probably read the articles in this book in a different 
order from their sequence in the table of contents. The best beginning is still the 
article by Chern on curves and surfaces. It can then be followed by the article by 
Harley Flanders on Differential Forms, which was very good in the first edition 
and it is even better now, representing a short version of his soon-to-be-reprinted 
book on the same subject. Only then should the reader proceed to the article 
which is now the lead, “Vector Bundles with a Connection.” Without a good 
understanding of the classical differential geometry of curves and surfaces, a 
reader will have difficulty following the first article as it moves rather quickly into a 
fairly abstract framework. Within a few pages, the reader is confronted with “the 
vector space of sections of the tensor product of a vector bundle with the 
cotangent space of a manifold.” The notions of “tensor product” and “cotangent 
space” are not defined in the article. Readers not familiar with these concepts 
from a course in algebra or advanced calculus are likely to be lost at this point. Of 
course it is impossible to include all relevant background for a vast subject in a 
short survey article, but terminology can stand in the way of the author’s stated 
aim of showing that “a connection on a vector bundle is basically a very elementary 
concept.” 

The original volume was somewhat uneven, and that problem remains in the 
new one. The article of Lamberto Cesari on “Surface Area”’ is still quite technical 
(due to the technical nature of the subject) and the article by L. A. Santal6 on 
“Integral Geometry” is still mysterious (due to the mysterious nature of the 
subject). The text of neither article has been changed in the last twenty years, 
although some recent references have been appended in each case. The same is 
true for the article by Shoshichi Kobayashi “On Conjugate and Cut Loci.” 

Three new articles are especially welcome, presenting an exposition of recent 
work in several different areas: Robert Osserman’s essay “Minimal Surfaces in 
R°?,” Hermann Karcher’s article “Riemannian Comparison Theorems,” and Patrick 
Eberlein’s “‘Manifolds of Nonpositive Curvature.” The first of these uses only 
notions of advanced calculus and a first course in complex variables, so it should 
be accessible to advanced undergraduates. The second and third bring in numer- 
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ous concepts from modern differential geometry, and both make an attempt to be 
self-contained, so they represent a good introduction into the formalism as well as 
the concepts of global differential geometry. 

All in all, this volume, like its predecessor, serves a very important purpose, that 
of introducing a broad range of readers to the methods and scope of global 
differential geometry, a field that will continue to grow in significance. What, we 
may wonder, will the next edition of this book look like twenty years from now? 


TELEGRAPHIC REVIEWS 


Edited by Lynn Arthur Steen, with the assistance of the Mathematics Departments of Carleton, Macalester, 
and St. Olaf Colleges. Books and software submitted for review should be sent to Reviews Editor, American 
Mathematical Monthly, St. Olaf College, Northfield, Minnesota 55057. 


Telegraphic Reviews are designed to alert readers in a timely manner to new books and 
computer software appropriate to mathematics teaching and research. Special codes classify 


reviews by subject area and appropriate use: 


T: Textbook 
C: Computer Software 


P: Professional Reading 
L: Undergraduate Library 
S: Supplementary Reading 13: Grade Level 


1-4: Semesters 
**: Special Emphasis 
??: Questionable 


Readers are advised that price information is subject to change, that computer software is 
often available also on other machines, and that hardware variations often cause software 
incompatibilities. Selected books and software packages receive a second, more extensive 


review in the MonrTHLY. 


General, L. CRC Handbook of Mathematical Curves 
and Surfaces. David H. von Seggern. CRC Pr, 1990, 
286 pp, $49.95. [ISBN: 0-8493-0155-6] Pictures 
of families of functions on right-hand pages, fac- 
ing displays of formulas with relevant parameters: 
algebraic curves, transcendental functions, orthog- 
onal and non-orthogonal polynomials, special func- 
tions (e.g., Legendre, Bessel), probability densities, 
catastrophe curves, spirals and helices, algebraic and 
transcendental surfaces, and nondifferentiable curves 
and surfaces (including polygons and polyhedra). A 
handy reference for those without a computer. LAS 


General, S*(15-18), P*, L*. Problems in Ap- 
plied Mathematics: Selections from SIAM Review. 
Ed: Murray S. Klamkin. SIAM, 1990, xxv + 588 
pp, $36.50 (P). [ISBN: 0-89871-259-9] A compila- 
tion of 380 of SIAM Review’s most interesting prob- 
lems dating back to the journal’s inception in 1959. 
This outstanding collection of instructive problems 
is organized into twenty-two broad sections such as 
probability, combinatorics, series, special functions, 
and so forth. Most of the problems include moti- 
vational material describing how the problem arose; 
the solutions, some of which approach the status of 
short papers, are accessible to a wide audience (say, 
at the level of the American Mathematical Monthly). 
Solutions generally include references to further lit- 
erature and pertinent comments by Klamkin. There 
is something in this volume for everyone, and is a 
welcome addition to the problems literature. LCL 


General, $(13-14), L**. Conquering Mathemat- 
ics: From Arithmetic to Calculus. Lloyd Motz, Jef- 
ferson Hane Weaver. Plenum Pr, 1991, xiv + 305 
pp, $23.50. [ISBN: 0-306-43768-6] An informal dis- 
cursive survey of mathematical basics—what many 
people should have learned in school but didn’t— 
beginning with a thorough discussion of the whys 
and wherefores of arithmetic. Focuses throughout, 
and especially at the end in an epilogue on the rela- 
tions between mathematics and physics, on the “pre- 
dictive power” of mathematics. Written for literate 
readers who have forgotten (or never learned) basic 
mathematics but can nevertheless comprehend logi- 


cally and linguistically sophisticated exposition. LAS 


General, S, L*. Beyond Numeracy: Ruminations 
of a Numbers Man. John Allen Paulos. Alfred A. 
Knopf, 1991, xiii + 285 pp, $22. [ISBN: 0-394- 
58640-9] Cultural literacy for mathematics: sev- 
enty brief topical essays, arranged alphabetically, 
covering mathematical vocabulary from algebra to 
Zeno including expected topics such as binary num- 
bers, chaos, differential equations, prime numbers, 
statistics, and topology as well as unexpected di- 
versions (ethics, human consciousness, humor, even 
“Oulipo”). Ends with a chronological list of the “top 
forty” mathematicians from antiquity to the present, 
each with brief identifiers, and an afterword reflection 
on the fact that only White males made the list. LAS 


General, S? Only Problems, Not Solutions! Floren- 
tin Smarandache (POB 42561, Phoenix, AZ 85080), 
1991, 37 pp, (P). (ISBN: 1-879585-00-6] Thirty-five 
easily stated, but probably difficult, unsolved prob- 
lems involving integers, sequences, equations, config- 
urations. Many are generalizations of known results 
(references included). Virtually no discussion. LCL 


General, P. Winning Women Into Mathematics. 
Eds: Patricia Clark Kenschaft, Sandra Keith. MAA, 
1991, ix + 78 pp, $11 (P). [ISBN: 0-88385-453-8] A 
very varied collection of materials gathered by the 
MAA Committee on the Participation of Women 
that documents both the increasing role of women in 
mathematics as well as the continuing impediments 
they face. Chronicles committee activities (including 
“micro-inequity” skits they have arranged at meet- 
ings); highlights (with many photographs) numer- 
ous “firsts” for women in mathematics; lists cultural 
impediments that women face in mathematical ca- 
reers (e.g., “women may also be more reluctant to 
write in a style that seems to embody conceit” ); sug- 
gests actions; and describes numerous intervention 
projects. LAS 


General, S**, C**, P*. Theorist. (Version 1.11). 
Allan Bonadio. Prescience (939 Howard St., No. 306, 
San Francisco, CA 94103), 1991, $399.95. Learning 
Guide, iv + 242 pp, (P); Reference Manual, xiii + 


672 


1991] 


360 pp, (P). A powerful yet intuitive Macintosh pack- 
age for symbolic manipulation and visualization that 
provides a handy electronic blackboard for anyone 
using mathematics. Symbol palette eases equation 
entry. Menu options control manipulations (factor, 
substitute, transform simplify, expand, etc.), while 
window buttons control graphing (zooming, resolu- 
tion, resizing, perspective, algebraic details). Anima- 
tion option creates and saves frames in PICS format 
and then redisplays them in rapid sequence. Em- 
ploys notebooks to store equations, graphs, and ma- 
nipulations; comments containing text or graphics 
can be added with button-controlled display. Results 
can be saved, edited, and printed in standard Mac- 
intosh style. Greater ease, less power than larger 
symbolic algebra systems; excellent documentation 
in two parts—a tutorial guide to graphing, algebra, 
and calculus, and a reference manual with careful 
explanations of all options. LAS 


General, S*, P, L**. Fractals, Chaos, Power 
Laws: Minutes from an Infinite Paradise. Man- 
fred Schroeder. WH Freeman, 1990, xviii + 429 
pp, $32.95. [ISBN: 0-7167-2136-8] An inviting ex- 
position of the worldly and mathematical manifes- 
tations of self-similarity: music and “pink noise;” 
roulette and pinball machines; quasiperiodic mat- 
ter and renormalization; and, of course, fractals in 
all their well-known forms. Illustrates with numer- 
ous examples the fascinating interplay between the 
mathematical and physical consequences of changes 
in scale and self-similarity. Clearly written, using 
formulas and graphs effectively for a literate but not 
highly scientific audience. LAS 


General, P. Mathematical Sciences, Technology, 
and Economic Competitiveness. Ed: James G. 
Glimm. National Academy Pr, 1991, x + 114 pp, 
(P). [ISBN: 0-309-04483-9] An important innova- 
tive study of the role of mathematics, especially of 
simulation and modelling, in U.S. industries, from 
economic planning through product design, manu- 
facturing, and quality control to inventories, mar- 
keting, and maintenance. Examples drawn from di- 
verse industries illustrate much useful mathematics: 
statistical quality improvement, discrete and combi- 
natorial optimization, simulation, and probabilistic 
models. LAS 


General, S, P. Environmental Risk Reporting: The 
Science and the Coverage. Eds: Sharon M. Fried- 
man, Carol L. Rogers. Lehigh Univ (Dept. of Jour- 
nalism, Univ. Center 29, Bethlehem, PA 18015), 
1991, 48 pp, (P). Brief papers from a day-long AAAS 
symposium on the journalistic challenges of cover- 
ing environmental risks: how to explain and quantify 
risks in a way that is meaningful and helpful to the 
public. Little explicit mathematics, but much valu- 
able insight into how ordinary people (journalists and 
their audiences) (mis)interpret simple statements of 
magnitude, rates, and probabilities. LAS 


General, S*, L*. Flatland: A Romance of Many 
Dimensions. Edwin Abbott Abbott. Princeton Univ 
Pr, 1991, 103 pp, $7.95 (P). [ISBN: 0-691-02525- 
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8] A new edition of a classic social satire, at least 
the dozenth that has appeared since the original was 
published in 1884. Tom Banchoff’s introduction re- 
veals diverse issues from Abbott’s agenda that are 
still of interest today ranging from parochial per- 
spectives (e.g., treatment of women) to visualization 
through slicing (e.g., exploratory data analysis, CAT 
scanners). A timely republication well-suited to to- 
day’s computer-graphics generation. LAS 


Elementary, S, L. The Method of Coordinates. 
I.M. Gel’fand, E.G. Glagoleva, A.A. Kirillov. 
Birkhauser Boston, 1990, x + 73 pp, $14.50 (P). 
[ISBN: 0-8176-3533-5] Second volume in the newly 
re-published series of booklets from Moscow State 
University’s correspondence school for rural sec- 
ondary school pupils. Introduces Cartesian coor- 
dinates in 2, 3, and 4 dimensions, with tangen- 
tial discussions of other systems. First English edi- 
tion published in 1967 by MIT Press (TR, October 
1967). LAS 


Mathematics Appreciation, T(13: 1), S. Mathe- 
matics and Global Survival, Second Edition. Richard 
H. Schwartz. Ginn Pr, 1990, xvi + 197 pp, (P). 
(ISBN: 0-536-57540-1] Reviews a wide variety of 
elementary mathematical topics (e.g., percents, ra- 
tios and proportions, graphical methods, arithmetic 
and geometric sequences, probability, and statistics) 
using “real-world” data on population growth, the 
arms race, energy utilization, waste management, 
and other issues. Includes a bibliography, a list of 
possible term paper topics, and solutions to many of 
the exercises. AO 


Mathematics Appreciation, T(13-14: 1), S. 
Universal Patterns, Book One, Revised Edition. 
Martha Boles, Rochelle Newman. Pythagorean Pr, 
1990, xxi + 266 pp, $29.95 (P). [ISBN: 0-9614504- 
3-6] Main thrust is to show relationships among 
things which appear disconnected. Focuses on na- 
ture, mathematics, and art. Mathematics is used to 
define abstract relationships found in nature while 
art can create relationships in nature. Both art 
and mathematics can interpret nature and nature 
can suggest directions in both mathematics and art. 
A treasure of geometric constructions fills the text 
with accompanying discussion of related natural and 
artistic phenomena. Chapters: basic constructions, 
unique relationships, special triangles, dynamic rect- 
angles, Fibonacci numbers, spirals. Lots of prob- 
lems and suggested projects. Good references, ap- 
pendices; answers to selected problems; glossary; in- 
dex. RJA 


Mathematics Appreciation, S. Lecture Notes in 
Mathematics-1458: Global Solution Branches of Two 
Point Boundary Value Problems. Renate Schaaf. 
Springer-Verlag, 1990, xix + 141 pp, $20 (P). [ISBN: 
0-387-53514-4] Introduces the parameterized equa- 
tions for the two point boundary value problems, and 
the time map T of the equations; derives a time map 
formula for branches bifurcation which overcomes the 
singularity; covers applications and results, includ- 
ing Newmann problems and Dirichlet’s problem; also 
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studies the stability of the time map and some com- 
puter plots. BL 


Finite Mathematics, T(13). Finite Mathemat- 
ics. William B. Owen, Wm. Frederick Cutlip. Har- 
court Brace Jovanovich, 1991, xlix + 678 pp, $35 net. 
[ISBN: 0-15-527546-1] For an applications-oriented 
course for students of business, the life sciences, 
and the social sciences, assuming one year of high 
school algebra. Covers linear mathematics, prob- 
ability, statistics, and mathematics of finance with 
MINITAB examples. JPH 


Education, P, L*. What Works: Building Natural 
Science Communities. Project Kaleidoscope (ICO, 
1730 Rhode Island Ave., NW, Suite 1205, Washing- 
ton, DC 20036), 1991, xviii + 100 pp, (P). An analy- 
sis of productive undergraduate programs in science 
and mathematics, stressing mechanisms that make 
learning experiential, personal, and rooted in “natu- 
ral science communities.” Includes chapters on sci- 
ence, mathematics, human resources, faculty, and 
facilities; texts of colloquium presentations by na- 
tional policy leaders; and vignettes of illustrative pro- 
grams. LAS 


Education, P, L*. Math Panic. Laurie Buxton. 
Heinemann, 1991, xv + 260 pp, (P). [ISBN: 0-435- 
08313-9] A startling account of anxiety and despair 
recalled by British adults as they reflected on their 
struggles with mathematics, both in school and at 
work. Author Buxton builds from these experiences 
a very believable analysis of how reason is frequently 
blocked by emotion, leading to practical suggestions 
for mathematics teachers at all levels. Revised Amer- 
ican edition of a book originally published in England 
in 1981. LAS ) 


Education, P, L. The State of Mathematics 
Achievement. Ina V.S. Mullis, et al. Educational 
Testing Service, 1991, 532 pp, (P). [ISBN: 0-88685- 
15-7] Detailed report of the 1990 national assess- 
ment of mathematics at grades 4, 8, and 12, including 
trial state-wide data at eighth grade. Findings gen- 
erally confirm earlier studies: only 5% of high school 
seniors understand geometry and algebra; fewer than 
half have consistent command of decimals, percents, 
and fractions. At the eighth grade level, states with 
fewer urban areas, fewer disadvantaged students and 
fewer minority students performed above average, as 
did students who watched less television, read more 
at home, and attended school regularly. Policy fac- 
tors such as dollars per student, required courses, 
teachers’ salaries, and length of school year appear 
to be uncorrelated with performance. LAS 


Education, P*, L. Mathematical Modeling in the 
Secondary School Curriculum: A Resource Guide of 
Classroom Ezercises. Eds: Frank Swetz, J.S. Hart- 
zler. NCTM, 1991, viii+ 136 pp, $9.50 (P). [ISBN: 0- 
87353-306-2] Presents twenty-two successful class- 
room modeling exercises developed and class-tested 
under the auspices of the Mathematical Sciences Pro- 
gram of the Pennsylvania State University at Harris- 
burg. The mathematics required ranges from ele- 
mentary through precalculus. Also includes teaching 
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hints and additional information (e.g., a bibliogra- 
phy) for secondary school instructors interested in 
teaching mathematical modeling. AO 


History, S*, P, L*. Ethnomathematics: A Multi- 
cultural View of Mathematical Ideas. Marcia Ascher. 
Brooks/Cole, 1991, ix + 203 pp, $38.95. [ISBN: 0- 
534-14880-8] An innovative study of mathematical 
ideas (e.g., numbers, space, logic, patterns, chance) 
embedded in the cultural traditions of “small scale” 
societies (Carolina Islands, Navajo, Maori, Warlpiri, 
...). Intended to demonstrate that “mathematics” 
is a distinctly Western construct, but that “math- 
ematical ideas” are universal and culturally depen- 
dent. Thoroughly researched and well-documented; 
concluding chapter on implications treads on contro- 
versial issues of values, education, and purpose. LAS 


History, P*, L**. Augustin-Louis Cauchy: A Bi- 
ography. Bruno Belhoste. Transl: Frank Ragland. 
Springer-Verlag, 1991, xii + 380 pp, $79. [ISBN: 
0-387-97220-X] A fascinating account of the profes- 
sional career of Cauchy, an arrogant, argumentative, 
rigid, pro-cleric royalist whose immense mathemati- 
cal contributions occurred in the midst of successive 
revolutions and counter-revolutions in nineteenth- 
century France. Cauchy’s political views and per- 
sonality led to his repeated rejection for academic 
positions; his reputation as a terrible teacher caused 
repeated conflicts with university authorities; yet 
his drive for rigorous truth made him the greatest 
French mathematician of his generation. Carefully 
researched with extensive notes pointing to original 
manuscripts, this biography offers an effective bal- 
ance of historical and mathematical detail, including 
a valuable chapter that explores the philosophical 
rationale of Cauchy’s uncompromising approach to 
politics, religion, and mathematics. LAS 


History, L*. Rabdology. John Napier. Transl: 
William Frank Richardson. Charles Babbage Insti- 
tute Reprint Ser. for History of Computing, V. 15. 
MIT Pr, 1990, xxxvii + 135 pp, $40. [ISBN: 0-262- 
14046-2] The original user manual: the first com- 
plete English translation of Napier’s 1617 Latin Rab- 
dologia, a guide to the use of “Napier’s Bones,” a 
“high-speed promptuary for multiplication.” From 
the Dedicatory Epistle: “To perform calculations is 
a difficult and lengthy process, the tedium of which 
deters many from the study of mathematics.” In ad- 
dition to lengthy but very clear instructions for mul- 
tiplication, division, and extraction of roots, Napier’s 
brief treatise contains the first written reference to 
the decimal point, and a remarkable geometric al- 
gorithm for “location arithmetic on a chessboard” 
based on binary numbers developed “as more of a 
lark than a labor.” LAS 


History, T(14-17), S, L. A History of Mathe- 
matics, Second Edition. Carl B. Boyer, Uta C. 
Merzbach. Wiley, 1991, xx + 715 pp, $24.95 (P). 
(ISBN: 0-471-54397-7] A text essentially unchanged 
from the 1989 revised edition (TR, October 1989). 
That edition, however, was designed for classroom 
use. In this edition the exercises have been dropped 
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with the ‘outside-the-lecture-room’ reader in mind. 
(1968 edition, TR, January 1969; 1985 Princeton 
Univ. Pr. edition, TR, February 1986.) KS 


History, S(13-18), P, L. The Development of 
Newtonian Calculus in Britain, 1700-1800. Niccold 
Guicciardini. Cambridge Univ Pr, 1989, xii + 228 
pp. [ISBN: 0-521-36466-3] Devoted to the “calcu- 
lus of fluxions,” this book argues that Britain did 
not experience the post-Newton mathematical de- 
cline that it is reputed to have suffered. Examines 
research and teaching of this calculus during 18th 
century; includes sociological discussions. Considers 
importance of work of, among others, Newton, Cotes, 
Taylor, Stirling, Maclaurin, Simpson. KS 


History, L**. Women in Science: Antiquity 
through the Nineteenth Century: A Biographical Dic- 
tionary with Annotated Bibliography. Marilyn Bailey 
Ogilvie. MIT Pr, 1988, xiii + 254 pp, $12.50 (P). 
[ISBN: 0-262-15031-X] The introduction is a histori- 
cal essay about science and women. Each biographi- 
cal sketch includes lists of major work by the subject 
herself and works about her. A good source book for 
courses on women and science. MLR 


Logic, P. Classification Theory and the Number of 
Non-Isomorphic Models, Second Revised Edition. S. 
Shelah. Stud. in Logic & Found. of Math., V. 92. 
North-Holland (US Distr: Elsevier Science), 1990, 
xxxiv + 705 pp, $242.75. [ISBN: 0-444-70260-1] 
This edition includes corrections, list of open prob- 
lems, and four new chapters covering developments 
in classification theory since the First Edition (TR, 
August-September 1979). Note price! KES 


Graph Theory, P. Random Graphs ’87. Eds: 
Michal Karonski, Jerzy Jaworski, Andrzej Rucirski. 
Wiley, 1990, ix + 368 pp, $84.95. [ISBN: 0-471- 
92749-X] Twenty-four expository and research pa- 
pers from the Third International Seminar on Ran- 
dom Graphs and Probabilistic Methods in Combina- 
torics held at Poznan, 1987, plus open problems. JPH 


Graph Theory, P. Topics on Domination. S.T. 
Hedetniemi, R.C. Laskar. Annals of Disc. Math., 
V. 48. North-Holland (US Distr: Elsevier Science), 
1991, 280 pp, $143. [ISBN: 0-444-89006-8] A com- 
prehensive collection on the theory of dominating 
sets in graphs. Twenty-one papers on the standard 
domination number, on new variations, and on algo- 
rithmic topics, plus a bibliography. JPH 
Combinatorics, S(18), P. Submodular Functions 
and Optimization. Satoru Fujishige. Annals of Disc. 
Math., V. 47. North-Holland (US Distr: Elsevier 
Science), 1991, ix + 270 pp, $97.25. [ISBN: 0-444- 
88556-0] Exposition of submodular function theory 
and its constructive aspects. Treats submodular sys- 
tems, base polyhedra, network flow problems, the 
intersection problem, connections between submod- 
ular functions and convex analysis, nonlinear opti- 
mization problems. AD 


Discrete Mathematics, T(14: 1). Discrete Math- 
ematics: An Introduction to Concepts, Methods, and 
Applications. Jerrold W. Grossman. Macmillan, 
1990, xxvi + 911 pp, $36.75. [ISBN: 0-02-348331-8] 
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Treats elementary logic, set theory, functions and re- 
lations, algorithms, induction and recursion, combi- 
natorics. More thorough treatment of graph theory 
than books at a comparable level. AD 


Number Theory, P. Diophantine Equations and 
Inequalities in Algebraic Number Fields. Wang Yuan. 
Springer-Verlag, 1991, xvi + 168 pp, $69. [ISBN: 
0-387-52019-8] A concise up-to-date survey of the 
field; topics include the circle method, exponential 
sums, Weyl’s sums, singular series and integrals, 
Waring’s problem, and inequalities for forms. SG 


Group Theory, S(18), P. Topics in Representa- 
tton Theory. Ed: A.A. Kirillov. Adv. in Soviet Math., 
V. 2. AMS, 1991, xi + 247 pp, $83. [ISBN: 0- 
8218-4101-7] An out-growth of the continuing sem- 
inar on Representation Theory at Moscow Univer- 
sity: seven major papers on various topics by G.I. 
Olshankii, Y.A. Neretin, R.S. Ismagilov, A.I. Molev, 
O.D. Ovsienko, V. Yu. Ovsienko, and D.V. Yur’ev. 
Introduction; references. JS 


Group Theory, P. Arboreal Group Theory. Ed: 
Roger C. Alperin. Math. Sci. Res. Instit. Public., 
V. 19. Springer-Verlag, 1991, x + 368 pp, $45. 
[ISBN: 0-387-97518-7] Proceedings of a conference 
held at MSRI in 1988. Focuses on groups acting on 
trees and applications to hyperbolic geometry. AD 


Algebra, S(18), P. Quadratic and Hermitian 
Forms over Rings. Max-Albert Knus. Grund. der 
Math. Wissenschaften, B. 294. Springer-Verlag, 
1991, xi + 524 pp, $98. [ISBN: 0-387-52117-8] Uti- 
lizing recent advances in techniques, particularly al- 
gebraic topology and K-theory, this book “exposes 
the theory of quadratic and Hermitian forms in a very 
general setting.” Topics include Azumaya algebras, 
Clifford algebras, stability and cancellation, polyno- 
mial rings, and Witt groups. Extensive bibliography, 
index. JS 


Algebra, T(15-17: 1, 2), L. Abstract Algebra with 
a Concrete Introduction. John A. Beachy, William 
D. Blair. Prentice Hall, 1990, xxiv + 325 pp. [ISBN: 
0-13-004425-3] To ease transition into abstraction, 
the text begins with leisurely chapters on number 
theory and function (including permutations) before 
defining groups in Chapter 3. Sophistication level 
rises with chapters on fields, group structure theory, 
and Galois theory. Unusual effort is made to help 
the student understand the nature of algebra and 
the importance and technique of proof. Exercises 
(no solutions), bibliography, index. JS 


Calculus, S, C. GyroGraphics. (Version 3.0). 
Nancy Anne Johnson. CipherSystems (171 Willow 
Drive, Stillwater, OK 74075; 405-377-4432), 1990, 
$75. An effective graphing tool for IBM-PC compat- 
ible computers that makes it easy to create two- and 
three-dimensional graphs in a number of forms (ro- 
tated, polar, cylindrical, spherical, parametric, etc.); 
to rotate them stepwise or continuously; to zoom 
and unzoom; and to examine vector fields as well as 
tangent and acceleration vectors in easily sequenced 
“layers.” Good selection of examples in an opening 
menu; straightforward options to input user-defined 
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functions. Requires graphics board; performance im- 
proves significantly with greater machine power. LAS 


Calculus, T(13: 1, 2). Single-Variable Calculus 
With Discrete Mathematics. John Feroe, Charles 
Steinhorn. Harcourt Brace Jovanovich, 1991, xii + 
1021 pp, $38 net. (ISBN: 0-15-581062-6] An evo- 
lutionary rather than revolutionary calculus reform 
textbook. Ideas from discrete mathematics are pre- 
sented near the beginning and used throughout the 
text (e.g., limits are introduced via sequences). The 
last four chapters treat topics that explore relation- 
ships between discrete and continuous mathematics: 
generating functions, probability, and difference and 
differential equations. AO 


Real Analysis, T(15-16), S, L. Standard and 
Nonstandard Analysis: Fundamental Theory, Tech- 
niques and Applications. R.F. Hoskins. Math. & 
Its Applic. Ellis Horwood (US Distr: Prentice Hall), 
1990, 269 pp, $65.95. [ISBN: 0-13-840844-0] The 
first book to introduce the basic ideas of nonstandard 
analysis along-side standard topics in analysis. Thus, 
the book presents a standard introduction to analy- 
sis (number systems, convergence, functions, conti- 
nuity, differentiability, integration), but at the same 
time introduces nonstandard concepts at appropriate 
junctures. Little more than a working knowledge of 
elementary calculus is required, and the text makes 
no direct use of any explicit form of the “transfer 
principle,” which is the key to the application of non- 
standard analysis in its full sense. The book could 
be used as a standard beginning analysis textbook, 
with the use of infinitesimals and other nonstandard 
ideas taken at face value simply as an aid to compre- 
hension. LCL 


Complex Analysis, T(17: 3), L. Complez Vari- 
ables for Scientists and Engineers, Second Edition. 
John D. Paliouras, Douglas S. Meadows. Macmil- 
lan, 1990, xiii + 586 pp. [ISBN: 0-02-390561-1] 
Straightforward, minimal introduction to basics of 
complex analysis followed by applications to bound- 
ary value problems, aerodynamics, and Schwarz- 
Christoffel integral. Maximum modulus theorem and 
Liouville’s theorem covered in final chapter on theo- 
retical results. Good selection of problems of varying 
difficulty. Solid, but rather dry presentation. BH 


Differential Equations, P. Boundary Value Prob- 
lems for Operator Differential Equations. V.I. Gor- 
bachuk, M.L. Gorbachuk. Math. & Its Applic., V. 
48. Kluwer Academic, 1991, xi + 347 pp, $149. 
[ISBN: 0-7923-0381-4] A study of equations of the 
form y(t) + A(t)y(t) = 0 (te[a,b]), where A(t) is 
a semi-bounded, self-adjoint operator on a separa- 
ble Hilbert space. Contains recent development of 
spectral analysis of the Sturm-Liouville operator dif- 
ferential equation (with an infinite-dimensional space 
and unbounded potential A(t)). MLR 


Differential Equations, P. Mathematical and Nu- 
merical Aspects of Wave Propagation Phenomena. 
Eds: Gary Cohen, Laurence Halpern, Patrick Joly. 
SIAM, 1991, xiv + 794 pp, $68 (P). [ISBN: 0-89871- 
276-9] Proceedings of the first international confer- 
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ence on mathematical and numerical aspects of wave 
propagation phenomena held in Strasbourg, France, 
April 1991. MLR 


Differential Equations, T(14-15: 1-3), L. Dif 
ferential Equations, Theory and Applications. Ray 
Redheffer, Dan Port. Jones & Bartlett, 1991, xiv + 
722 pp, $48.75. [ISBN: 0-86720-200-9] A straight- 
forward, well-written text with a variety of inter- 
esting applications. Topics range from the usual 
elementary differential equations to the more ad- 


vanced Sturm-Liouville theory and nonlinear prob- 
lems. MLR 


Numerical Analysis, P. The Numerical Solution 
of Two-Point Boundary Problems in Ordinary Dzf- 
ferential Equations. L. Fox. Dover, 1990, xi + 371 
pp, $9.95 (P). [ISBN: 0-486-66495-3] Reprint of the 
1957 edition first published by Clarendon Press. De- 
tailed coverage of numerical methods for boundary 
value and eigenvalue problems in ordinary differential 
equations of order up to four, including appropriate 
treatment of different boundary conditions. Includes 
also a chapter on the solution of linear systems cor- 
responding to above methods, and a discussion con- 
cerning the difference between the methods for the 
initial value problems and the boundary value prob- 
lems. LS 


Numerical Analysis, T(13: 3). Numerical Math- 
ematics. Gunther Hammerlin, Karl-Heinz Hoff- 
mann. Transl: Larry Schumaker. Undergrad. Texts 
in Math. Springer-Verlag, 1991, xi + 422 pp, $39.95 
(P). [ISBN: 0-387-97494-6] Covers the usual clas- 
sical topics of numerical analysis, but also includes 
recent developments such as up-to-date treatments 
of splines, multidimensional analysis, linear opti- 
mization methods, and Monte-Carlo methods. Em- 
phasizes numerical insights as well as mathematical 
vigor. LS 


Operator Theory, T(18), P. Operator Theory in 
Function Spaces. Kehe Zhu. Pure & Appl. Math., 
V. 139. Marcel Dekker, 1990, ix + 258 pp. [ISBN: 
0-8247-8411-1] Toeplitz, Hankel, and composition 
operators on Bergman and Hardy spaces of the unit 
disk. Main emphasis is on boundedness, compact- 
ness, and membership in the Schatten classes. Min- 
imal prerequisites. MLR 


Functional Analysis, P. Geometry of Banach 
Spaces, Duality Mappings and Nonlinear Problems. 
Ioana Cioranescu. Math. & Its Applic., V. 62. 
Kluwer Academic, 1990, xiv + 260 pp, $99. [ISBN: 0- 
7923-0910-3] A duality-map approach to nonlinear 
operators related to nonlinear functional and evolu- 
tion equations. Develops the necessary convex anal- 
ysis and Banach-space geometry. MLR 


Analysis, P. Variations on a Theme by Kepler. Vic- 
tor Guillemin, Shlomo Sternberg. Colloq. Public., 
V. 42. AMS, 1990, vii + 88 pp, $43. [ISBN: 0-8218- 
1042-1] The authors “touch on the mysterious role 
that groups, especially Lie groups, play in reveal- 
ing the laws of nature.” Kepler didn’t know it, but 
his second law (equal areas in equal times) is due to 
the orthogonal group O(3), while the first and third 
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stem from O(4). For that matter, “even the symplec- 
tic groups in eight- and twelve-dimensional space get 
involved.” BC 

Analysis, T(18), S, P, L. Analysis IV: Linear and 
Boundary Integral Equations. Eds: V.G. Maz’ya, 
S.M. Nikol’skii. Encyclop. of Math. Sci., V. 27. 
Springer-Verlag, 1991, 233 pp, $59. [ISBN: 0-387- 
51997-1] First part contains survey of current state 
of theory of linear integral equations, presenting nec- 
essary abstract operator theory before discussing, in 
detail, Fredholm operators. Latter sections inves- 
tigate one- and higher-dimensional singular opera- 
tors. Second part concentrates on theoretical as- 
pects of boundary integral equation methods, giv- 
ing survey of classical theory of harmonic potentials. 
Includes applications to problems of mathematical 
physics. KS 

Analysis, T(18), P. Lecture Notes in Mathematics- 
1461: Abel Integral Equations. Rudolf Gorenflo, Ser- 
gio Vessella. Springer-Verlag, 1991, vii + 215 pp, 
$25 (P). [ISBN: 0-387-53668-X] Elementary theory 
and many applications are discussed in the first 
part. Harder topics, such as the behavior of Abel 
integral operators in function spaces, are also pre- 
sented. MLR 


Differential Geometry, S(18), P. Lie Groups I- 
IT. Mikio Ise, Masaru Takeuchi. Transl. of Math. 
Mono., V. 85. AMS, 1991, iv + 259 pp, $89. [ISBN: 
0-8218-4544-6] ‘Translations of lecture notes. Part 
I, originally done by Ise, is a fast-paced introduc- 
tion to Lie groups and algebras, concluding with 
classification and structure theory. Part II, by 
Takeuchi, discusses Riemannian and Hermitian sym- 
metric spaces, leading to the Cartan classification 
theory. References, problems for Part II. JS 


Differential Geometry, P. Symplectic Geometry, 
Groupoids, and Integrable Systems. Eds: P. Da- 
zord, A. Weinstein. Math. Sci. Res. Instit. Pub- 
lic., V. 20. Springer-Verlag, 1991, xi + 311 pp, 
$48. [ISBN: 0-387-97526-8] Contains twenty papers 
from the Séminaire Sud Rhodanien de Géométrie 
organized at MSRI in 1989. Eight papers are in 
French. OJ 


Differential Geometry, P. Theory of Singulari- 
ties and its Applications. Ed: V.I. Arnold. Adv. 
in Soviet Math., V. 1. AMS, 1990, ix + 333 pp, 
$125. [ISBN: 0-8218-4100-9] Contains seventeen 
papers by participants of the singularities seminar 
at Moscow University. Translated from Russian. OJ 


Geometry, P. Geometry’s Future. Ed: Joseph 
Malkevitch. COMAP, 1991, ix + 105 pp, (P). Pa- 
pers and recommendations from a small workshop 
convened by COMAP to explore new approaches 
to geometry: visualization, robotics, convexity, “ge- 
ometry and the imagination,” volumes, high school 
teachers. Authors T. Banchoff, D. Crowe, V. Klee, 
J. Malkevitch, W. Meyer, M. Senechal, and W. 
Thurston plan an innovative text as a sequel. LAS 


Geometry, S, C. Poincare: An Ezcurston into Hy- 
perbolic Geometry, User’s Manual. (Version 3.01). 
George D. Parker. 1990, 18 pp, $50 site license; 
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$20 individual. A menu-driven program for IBM- 
PC compatible computers that enable students to 
explore hyperbolic geometry. Primary data (points, 
lines, and constants) can be entered either directly or 
indirectly (via computation or by graphing); menu 
options use these data to graph and calculate re- 
lations involving lines, triangles, angles, perpendic- 
ulars, polygons, pencils, cycles, congruences, and 
more. Easy to use; includes documentation file. Re- 
quires graphics board; operates in either color or 
monochrome; alternate version (on source disk) for 
machines with math coprocessor. LAS 


Algebraic Topology, T(17: 2), P. A Basic Course 
in Algebraic Topology. William S. Massey. Grad. 
Texts in Math, V. 127. Springer-Verlag, 1991, xvi 
+ 428 pp, $49.95. [ISBN: 0-387-97430-X] Com- 
bines most of two previous books into one volume. 
The first five chapters—from Algebraic Topology: An 
Introduction—cover the classification of surfaces, the 
fundamental group, and covering spaces (First Edi- 
tion, TR, November 1967; Second Edition, TR, April 
1978). The next ten chapters—from Singular Homol- 
ogy Theory—cover the standard topics in homology 
and cohomology using cubes rather than simplices 
to define singular homology (TR, June-July 1981). 
Geometric motivation of ideas, historical comments, 
applications to geometric/topological problems, and 
engaging exposition make this an attractive text. Ex- 
ercises. OJ 


Topology, S(17). General Topology I: Basic Con- 
cepts and Constructions, Dimension Theory. Eds: 
A.V. Arkhangelskii, L.S. Pontryagin. Encyclop. of 
Math. Sci., V. 17. Springer-Verlag, 1990, 202 pp, 
$59. [ISBN: 0-387-18178-4] Translated from the 
Russian. First half gives a densely written, rather 
dry introduction to the basics of topology which as- 
sumes a rather high level of mathematical sophistica- 
tion on the part of the reader. Second half presents 
some of the more important concepts in dimension 
theory, and includes a nice historical introduction. 
No exercises. BH 


Dynamical Systems, S, P, L. Fractals: Non- 
Integral Dimensions and Applications. Ed: G. Cher- 
bit. Transl: F. Jellett. Wiley, 1991, xv + 249 
pp, $69.95. [ISBN: 0-471-92798-8] Translation of 
the 1987 Masson (Paris) Fractals: Dimensions non- 
entieres et applications containing papers from the 
1985-86 Hausdorff Seminar sponsored by the bio- 
physics research group of the University of Paris. Pa- 
pers are generally expository, covering a wide range 
of related topics (stochastic processes, dimension, 
biometry, entropy, etc.). Includes a reflective per- 
sonal commentary by Mandelbrot, and a brief pref- 
ace by J.-P. Kahane. LAS 


Control Theory, P. Lecture Notes in Control and 
Information Sciences-150: Robust Stabilization in 
the Gap-topology. L.C.G.J.M. Habets. Springer- 
Verlag, 1991, vii + 126 pp, $25 (P). [ISBN: 0- 
387-53466-0] Theoretical treatment of stabilization, 
with algorithms that make it practical. BC 


Stochastic Processes, P. Ezcessive Measures. 
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R.K. Getoor. Prob. & Its Applic. Birkhauser Boston, 
1990, ix + 189 pp, $39. [ISBN: 0-8176-3492-4] The 
author presents a unified treatment of recent results 


on the potential theory of excessive measures. Highly 
technical, but timely. TAV 


Statistics, S(17-18), L. The Engineering Statisti- 
cian’s Guide to Continuous Bivariate Distributions. 
T.P. Hutchinson, C.D. Lai. Rumsby Scientific, 1991, 
xxii + 346 pp, $44 (P). [ISBN: 0-646-02413-2] This 
reference guide lists bivariate distributions, their 
properties, and how they have been used as models in 
the statistical and engineering literature. Marginals 
are generally transformed into some convenient stan- 
dard form such as the uniform, normal, or exponen- 
tial before specifically bivariate aspects are consid- 
ered. Nearly fifty pages of references, most of these 
to literature published in the last few years. RWJ 


Statistics, L, P*. Spatial Statistics and Digital 
Image Analysis. National Research Council. Na- 
tional Academy Pr, 1991, ix + 234 pp, (P). [ISBN: 0- 
309-04376-X] Ten expository essays introducing the 
growing field of spatial statistics—the study of spa- 
tially referenced data. Primary applications are to 
image analysis (e.g., in satellite reconnaissance of 
missle silos or wheat crops, in tomography, in atmo- 
spheric and oceanographic studies), with spill-over 
to ecology, stereology, speech recognition, and other 
types of pattern analysis. Many applications are 
driven by computation (to find patterns in data) and 
applied to environmental issues. A timely invitation 
to a field of rapidly increasing importance. LAS 


Statistics, P. Sample Size Methodology. M.M. 
Desu, D. Raghavarao. Stat. Modeling & Decision 
Sci. Academic Pr, 1990, xii + 135 pp, $32.50. [ISBN: 
0-12-212165-1] Summarizes sample sizes needed to 
meet error probability, power, and variance con- 
straints for various estimation and hypothesis testing 
problems. Includes ranking and selection problems, 
design of clinical trials, and sampling from finite pop- 
ulations. Relevant tables referenced in Appendix. 
Contains some problems. RWJ 


Statistics, S**, C**, P*. MacSpin: 93-Dimen- 
stonal Data Analysis Software. Abacus Concepts 
(1984 Bonita Ave., Berkeley, CA 94704; 415-540- 
1949), 1989, 207 pp, (P). Version 3.0 Release Notes, 
1991, ii + 52 pp, (P); $295; site licenses avail- 
able. A powerful Macintosh software package for 
exploring multidimensional data sets by displaying 
rotating “clouds” of three-dimensional data, with a 
fourth dimension represented by slices controlled by 
a slider that can be moved continuously for anima- 
tion. Standard Macintosh interface tools provide 
options for entering data (including cut-and-paste 
from spreadsheets and other sources), for transform- 
ing variables, and for printing various versions of la- 
belled graphs. Pop-up windows give statistical sum- 
maries of variables. Version 3.0 also includes stan- 
dard tools for two-dimensional charts interfaced to 
the MacSpin data sets; “overlays” to crop, move, re- 
size, and add captions to graphs produced by Mac- 
Spin; variable scaling; and a MacPaint-style graphic 
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tool palette. LAS 

Computer Literacy, S*, P*, L*. Writing Space: 
The Computer, Hypertezt, and the History of Writ- 
ing. Jay David Bolter. Lawrence Erlbaum As- 
soc, 1991, xiii + 258 pp, $22.50 (P); $45. [ISBN: 
0-8058-0428-5; 0-8058-0427-7] A compelling, schol- 
arly apologia for the new paradigm of electronic (hy- 
pertext) literacy where writing is “open to perfor- 
mance by the reader.” “Writing itself is not merely 
influenced by technology, but rather zs technology.” 
Classicist Bolter analyzes today’s revolution in text 
in terms of the whole history of writing, looking at 
visual writing space, conceptual writing space, and 
the mind as a writing space. Touches on a wide range 
of users from deconstructionism to cultural literacy, 
from Homeric oral traditions to artificial intelligence. 
A superb reading for introductory courses in com- 
puter literacy. LAS 


Elementary Computer Science, P, L. The Age 
of Intelligent Machines. Raymond Kurzweil. MIT 
Pr, 1990, xiii + 565 pp, $39.95. [ISBN: 0-262-11121- 
7| <A grand tour of the past, present, and future of 
computers and computer science. Filled with pho- 
tographs, graphs, and illustrations. Accessible to 
non-specialists, but interesting to specialists. In- 
cludes articles by Minsky, Dennett, Turkle, Hofs- 
tadter, Papert, and more. Topics range from philo- 
sophical roots to mechanical roots, from artificial in- 
telligence to the science of art. MK 


Languages, S, P, L*. Commentary on Standard 
ML. Robin Milner, Mads Tofte. MIT Pr, 1991, 160 
pp, $19.95 (P). [ISBN: 0-262-63137-7] A compan- 
ion volume to Definition of Standard ML by Milner, 
Tofte, and Harper. Gives an in-depth explanation 
of the semantic theory of MZ. A notable feature is 
the inclusion of information about why certain design 
decisions were made. AO 


Languages, T(16-17: 1), L*. Introduction to the 
Theory of Programming Languages. Bertrand Meyer. 
Intern. Ser. in Comput. Sci. Prentice Hall, 1990, xvi 
+ 447 pp. [ISBN: 0-13-498510-9] A modern, formal 
treatment of mathematical models of programming 
languages. The topics covered include syntax, es- 
pecially abstract syntax, denotational and axiomatic 
semantics, and the complementarity of these two ap- 
proaches to programming language semantics. AO 
Algorithms, T(16-17: 1, 2), L*. Algorithms from 
P to NP, Volume 1: Design & Efficiency. B.M.E. 
Moret, H.D. Shapiro. Benjamin/Cummings, 1991, 
xv + 576 pp, $38.75. [ISBN: 0-8053-8008-6] Dis- 
cusses algorithm design techniques and illustrates 
these techniques by applying them to the design of 
efficient algorithms for some well-known problems in 
combinatorial optimization. Algorithms are given 
with proofs of correctness, an analysis of their com- 
plexity, and a discussion of implementation details 
(often including an empirical study of competing de- 
signs). AO 

Computer Systems, 1T(13). 
Designing with the 68000 Family. 
Prentice Hall, 1991, xiii + 641 pp. 


Programming and 
Tibet Mimar. 
[ISBN: 0-13- 
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731498-1] Intended to provide a thorough introduc- 
tion to the hardware and software architecture of the 
Motorola 68000 family of processors, including the 
68010, 68020, and 68030. It covers the 68000 se- 
ries architecture, instruction set, interrupt handling, 
memory system, and coprocessor design. GMS 


Computer Systems, S*, C, P, L*. LEazplor- 
ing Mathematics with Mathematica: Dialogs Con- 
cerning Computers and Mathematics. Theodore W. 
Gray, Jerry Glynn. Addison-Wesley, 1991, 535 pp. 
[ISBN: 0-201-52809-6] An eclectic, chatty tour of 
the computer algebra system Mathematica show- 
ing off its power with applications both traditional 
(e.g., iteration, fractals, graphics, Fourier trans- 
forms) and bizarre (mathematical transformations 
of Beethoven’s Ninth Symphony). Text is written 
as a glib rhetorical dialogue; topics illustrate mu- 
sic, graphics, and school mathematics. The entire 
text was prepared as a Mathematica Notebook—text, 
sound, graphics—and is provided electronically on a 
CD-Rom inside the back cover, thus enabling Math- 
ematica users to interact directly with the examples 
by running and modifying them. LAS 


Computer Systems, T(15-16), P, L. An Intro- 
duction to Text Processing. Peter D. Smith. MIT 
Pr, 1990, xii + 300 pp, $32.50. [ISBN: 0-262-19299-3] 
A survey of past and present practice in computer ap- 
proaches to handling natural language text: storage, 
editors, compression, encryption, formatting, concor- 
dances, hyphenation, spelling checkers, syntax anal- 
ysis, authorship studies, translation. Major exam- 
ples come from UNIX tools; many references to other 
packages. Exercises and projects make the book suit- 
able as a text. LAS 


Computer Systems, P. Application Environment 
Specification (AES) User Environment Volume, Re- 
vision B. Open Software Foundation. Prentice Hall, 
1991, xii + 1131 pp. [ISBN: 0-13-043530-9] A de- 
tailed specification of the OSF’s Application Envi- 
ronment Specification (AES). The bulk of this doc- 
ument is made up of the manual pages for all of the 
AES User Environment interfaces. AO 


Computer Systems, S*, P. UNIX System V.4 
Command Summary. SSC, 1990, 80 pp, $8 (P). 
[ISBN: 0-916151-44-1] A concise envelope-size pam- 
phlet with summaries of syntax and flags for most 
UNIX commands that are accessible to ordinary 
users. Includes expanded sections on use of tools 
such as ftp, sdb, sed, and telnet. LAS 


Computer Graphics, S*, P, L**. The Second 
Computer Revolution: Visualization. Richard Mark 
Friedhoff, William Benzon. WH Freeman, 1989, 215 
pp, (P). [ISBN: 0-7167-2231-3] An extraordinary 
exposition of vision and images, profusely illustrated 
in full color, beginning with Edwin Land’s revolu- 
tionary “retinex” theory of color vision and proceed- 
ings to the most recent computer-created artificial 
realities. Covers image enhancement, object-based 
images, procedural (including fractal) methods, ani- 
mation, CAD systems, and much more. Each topic is 
explained clearly in simple concise terms, and illus- 
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trated with apt visual images. A remarkably inter- 
disciplinary endeavor, both well-written and vividly 
illustrated. LAS 

Computer Science, T(16-17: 1), S, P, L. Ge- 
netic Algorithms in Search, Optimization, and Ma- 
chine Learning. David E. Goldberg. Addison- 
Wesley, 1989, xiii + 412 pp. [ISBN: 0-201-15767-5] 
A user-friendly introduction to genetic algorithms: 
computer search methods based on principles of nat- 
ural selection and genetics. Includes problems and 


computer assignments. Suitable as a text or for self- 
study. BC 


Computer Science, T(16-18: 1), P, L**. Math- 
ematics for the Analysis of Algorithms, Third Edi- 
tton. Daniel H. Greene, Donald E. Knuth. Progress 
in Comput. Sci. & Appl. Logic, V. 1. Birkhauser 
Boston, 1990, viii + 132 pp, $24.95. [ISBN: 0-8176- 
3515-7] This edition includes some improvements 
in the exposition, minor corrections, and new ap- 
pendices including examination problems (and so- 
lutions) from 1982 and 1988 (First Edition, TR, 
August-September 1982). Based on materials de- 
veloped for an advanced computer science course at 
Stanford University. Presumes that the reader is 
familiar with the fundamentals of complex variable 
theory and combinatorial analysis. AO 


Computer Science, S(15-16), P*, L*. ACM 
Turing Award Lectures: The First Twenty Years, 
1966-1985. ACM. Anthology Ser. Addison-Wesley, 
1987, xviii + 483 pp, (P). [ISBN: 0-201-54885-2] 
These lectures by the first 23 winners of the Turing 
Award give a wide-ranging perspective on the past, 
present, and possible future of computer science. A 
valuable set of essays. BC 


Computer Science, S, P, L*. Technology 2001: 
The Future of Computing and Communications. Ed: 
Derek Leebaert. MIT Pr, 1991, xvi + 392 pp, $29.95. 
[ISBN: 0-262-12150-6] Fourteen essays by leaders 
and visionaries in computer R&D on the next ten 
years of computing: microprocessors, supercomput- 
ing, networks, electronic documents, telecommunica- 
tions, imaging, public policy—all replete with impli- 
cations for education. A thought-provoking glimpse 
into the world today’s students will inherit. LAS 
Applications, T(15: 2), P, L. Applied Analysis 
by the Hilbert Space Method: An Introduction with 
Applications to the Wave, Heat, and Schrodinger 
Equations. Samuel S. Holland, Jr. Pure & Appl. 
Math., V. 137. Marcel Dekker, 1990, xii + 560 pp, 
$55. [ISBN: 0-8247-8259-3] Well-written text for a 
one-year undergraduate applied mathematics course. 
Prerequisites are calculus (three semesters), linear al- 
gebra, and basic physics. Includes necessary sections 
on differential equations. Gets to the applications 
quickly. MLR 


Applications, P*. Walsh Series and Transforms: 
Theory and Applications. B. Golubov, A. Efimov, 
V. Skvortsov. Math. & Its Applic., V. 64. Kluwer 
Academic, 1991, xiii + 368 pp, $169. [ISBN: 0-7923- 
1100-0] Translated from a 1987 Russian work. An 
account of the theoretical foundations of Walsh series 
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and transforms. For engineers, technical specialists, 
and applied mathematicians. Applications to digital 
information processing, digital filtering, and digital 
holograms. Appendices with background informa- 
tion on group theory, measure theory, the Lebesgue 
integral, and functional analysis. Brief commentaries 
and historical references for each chapter. Extensive 
list of references. JDEK 


Applications (Behavioral Science), S, L*. Ne- 
gotiation Games: Applying Game Theory to Bar- 
gaining and Arbitration. Steven J. Brams. Rout- 
ledge Chapman and Hall, 1990, xviii + 297 pp, 
$15.95 (P); $49.50. [ISBN: 0-415-90338-6; 0-415- 
90337-8] A model-based (not a mathematics-based) 
survey of negotiations (e.g., Cain vs. God, Sadat vs. 
Begin, Kennedy vs. Khruschev), rooted in game the- 
ory, but extended to common systems of bargaining 
and arbitration. Chief message: the impediment to 
resolving conflicts is often rationality, not irrational- 
ity. LAS 


Applications (Biological Science), T(15-17), S, 
P, L*. The Geometry of Biological Time. Arthur T. 
Winfree. Study Edition. Springer-Verlag, 1990, xiv 
+ 530 pp, $32 (P). [ISBN: 0-387-52528-9] Corrected 
paperback reprint of the 1980 original (hardcover) 
edition (TR, December 1980). Diverse biological ap- 
plications of periodic dynamics: circadian cycles, en- 
ergy metabolism, Mitotic cycles, waves, growth, and 
much more in 23 chapters that employ a wide range 
of methods from undergraduate mathematics. Mas- 
sive bibliography (only through 1978) documents the 
roots of this very rapidly growing field. LAS 


Applications (Physical Science), S(17-18), P. 
Fluz Coordinates and Magnetic Field Structure: A 
Guide to a Fundamental Tool of Plasma Theory. 
W.D. D’haeseleer, et al. Ser. in Computat. Physics. 
Springer-Verlag, 1991, xii + 241 pp, $98. [ISBN: 
0-387-52419-3] For non-specialists as well as estab- 
lished researchers in the fields of plasmas and con- 
trolled fusion. Systematic and rigorous treatment 
of the mathematical framework of magnetic-flux- 
coordinate systems (in which magnetic field lines ap- 
pear as straight lines). Focus is on the geometrical 


properties of vectors and tensors in curvilinear coor- 
dinates. JDEK 


Applications (Physical Science), P. Spectral 
Theory of Families of Self-Adjoint Operators. Y.S. 
Samoilenko. Math. & Its Applic., V. 57. Kluwer 
Academic, 1991, xvi + 293 pp, $129. [ISBN: 0-7923- 
0703-8] Studies the mathematical models of physi- 
cal systems; presents the connected theory of count- 
able families of self-adjoint operators on a Hilbert 
space and their spectral properties; deals with count- 
able collections of self-adjoint operators and their 
joint spectral properties; illustrates connection with 
Lie groups. LS 


Applications (Physical Science), P. Continuum 
Models and Discrete Systems. Ed: G.A. Maugin. 
Interaction of Mech. & Math. Ser. Longman Sci- 
entific & Tech (US Distr: Wiley), 1990, $150 each, 
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(ISBN: 0-470-21592-5]. Volume 1, 324 pp; Volume 
2, 324 pp. Texts from most of the short papers pre- 
sented at an international symposium (CMDS6) held 
in June 1989 at the University of Bourgogne in Dijon, 
France. Papers in each volume are grouped in four 
parts: complex fluids; deformable solids; microstruc- 
ture, thermodynamics, and geometry; and nonlinear 
excitations. LAS 


Applications (Physical Science), P. Dynamics of 
Polyatomic Van der Waals Complezes. Eds: Nadine 
Halberstadt, Kenneth C. Janda. NATO ASI Ser. B., 
V. 227. Plenum Pr, 1990, ix + 542 pp, $120. [ISBN: 
0-306-43612-4|] Thirty-nine papers from a NATO 
workshop on theoretical and experimental studies of 
weakly bound molecular complexes. Focuses on ex- 
tending current knowledge of triatomic systems to 
polyatomic molecules and clusters. BC 
Applications (Physics), S*(15-16). Problems in 
Theoretical Physics, Volumes J-IJ. Willi-Hans Steeb. 
Bibliographisches Institut, 1990. Volume I: Introduc- 
tory Problems, 197 pp, (P), [ISBN: 3-411-14451-3]; 
Volume II: Advanced Problems, 290 pp, (P). (ISBN: 
3-411-14611-7] Volume I contains 120 problems on 
topics which should be familiar to undergraduates 
having completed courses in linear algebra and com- 
plex analysis. Whenever appropriate, connections to 
physics are cited. Volume IJ contains 136 advanced 
problems in ordinary and partial differential equa- 
tions, vector and tensor analysis, differential forms, 
generalized functions, quantum theory and gauge 
theory. Un-wordy and clear solutions are given for al- 
most every problem. Both volumes are lightly sprin- 
kled with helpful asides. Unfilled pages (created by 
insistence on beginning each problem at the top ofa 
page) cry for more. JDEK 


Applications (Physics), S. Algebraic and Spectral 
Methods for Nonlinear Wave Equations. N. Asano, 
Y. Kato. Pitman Mono. & Surveys in Pure & Appl. 
Math., V. 49. Longman Scientific & Tech (US Distr: 
Wiley), 1990, xvii + 418 pp, $158. [ISBN: 0-470- 
21417-1] Presents algebraic and spectral methods 
for nonlinear wave equations or nonlinear partial dif- 
ferential equations; introduces the method of contin- 
uous groups, the method of inverse scattering formu- 
lation, and fundamental materials of the Lie’s trans- 
formation method; covers the development and new 


results in the last twenty years in these and related 
fields. LS 


Reviewers 


RJA: Richard J. Allen, St. Olaf; RB: Richard Brown, St. Olaf, 
BC: Barry Cipra, St. Olaf; AD: Amy Davidow, Macalester; 
SG: Steven Galovich, Carleton; BH: Bruce Hanson, St. Olaf; 
JPH: Joan P. Hutchinson, Macalester; OJ: Ockle Johnson, 
St. Olaf; RWJ: Roger W. Johnson, Carleton; MK: Michael 
Kahn, St. Olaf; JDEK: Joseph D.E. Konhauser, Macalester; 
LCL: Loren C. Larson, St. Olaf; BL: Brian Loe, Carleton; AO: 
Arnold Ostebee, St. Olaf; MLR: Margaret L. Reese, St. Olaf; 
MPR: Matthew P. Richey, St. Olaf; KS: Karen Saxe, St. Olaf; 
GMS: G. Michael Schneider, Macalester; JS: John Schue, 
Macalester; LS: Li Sheng, Macalester; LAS: Lynn Arthur 
Steen, St. Olaf; MU: Milton Ulmer, Carleton; TAV: Theodore 
A. Vessey, St. Olaf- 
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different graphic calculator models are 
available, each offering hundreds of 
powerful, built-in functions. 

With our printers, you can create 
oH-7000c = “hard copy” of graphs and calculations 
for photocopying onto test papers or transparencies. 
The FA-80 Data Interface lets you store unlimited 
data on audio cassettes. 


Store 
unlimited data 
on audio cassette. 


Create 
printouts 


Pre-load students’ 
calculators with 
lesson data. 


The possibilities are endless. That’s why the NCTM 
curriculum calls for the use of graphic calculators. Many 
new textbooks are being published especially for use 
with this technology. 

So, take a bold, new step and enter Casio’s “Electronic 
Classroom” — a place where the solution is always clear. 


Creators of the 
Graphic Calculator 


570 Mt. Pleasant Avenue, Dover, New Jersey 078C 


7 Casio Educational Product Distributors 


ABC SCHOOL SUPPLY CUISENAIRE NEMESIS DISTRIBUTING 

800-669-4ABC 800-237-3142 800-940-7407 

(IN GA 404-497-0001) (IN NY 914-235-0900) (IN FL 305-477-8822) 

ALLIED NATIONAL DALE SEYMOUR PUBLICATIONS PENNS VALLEY PUBLISHING/ 

800-999-8099 800-872-1100 LEARNING SYSTEMS 

(IN MI 313-543-1232) (IN CA 800-872-1100) 800-422-4412 

ARROWHEAD BUSINESS MACHINES THE DOUGLAS STEWART CO. (IN PA 215-855-4948) 

800-234-3396 800-279-2795 SERVCO PACIFIC 
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(IN CA 213-946-6680) een 0270 800-421-5188 
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(IN CA 415-424-0800) EDUCATIONAL ELECTRONICS 800-558-6970 
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Introducing E.Z. Math, E.Z. Algebra and E.Z, Arithmetic for the HP 48SX 


E.Z. Math, E.Z. Algebra and E.Z. Arithmetic are programs for the Hewlett Packard 48SX calculator conceived, written 
and programmed by Raymond La Barbera and the E.Z. Software Company. Each program comes on a 128K plug-in ROM 
card accompanied by an easy-to-understand, well-written, detailed manual loaded with lots of specific examples and _is 
designed for use by students, teachers. parents and business people. Each program features an easy-to-use, logically 
organized, user-friendly interface which enables those who consider themselves to be calculator and computer illiterates, 
as well as those who don’t like to read manuals, to have full access to all program features quickly and easily. Since the 
HP 488X is essentially an impressive sharp-looking, 8 ounce pocket computer, students are easily motivated to take it 
along with them to study, practice, drill] and master math in a study hall, on a train or bus, in a car, on line, on vacation, 
on a break—in short, for self-study at any time and in any place. 


What Can Be Done With E.Z. Math 


E.Z. Math effectively solves problems involving graphs, numbers, loans and savings. With E.Z. Math, anyone can: 
» Master the entire high school and college graphing curnculum, from algebra to calculus, with 188 families of equations, inequal- 
ities, functions, and systems, all arranged tn an easy-to-use, user-friendly system of menus to make graphic analysis a snap! 
° Get extensive help with calculations involving fractions, whole numbers, complex numbers and number sequences. 
» Easily do savings and loan calculations and generate complete amortization tables. 
e Learn many basic concepts including those involving sets, variables, graphing, solving, numbers, loans and savings. 


What Can Be Done With E.Z. Algebra 


E.Z. Algebra is a comprehensive ninth grade high school basic algebra course as well as a high school and college remedial 
algebra course that builds a solid algebra foundation. With E.Z. Algebra, anyone can: 

» Learn about sets, operations, vanables, relations and other concepts essential to a real understanding of algebra. 

* Understand the sets of natural numbers, whole numbers, integers, rational numbers and real numbers. 

» Master the meaning and properties of the operations of addition, subtraction, multiplication, division, power and root. 

© Do ail kinds of problems involving algebraic expressions, numerical phrases, equations and inequalities. 


What Can Be Done With E.Z. Arithmetic 


E.Z. Arithmetic is a comprehensive elementary school basic arithmetic course as well as a high school and college 
remedial arithmetic course that makes solving most arithmetic problems a snap! With E.Z. Arithmetic, anyone can: 
¢ Learn how to add, subtract, multiply, divide and order whole numbers, fractions, decimals, percents and integers. 
» Master the meaning, terminology and conversion methods for whole numbers, fractions, decimals and percents. 
¢ Drill and be graded on endlessly varied, randomly selected sets of problems involving whole numbers, fractions, decimals and 
integers, with the difficulty level. number of problems, operation and type of number user selectable. 


How To Order Copies or Get Further Information 

Each E.Z. Software program costs $130.00 ($125.00 retail, plus $5.00 shipping and handling). Take a 10% discount when 
ordering ten or more units. We accept payment by check, money order, COD, VISA, MC, AE and purchase order. If 
within 50 days you find that any E.Z. Software program fails to meet your expectations, we'll gladly take back your copy 
for a prompt, courteous refund. To order copies, either individually or bundled with HP 48SX calculators, please contact: 


SMI Corporation, 250 West New Street, Dept MM2, Kingsport, Tennessee 37660 
(800) 234-0123 or (615) 378-4821 or (615) 245-8982 (Fax). 


Send this application to: MAA Membership Department 
1529 18th Street, N.W. 
Washington, D.C. 20036-1385 
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Find the column for your desired combination of MAA journals in the table below. All members receive 
Focus. Select the row appropriate for your initial membership period (1 year or 1.5 years) and your status 
(student or regular). Circle your dues in the table. Write the amount on the form below. 


Journal codes used in the columns of the dues table are: THE AMERICAN MATHEMATICAL MONTHLY = M 
MATHEMATICS MAGAZINE = G THE COLLEGE MATHEMATICS JOURNAL = J 

Student Membership M G J M+G M+4+J G+eJ M+GeJd 
1 year Jan.—Dec. 1992) $34.00 $28.00 $30.00 $ 40.00 $ 41.00 $ 35.00 $ 47.00 
1.5 years (July 1991--Dec. 1992) $50.00 $41.50 $4450 $5850 $60.50 $ 51.50 $ 69.00 
Regular Membership 

1 year (Jan.-Dec. 1992) $68.00 $55.00 $58.00 $ 82.00 $ 85.00 $ 72.00 $ 99.00 
1.5 years July 1991-Dec. 1992) $99.50 $78.50 $83.50 $121.50 $126.50 $105.50 $148.50 


These are specially discounted rates for new members provided to help the MAA reach a wider audience. They are not available to those 
who have been members for two years. Student membership is available to high school and undergraduate students and to students 
regularly enrolled in graduate study at least half time. Student rates apply to unemployed persons who are seeking employment. Annual 
dues include subscription prices as follows: Regular Member $27 M, $17 J, $14 G, $6 Focus. Student member $11 M, $7 J, $5 G, $3 Focus. 
These rates are guaranteed for the indicated periods only. 
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Renewed for ’91: a free 
classroom display device 
with purchase of 30 
calculators. 


Showing is much more powerful 
than telling. So we’ve developed 
special classroom displays for 
our most advanced calculators. 


The HP 488X scientific expand- 
able calculator, the cost-effective 
HP 2858 and the new HP 48S are 
designed to put your students 
on the cutting edge of calculus 
and engineering. With more built- 
in functions and graphics solu- 
tions than any other calculators. 


If your department or students 
purchase 30 HP 48SX, HP 285 or 
HP 48S calculators (or a mix of 
all three), we'll give you free an 
HP 48SX and plug-in classroom 
display (a $900 retail value). Or 

a classroom display version of 
the HP 28S (a $600 retail value). 
And free teaching materials. 


Call (503) 757-2004 from 

8am to 3pm PDT for details. Or 
write: Calculator Support, 
Hewlett-Packard, 1000 NE Circle 
Blvd., Corvallis, OR 97330. Offer 
ends October 31, 1991, and 
applies only to college and high 
school instructors. 
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Winning Women into 
Mathematics 


Patricia Clark Kenschaft, Editor 


American media often ask why women “can’t” do 
mathematics. Any answer is misleading. Better 
questions are needed, along with indications of 
how to find potential answers. 


The Committee on the Participation of Women of 
the Mathematical Association of America was 
established in 1987 “to work for full involvement of 
womenin MAA activities that will encourage women 
to pursue careers in the mathematical sciences.” 
With this book, the Committee seeks to expand 
the number and effectiveness of those winning 
women into mathematics. WINNING WOMEN is 
written to inform, to empower, and to inspire. 


The Committee identifies fifty-five cultural cus- 
toms that discourage aspiring women mathema- 
ticians. They tell us how these customs can be 
changed and what can be done to recruit, retain, 
and acknowledge women in mathematics. A bib- 
liography of over 100 sources on the issues of 
women’s participation in mathematics Is included, 
as well as descriptions of programs that have 
been successful in encouraging young women to 
study mathematics. The book is filled with inter- 
esting anecdotes, and contains over 50 photo- 
graphs of prominent women in mathematics. 


88 pp., 1991 , Paperbound 
ISBN 0-88385-453-8 


List: $11.00 MAA Member: $9.00 


Catalog Number: WIW 


CONTENTS 
A bibliography of over 100 sources on 
the issues of women’s participation in 
mathematics 
Fifty-five cultural patterns causing Ameri- 
can women to be underrepresented in 
mathematics 
What you personally can do 


Programs that succeed 


A history of women in mathematics- 
especially in the MAA 


A chronicle of the programs, articles, 
and suggestions of the Committee on 
the Participation of Women 

A minority woman’s viewpoint 


An overview of the statistics 


Photographs, anecdotes, cartoons 
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(FAX) (202) 265-2384 

Prepaid orders sent postage & 
handling free. Visa and Mastercard 
orders accepted. (Please give the card 
number and expiration date on credit 
card orders) We will bill for orders 
over $10.00. 


Recursive & iterative programming ¢ Over 70 new math functions e New manual, 500 examples & exerci 


The Joy of 


DERIVE.— A Mathematical 
Assistant program. 


PC Magazine says it’s 

“a joy to use” and proclaims it 
“Editors’ Choice.” PC Week calls 
it “fast and capable.” The 
DERIVEe® program is de- 
livered with built-in 
standard equioment 
that delights 


EDITORS? 
CHOICE 


May 29, 1990 
Derive, Version 16 


both 
math lovers and 
math phobics. 


High performance. 


DERIVE does numeric and symbolic 
equation solving, exact and approximate 
arithmetic to thousands of digits, calculus, 
trigonometry and matrices. It displays or- 
mulas in comprehensible 2D format using 
raised exponents and built-up fractions. It 
plots beautiful 2D curves and 3D surfaces on 
monochrome or color monitors. 


Compact and sporty. 


PC Magazine cails ita “small wonder.” All tt 
takes is 512K of memory and one floppy disk 
drive. DERIVE takes to the road on PC com- 
patibles, and really gets around on laptop 
and even handheld computers! 


DERIVE is a Handcrafted 
registered trademark of Software 
Soft Warehouse, Inc. for the Mind 


Human engineering. 


You don't have to be fluent in computerese to 
use DERIVE. |n fact, it’s the friendliest and 
easiest to use of any symbolic math package 
on the road today. 
Its menu-driven 
interface 


and 
on-line help 
make it easy— 


you'll soon be up to 
speed doing math, instead of 

trying to learn how to drive the 
software. 


Freedom to maneuver. 


DERIVE’s automated expertise releases you 
from the drudgery of hand calculations. 
You can do problems you'd never attempt 
otherwise and obtain exact symbolic solu- 
tions, in addition to approximate numerical 
solutions. 


No sticker shock. 


DERIVE’s suggested retail price is $250. And 
of course DERIVE doesn't require an expen- 
sive computer, a math co-processor, or even 
a hard disk drive. 

Order DERIVE through your favorite software 
dealer or mail-order house. For a list of 
dealers, write Soft Warehouse, Inc. at 3615 
Harding Avenue, Suite 505, Honolulu, HI 
96816. Or call (808) 734-5801 after 11 a.m. 
Pacific Standard Time. 


And happy DERIVEing! 


Soft Warchouse: 


HAWAII 
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V.M. Tikhomirov, 
Translated by Abe Shenitzer 


We are pleased to announce the first volume in a 
new series of expository books translated from the 
Russian KVANT Library, published jointly with the 
American Mathematical Society. 


Throughout the history of mathematics, maximum 
and minimum problems have played an important 
role. Many beautiful and important problems have 
appeared in a variety of branches of mathematics 
and physics. The greatest scientists of the past— 
Euclid, Archimedes, Heron, the Bernoullis and 
Newton, took part in seeking solutions to these 
concrete problems. The solutions stimulated the 
development of the theory, and, as a result, tech- 
niques were elaborated that made possible the 
solution of a tremendous variety of problems by a 
single method. 


This book presents fifteen “stories” designed to 
acquaint the reader with the central concepts of 
the theory of maxima and minima as well as with 
its illustrious history. 


In Part One, the author familiarizes readers with 
concrete problems that lead to discussion of the 
work of some of the greatest mathematicians of all 
time. Part Two introduces a method for solving 
maximum and minimum problems that originated 
with Lagrange. While the content of this method 
has varied constantly, its basic concept has en- 
dured for over two centuries. The final story is 
addressed primarily to those who teach math- 
ematics, for it impinges on the question of how to 
teach. The author strives to show how the analysis 
of diverse facts gives rise to a general idea, how 
this idea is transformed, how it is enriched by new 
content, and how it remains the same in spite of 
these changes. 


Contents 


Ancient maximum and minimum problems: 1. 
Why do we solve maximum and minimum prob- 
lems?; 2.The oldest problem—Dido’s problem; 3. 
Maxima and minima in nature (optics); 4. Maxima 
and minima in geometry;5. Maxima and minima in 
algebra and in analysis; 6. Kepler’s problem; 7. 
The brachistochrone; 8. Newton’s aerodynamical 
problem; Methods of solution of extremal prob- 
lems; 9. What is a function?; 10. What is an 
extremal problem?; 11. Extrema of functions of 
one variable; 12. Extrema of functions of many 
variables. Lagrange’s principle; 13. More problem 
solving; 14. What happened later in the theory of 
extremal problems; 15. The fifteenth story, or 
rather, a discussion. 


187 pp., 1991, Paperbound 
ISBN 0-8218-0165-1 


List: $23.00 MAA Member: $18.00 


Catalog Number: MAXIM 


ff ORDER FROM: 

Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, DC. 20036 

(FAX) (202) 265-2384 


Prepaid orders sent postage & 
handling free. Visa and Mastercard 
orders accepted. (Please give the card 
number and expiration date on credit 
card orders) We will bill for orders 
over $10.00. 


Fractals 
Endlessly Repeated Geometrical Figures 


Hans Lauwerier 
Translated by Sophia Gill-Hoffstaat 

Fractals are shapes in which an identical motif repeats itself on 
an ever diminishing scale. No longer mathematical curiosities, 
fractals are now a vital subject of mathematical study, practical 
application, and popular interest. For readers interested in 
graphic design, computers, and science and mathematics in 
general, Hans Lauwerier provides an accessible introduction to 
fractals that includes instructions on how to design both new and 
well-known examples with the help of a microcomputer. 
35 color illustrations. 103 line illustrations. 
Paper: $14.95 ISBN 0-691-02445-6 
Cloth: $39.95 ISBN 0-691-08551-X 
Not available from Princeton in the United Kingdom and Europe 


D-modules and Spherical 
Representations 


Frédéric Bien 

The theory of D-modules deals with the algebraic aspects of differential equations. The book 
provides a general introduction to the theory of D-modules on flag varieties, and it describes spherical 
D-modules in terms of a cohomological formula. Using microlocalization of representations, the author 
derives a criterion for irreducibility. The relation between multiplicities and singularities is also 
discussed at length. 
Mathematical Notes, 39 
Paper: $22.50 ISBN 0-691-02517-7 
In Japan order from United Publishers Services 


With a new introduction by Thomas Banchoff 


Flatland 


A Romance of Many Dimensions 
Edwin Abbott Abbott 


Over a hundred years ago, Edwin Abbott Abbott wrote a mathematical adventure set in a world on 
one plane, populated by a hierarchical society of regular geometrical figures—who think and speak 
and have all too human emotions. Since then Flatland has fascinated generations of readers, 
becoming a perennial science-fiction favorite. 

Today, we can manipulate objects in four dimensions and observe their three-dimensional slices 
tumbling on the computer screen. But how do we interpret these images? In his introduction to the 
volume, Thomas Banchoff points out that there is no better start on the problem of understanding 
higher-dimensional slicing phenomena than reading this classic novel of the Victorian era. 

Paper: $7.95 ISBN 0-691-02525-8 


41 WILLIAM ST. e PRINCETON, NJ 08540 e (609) 258-4900 
ORDERS: 800-PRS-ISBN (777-4726) e OR FROM YOUR LOCAL BOOKSTORE 


An MAA 
Report 

on Teacher 
Preparation 


A CALL FOR CHANGE 
Recommendations for the 
Mathematical Preparation of 
Teachers of Mathematics 


James R. C. Leitzel, Editor 


How can we improve the teaching and 
learning of mathematics in our schools to 
better prepare our students for the future? 
We can begin by making some changes in 
the way our teachers learn and teach math- 
ematics. A CALL FOR CHANGE, a new 
MAA Report, offers a set of recommenda- 
tions that come from a vision of ideal mathe- 
matics teachers in classrooms of the 1990’s 
and beyond. It describes the collegiate 
mathematical experiences that a teacher 
needs in order to meet this vision. 


A CALL FOR CHANGE discusses stan- 

dards common to the preparation of math- 

ematics teachers at all levels; Elementary 

School Level (K—4), Middle School Level 

(5-8), and Secondary School Level (9-12). 

It shows what type of preparation teachers 

need to help them: 

@ (i) view mathematics as a system of in- 
terrelated principles, 

@ (ii) communicate mathematics accurately 
verbally, and in writing, 

M@ (iii) understand the elements of mathe- 
matical modeling, 


A Call For Change: 


commendations For The 
ematical Preparation Of Teachers Of Mathemat 
atics 


An MAA @ Report 


The Mathema 


tica sb 
Committee o I Association Of America 


In The Mathematicz| Education Of Teachers 
James R C Lentzej, Editor 


@ (iv) understand and use calculators and 
computers appropriately in the teaching 
and learning of mathematics, and 

™@ (Vv) appreciate the development of math- 
ematics both historically and culturally. 


Mathematics continues to be a dynamic 
changing discipline. There is new mathe- 
matics that can be exciting for young people 
to learn and technology provides new ap- 
Droaches for teachers to engage students 
in the teaching and learning of mathemat- 
ics. The mathematical preparation of teach- 
ers must adapt to these changing realities. 
Order your copy now of this important set 
of guidelines. 


60 pp., 1991, ISBN 0-88385-072-9 
List: $7.00 
Catalog Number CFC 


Order From 


1529 18th Street, N.W. 
Washington, D.C. 20036 
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The purpose of the present paper is to give a modern complete proof of 
unsolvability of the Tenth Problem of Hilbert. Our intention is to give the shortest 
proof known today, one which takes into account all the simplifications found since 
the problem was solved by the second author in [1970]. 

The Tenth Problem of Hilbert is the problem of solvability of diophantine 
equations. As originally formulated by Hilbert [1900], it was to find an algorithm to 
decide whether a polynomial equation in several variables, P(x,,x,,...,x,) = 9, 
has a solution in integers. 

The problem can also be formulated in terms of existence of solutions in natural 
numbers (nonnegative integers). These two forms of Hilbert’s Tenth problem are 
equivalent. An equation P(x,, x,,...,x,) =0 has a solution in integers if and 
only if the product I[P(+x,,+%x,,...,+x,) = 0 has a solution in natural num- 
bers. Also, from Lagrange’s Four Squares Theorem [1770], P(x,, x,,...,x,) = 0 
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has a solution in natural numbers iff P(x? +y?+u? +07, x3+y}+u3+ 
U3,...,x2 + y? + u2 + v2) = 0 has a solution in integers. 

What will be proved here is the algorithmic unsolvability of Hilbert’s Tenth 
Problem, i.e., the nonexistence of an algorithm, over natural numbers. This will be 
done in the normal way, by reducing another unsolvable problem to Hilbert’s 
Tenth. Actually we will reduce all unsolvable problems to Hilbert’s Tenth. To 
explain we need some definitions. 

Let A be a set or relation on the natural numbers. The set A is called 
recursively enumerable (r.e.) if A is the exact range of a recursive function (or if A 
is empty). A set is called recursive if its characteristic function is a recursive 
function. A function f is said to be recursive if it is computable by a Turing 
machine or register machine (see §4). 

The concept of recursively enumerable set is more general than that of recursive 
set. The relationship between the two concepts is also important here. A set A is 
recursive if and only if A and its complement A’ are both r.e. sets. Thus every 
recursive set is recursively enumerable. However there exist r.e. sets which are 
non-recursive. Hence the negative solution to Hilbert’s Tenth problem will follow 
immediately from the following theorem: 


THEOREM 1. Every r.e. relation A(a,, a5,...,4,,) can be represented in the form 


A(Qq,-++5Qm)  (A%1,..-,%,)[ PC G1, +. +5 Ans X15---5X,) = O). (1) 


Here P(a,,...,4,,,X1,..-,X,) is a polynomial with integer coefficients which 
depend on A. The Theorem is that for each r.e. set A, a polynomial P exists such 
that (1) is satisfied for all values of a,,a,,...,a,, (the parameters). The variables 
X1,Xz,--.,X,, correspond to the unknowns in our equation. All the variables, 
parameters and unknowns, range over the same set, nonnegative integers. 

A set or relation A(a,,...,a,,) in the nonnegative integers, definable in the 
form (1), is called a diophantine set. Thus Theorem 1 asserts that every r.e. set is 
diophantine. The converse also holds trivially: every diophantine set is r.e. Thus 
Theorem 1 asserts the equality of two collections of sets. 

Theorem 1 implies the nonexistence of an algorithm for Hilbert’s Tenth 
Problem because it reduces the decision problem for every r.e. set, to some 
instance of Hilbert’s Tenth. If Hilbert’s Tenth were solvable, then every r.e. set 
would be recursive. However, as we mentioned, there exist r.e. sets which are 
nonrecursive. Taking the set A in Theorem 1 to be such a set, one sees that there 
can exist no algorithm for Hilbert’s Tenth Problem. 

This argument seems completely satisfying to us: however, for those who wish it 
we give an additional argument after the proof of Theorem 1 (section 5). 

Theorem 1 also implies the algorithmic unsolvability of Hilbert’s Tenth Prob- 
lem. This is a slightly stronger conclusion than recursive unsolvability. When we 
come to this conclusion on the basis of Theorem 1, we are implicitly using the 
so-called Church—Turing Thesis. This is the statement that every computable 
function is Turing computable. It is called a thesis rather than a theorem because 
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it is apparently unprovable. It is somewhat in the nature of a definition, or 
proposal, that we identify the intuitive concept of computability with the precise 
mathematical formalization, Turing computability, register machines or General 
Recursiveness. 

Today the Church—Turing Thesis is widely accepted. If an algorithm exists to 
solve Hilbert’s Tenth problem, then Theorem 1 implies that it would have to lie 
beyond the present concept of algorithm, beyond Turing machines, beyond register 
machines, beyond Markov algorithms and indeed beyond all known formalizations 
of algorithm (all of which have been proven equivalent). 

Theorem 1 implies for example that an algorithm solving Hilbert’s Tenth 
Problem would also solve the word problem for groups, the halting problem for 
Turing machines and all other known r.e. unsolvable problems. Needless to say, no 
example of such an algorithm is known. 

If the reader wishes to know more about the theory of computability, he or she 
is directed to the books of Davis [1958], Minsky [1967], Rogers [1967], the second 
author’s paper [1984] or the first author’s [1974] MONTHLY paper. 

The unsolvability of Hilbert’s Tenth problem was originally proved in two steps. 
Step 1 was taken by Martin Davis, H. Putnam and Julia Robinson [1961] who 
obtained an exponential form of Theorem 1. Here P was not a polynomial but 
contained an exponential function, y = a*. In this form Theorem 1 can be 
interpreted as saying that every r.e. set is exponential diophantine. 

The second step in the solution of Hilbert’s Tenth Problem was to show that the 
exponential relation itself y = a* is diophantine. This difficult last step was taken 
by the second author [1970]. This proved Theorem 1 and solved Hilbert’s Tenth 
Problem in the negative. 

To show that the exponential function y = a” is diophantine, divisibility proper- 
ties of the sequence of Fibonacci numbers were used in [1970]. Subsequently it was 
seen how to do this using the sequence of solutions of the Pell equation. Proofs of 
Chudnovsky [1970], Davis [1971] [1973] and Kosovskii [1971] all use the sequence of 
solutions of Pell’s equation. 

Today the Pell equation gives the simplest known proof. We will also use the 
Pell equation here. Concerning this part of the proof, considerable credit goes to 
Martin Davis and Julia Robinson. Many simplifications in the present proof can be 
traced to their discoveries. 

By way of small new improvements in the Pell equation part of the present 
proof, we mention (for experts), that in the main lemma on diophantine represen- 
tation of the sequence of solutions of Pell’s equation (Lemma 2.27), use of the 
Chinese remainder theorem has been eliminated. (Davis [1973] p. 246.) Those who 
know the subject will also find that in this Pell equation part of the proof we 
eliminated also the use of the axiom of existence of infinitely many solutions to the 
general Pell equation, 


(Wd)(Ax,y)[d# 0 >x?-dy?=1A0<yl. (2) 


This assumption (or axiom) was used in Matijasevi¢—Robinson [1975] (pp. 532-533). 
Of course the axiom is true. From a certain point of view its use doesn’t matter. 
However, if one is concerned with formalizability of the proof in axiomatic 
theories, then use of this axiom is a deficiency. So we are pleased to have 
eliminated it. 
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When d is an arbitrary non-square, as is the case in (2) (d # OD), then it is 
difficult to prove statement (2) from a restricted set of first order axioms. For 
example statement (2) cannot be proved in the theory known as Bounded Arith- 
metic; Peano’s axioms with the induction axiom 


A(0) A (Wx)[A(x) > A(x + 1)] > (Wx)[A(*)], (3) 


restricted to instances where all quantified variables in the A(x) are bounded by 
polynomials in x. 

Indeed, whether the whole of the proof of unsolvability of Hilbert’s Tenth 
problem can be formalized in Bounded Arithmetic is an open problem. J. Paris 
and C. Dimitracopoulos [1982] showed that this would be the case if we add one 
more axiom (stronger than (2)). But without this axiom the answer is not known. 

This formalization problem is important because if the proof of Hilbert’s Tenth 
Problem can be formalized in Bounded Arithmetic, then, from an idea of A. J. 
Wilkie, NP = co — NP follows. This is why we mention that in this paper axiom 
(2) is used only for d of the special form, d = a” — 1. For these d axiom (2) is 
provable in Bounded Arithmetic. This increases the likelihood that the present 
proof can be formalized there. 

Besides its use in formalization, the proof here can also be used as the main link 
in a proof that every Turing computable function is recursive. We give a method 
for encoding computations of arbitrary register machines which greatly simplifies 
the tedious arithmetization usually involved with this procedure. This method of 
coding we call bit masking. It uses a famous theorem of Lucas [1878] on congru- 
ences in binomial coefficients this method was worked out in our [1984]. Originally 
we made use of a classical theorem of Kummer [1852], on the power of a prime 
dividing a binomial coefficient. Later we discovered the simpler proof using Lucas’ 
Theorem (see Lemma 3.10). 

Nearly the whole of the present proof of Hilbert’s Tenth Problem is now 
number theory. In fact it is classical number theory. Section 2, on the sequence of 
solutions of Pell’s equation, belongs to the Lucas—Lehmer theory of recurrent 
sequences. In the terminology of Lehmer [1930], this is the study of the Lucas 
sequences U, and V, where P = 2a, Q = 1, D = 4(a* — 1) and R = 4a’. With 
these values of the parameters, U, and V, correspond to the sequence of 
solutions of Pell’s equation, x* — dy* =1 with d=a*—1. Here U, = Y,(n) 
and V, = 2: X,(n) where X,(n) and Y,(n) are the sequence of solutions of x* — 
(a* — 1)y* = 1, (see §2). 

This theory is very old. Though the multiplication by 2 makes a small difference, 
it is not difficult to interpret Y,(m) and X,(n) in terms of U,, and V,, and so to see 
that many of the theorems go back to Lucas [1878] and Lehmer [1930]. Some 
modern theorems are due to Julia Robinson [1952] [1969]. In this connection it is 
interesting that while Lucas preferred the functions U, and V,,, Julia Robinson 
preferred the sequences Y,(n) and X,(n). 


§1 Diophantine Sets. A diophantine equation is a polynomial equation, 
P(d1,.. +5 Ams X1,>+++,X,) = 0, in several variables, with integer coefficients. The 
variables are divided into parameters a,,a,,...,a,,, and unknowns, X1,X5,.-+-5 Xp: 
All the variables, parameters and unknowns, range over the set of nonnegative 
integers, 0,1,2,.... 
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In the classical theory of diophantine equations one begins with an equation 
and asks for values of the parameters for which there exists a solution. In this 
proof we turn the usual procedure around. We start with the solution and search 


for the equation. We begin with a relation A(a,,a,,...,a,,) and we look for a 
polynomial P(a,,...,4,,,X,,..-,,) defining it in the sense of (1). 

DEFINITION 1.1. A relation A(a,,a,,...,a,,) is diophantine if there exists a 
polynomial P(a,,...,4,,,X,,-.-,%,) such that for all values of a,,...,a,,, (the 
parameters) 


A(Aq,-++5 4m)  (AXy, X2,---,%,)[ PCy, -- +5 Gigs X15 --+>X_) =O]. (1-1) 


This definition which is for relations, will also do for functions. A function will 
be said to be diophantine if its graph is diophantine. Below are examples of 
diophantine functions and relations. Most will be used in the proof. These 
examples include the elementary relations of order < , divisibility a|b, and congru- 
ence, a = b (mod c). 


a<b#(Ax)[a+x=b], (1.2) 
alb = (Ax)[ax = b], (1.3) 
a=b(modc) @ (Ax)[a =b+cx ora=b- cx]. (1.4) 


Disjunctions and conjunctions as occur in connection with (1.4) can be dealt with 
using 

A=0orB=0 e2A-:B=0, A=0&B=0 © A’? +B’*=0. (1.5) 
In the case of conjunctions, it is necessary to rename variables which occur in 
both A and B, for example 43x4(x) =0 A AxB(x) = 0 is equivalent to 
Ax dy[ A(x) = 0 A BCy) = O). 

From (1.5) it follows that conjunctions and disjunctions of Diophantine relations 
(but not negations) are diophantine. Proceeding in this way, we can prove that a 
great many relations are diophantine. A good is the relation of coprimality (a is 
relatively prime to b, (a, b) = 1) written in this paper as a 1 b. This relation can be 
seen to be diophantine using (1.5) together with 


a ib« (Ax, y)[ax — by = lor ax — by = —-1]. (1.6) 


Other examples of diophantine functions are the remainder function, r = rem(a, b) 
(r is the remainder after a is divided by b), and the quotient function, q = quo(a, b), 
meaning q is the quotient when a is divided by b. (This is the same as the integer 
part function, g = |a/b].) These functions can be seen to be diophantine from 


r=rem(a,b) @ r=a(modb) andr <b, (1.7) 
q = quo(a,b) ~ 0<a-—qb<b. (1.8) 


Proceeding in this way, defining new relations from old, using, often (1.5) we will 
show that a great many relations are diophantine. The most important tool in this 
process will be the sequence of solutions of the Pell equation. 


§2. The Pell equation. A general Pell equation is an equation of the form 
x* —dy*=1 (d#0). (2.1) 


where d is a constant and x and y are unknowns. (A hyperbola in the x, y plane.) 
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When d = 0 (d is a square), then the Pell equation (2.1) has only the trivial 
solution, (1,0). So one always assumes that d # O (d is not a square). 

Because x2 — dy? factors over the reals, x2 — dy? = (x + Vdy\Xx — Vdy), it is 
natural to work in the integral domain Z[Vd] consisting of real numbers of the 
form a=x+yvd, x,y € Z. Since yd is irrational, the integers x and y are 
unique. They are called the components of a. 

When a =x + y/d, the number @ = x — y/d is known as the conjugate of a. 
The number N(a) = aa@ = x* — dy” is called the norm of a. Since N(a) = x* — 
dy, the integer solutions to the Pell equation are just the components x, y of reals 
a such that N(a) = 1. 

N(a) = 1 and Ma) = a-a@ imply a~! = a, the conjugate of a is equal to the 
inverse. Also a =x + Vd | -y and 1 <a implies 1 <x and 0<yy. To see this 
observe that from 0 <a" <lLat+a=2x anda-—-a= = 2y/d we have 3<x 
and 0 <y. Then from x* = 1 + dy” we have 1 <x. Hence the inequality 1 <a 
implies that the components x and y of @ are nonnegative. 

From aB=@:B we have MaB)=aB-:aB=a'B:a:B=a'a‘B°B= 
N(a): N(B). Thus if N(aw) = 1 and N(B) = 1, then N(a: B) = 1. So the product 
of two reals representing solutions of the Pell equation represents again a solution. 
We also have N(a’) = Na). 

Now let a =x, +y,yd and B =x, + y,,d be two solutions of the Pell equa- 
tion, with x,,y,, 5, y> integers, N(a) = 1 and N(B) = 1. Suppose 1 <a and 
1 < B hold. As we have seen, *1, Vip x2, y, must then be nonnegative. So from the 
equations x7 = dy? +1 and x3 =dy3 + 1, it follows that x, <x, iff y, <yp. 
Thus 1 <a <f holds iff both 1 <x, <x, and 0 <y, <y, hold. So the set of 
reals a for which N(a) = 1 and 1 <a hold, is a well ordered set. If this set is 
nonempty, if there exists a nontrivial solution to the Pell equation, then there must 
exist a least real a such that 1 < a and N(a@) = 1. This real is called the generator. 
Its components are called the fundamental solution. 

The powers of the generator a generate all solutions to (2.1). To see this, 
observe first that N(a@) = 1 implies N(a”) = Na)” = 1, so that powers of a 
represent solutions. To see that these powers of a give all solutions of (2.1), 
observe that if N(B) = 1 and 1 < B, then Jn such that a” < B < a"*!. From 
1<B-a "<a and Nba") = N(B)Na')" = N(B)N(@)" = N(B)Na)” = 
it then follows that B:a~” = 1. Hence B = a”. Therefore 6 can be obtained as a 
power of a. 

It is not entirely trivial to show that for arbitrary d # O there always exists a 
nontrivial solution to the Pell equation (2.1). But when d is a nonsquare of the 
special form d = a’ — 1, then it is easy to prove this and hence the existence of 
infinitely many solutions. The special Pell equation with d = a” — 1, 


2— (a? - 1)y*=1, (2.2) 
has the fundamental solution (x, y) = (a, 1). So the generator is a + Va* — 1. By 
the preceding, the powers of this real then give all solutions, 

Xn) + Y,(n)-va*—-1=(a+vVa*—-1). (2.3) 


Let x = X,(n) and y = Y,(n) denote this sequence of solutions to (2.2), defined by 
(2.3). Then X,(n) and Y,(n) are strictly increasing functions of n. This is evident 
from (2.9) below. 
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Taking the conjugate of both sides of (2.3) one finds that the sequences X,(n) 
and Y,(n) are also definable from the conjugate of the generator, a — Va? — 1, 


X,(n) — Y,(n)Va* — 1 = (a — Va? — 1). (2.4) 


As we have mentioned, @ = a~', the conjugate of the generator is equal to the 
inverse, 


a—Va°—-1=(a+t a1). (2.5) 


This implies that most of the identities which hold for the sequence of solutions 

of the Pell equation also hold for negative values of parameters. For example 

from the identities (a + Vd)"*" = (a+ Vd)"“(a+ Vd)", (a + Vd)" = 

(a+ vVd)"a + yd)~”, using (2.3), (2.4) and (2.5) one obtains the identity 

X,(n +m) + Y,(n + m)vd = (X,(n) + Y,(n)Vd)(X,(m) + Y,(m)vd). 

(2.6) 

Taking the rational and irrational parts of (2.6), one obtains the Addition Equa- 

tions (Lucas). Here the + signs correspond. 


X,(n +m) = X,(n)X,(m) + dY,(n)¥,(m), (2.7) 
Y,(n +m) = Y,(n)X,(m) + X,(n)¥,(m). (2.8) 


These hold only for a > 2 but if we define X,(m) = 1 and Y,(n) = n, then (2.8) will 
hold for a = 1. Putting m = 1 in (2.7) and (2.8), we see that also, as a special case 
of (2.7), (2.8) 


X (n+ 1) =aX,(n) + dY,(n), Y(n+1)=aY,(n)+X,(n), (2.9) 
X,(n — 1) — aX ,(n) a aY,(n), Y,(n a 1) — aY,(n) — X,(n). (2.10) 


Adding pairs of equations (2.9) and (2.10), one obtains representations of X and Y 
as Lucas Sequences, that is, sequences satisfying a second order linear recurrence. 


X (0) = 1, X(1) =a, XA(n +1) = 2aXx,(n) —X,(n — 1). (2.11) 
¥,(0)=0, Y¥,)=1,  ¥,(m +1) =2aY,(n) -¥,(n- 1). (2.12) 
From the addition equations, (2.7) and (2.8), together with defining equation (2.2), 
one can derive the Double Angle Formulas (Lucas) 
X,(2n) =2X,(n)° -1, (2.13) 
Y(2n) =2X,(n)Y,(n). (2.14) 
Generally we are interested in the X, and Y, sequences only for a > 2. However 
as we mentioned, when a = 1 they can be defined in a natural way satisfying (2.8), 
(2.11) and (2.12). We can define X,(n) = 1 and Y,(n) = n. 


For fixed n, the function Y,(m) is a polynomial in a. This can be seen from 
(2.12). The degree is n — 1. From this, that Y,(1) is a polynomial in a, one obtains 


Congruence Rule, Y(n) = Y,(n) (moda — b). (2.15) 


The Congruence Rule holds for a, b > 1. Putting b= 1 in (2.15) and using 
Y,(n) =n we obtain the special congruence rule of Julia Robinson [1952]: 


Y(n) =n (moda — 1). (2.16) 
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The Lucas equations (2.11), (2.12) also enable one to derive bounds on the size 


of the X, and Y, sequences. For example, it is easy to derive the following bound 
on Y,(n): 


(2a —1)" < Y,(n +1) < (2a)". (2.17) 


This inequality, which holds for 1 < a, 1 <n, shows that the Y,(m) sequence grows 
exponentially in nm. These sequences can also be more directly related to exponen- 
tiation. The following congruence was obtained by Julia Robinson [1952]: 


X,(n) — (a —k)Y,(n) =k” (mod 2ak — k? — 1). (2.18) 


Proof. We will show that the congruence holds for all n > 0 and all k > O. It is 
easy to check for n = 0 and n= 1. So proceeding by induction on n, using 
recurrence equations (2.11) and (2.12), we obtain 


X(n+1)-(a—k)Y¥(n +1) =2a-X,(n) —X,(n - 1) 
(a —k)[2a-¥,(n) - ¥,(n - 1)] 
= 2a[X,(n) — (a — k)Y,(n)| 
-[X,(n — 1) - (a )Y,(n - 1)] 
= 2ak" —k""' =k"~!(2ak — 1) 
=k""!(2ak —k? -1+k*) =k""'(0+ k?) 
= k""'k* =k"*! (mod 2ak — k* — 1). 
Next we prove a divisibility property which we will need in the proof of (2.20). 
nlm - Y,(n)lY,(m). (2.19) 


Proof. From the Addition Equation (2.8) we have (dropping the subscript a), 
Y(k +n) = Y(kK)X(n) + X(K)Y(—n) = Y(K)X(n) (mod Y(n)). But Yn) 1 X(n) 
(Y(n) and X(n) are relatively prime), by (2.2). So Y(n)|YCK + n) iff Yn)|YCK). 
Now let m = ni + r where 0 <r <n. Then 0 < Y(r) < Y(n). Also Y(n)|Y(m) iff 
Y(n)|Y(ni + r) iff Yn) | YC). Hence Y(n)|Y(m) iff r = 0, i.e. Y(n)| Ym) iff n|m. 


First STEP Down Lemma 2.20. Y,2(n)|¥,(m) @ n-Y,(n)|m. (1 <a). 
Proof. By using identity (2.3), twice, it is easy to prove that for any j 
X,(ni) + Y,(niVd = (X,(n) + ¥,(n)vd)’. (2.21) 
Now expand the right side of (2.21) and take the irrational part to get 


Yala) = E (/) xeon’ “Yoory' (a _ (2.22) 
Hence | 
Y,(nj) = iX,(n)’'Y,(n) (mod ¥,(n)’). (2.23) 


For the proof of 2.20 in the = direction, suppose Y(n)7|Y(m). Then by (2.19) 
m = nj for some j. Let this be the / in (2.23). Since X(n) 1 Y(n), (2.23) implies 
Y(n)*|7Y¥(n). Hence Y(n)|j, so that n-Y(n)|m. For the converse suppose 
n-Y(n)|m. Let j = Y(n). Then (2.23) > Y(n)*|Y¥(n- Y(n)). So by (2.19), 
Y(n)*|Y(m). 
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Lemma 2.24. For 2<aand1<n, Y(n—1)+Y(n) < X,(n). 


Proof. Replace n by n — 1 in (2.9) to obtain a: Y(m — 1) + X(nm — 1) = Y(—m). 
From 2 <a, we get 2:-Y(n-—1)<a-:Y(n-1)<a‘Ya-D)+Xn-1)= 
Y(n). Hence Y(n — 1) < Y(n) — Y(n — 1). Add Y(n) to both sides to obtain 
Y(n — 1) + Y(n) < 2:Y(n) — Yn - 1) < a: Y(n) — Y(Cn — 1) = X(—n), by (2.10). 


LEMMA 2.25. Y(4nitm) = Y,(m), Y(4ni+2n + m) = FY(m) (mod X,(n)). 


Proof. From (2.14) Y(2n) = 0 (mod X(n)). From (2.13) X(2n) = -1 
(mod X(n)). So from (2.8) Yn + m) = Y(2n):-X(m) + XQn): Ym) = FY(m) 
(mod X(n)), we have Y(2n + m) = #Y(m) (mod X(n)). Using this Y(4n + m) = 
Yn + 2n +m) = —YQn +m) = +Y(m) (mod X(n)). So also Y(44n + m) = 
+Y(m) (mod X(n)). Here the signs + correspond. In the next lemma they do not. 


SECOND STEP Down LEMMA 2.26. Y,(k) = +Y,(m) (mod X,(1)) ek = +m 
(mod 27). 


Proof. We assume as usual 2 < a, 1 <n. For the proof in the = direction, 
suppose that k = 2nj + m. When j = 2i, we have Y(k) = Y(4ni + m) = +Y(m) 
(mod X(n)). In the case that j = 2i+ 1, we have Y(k) = Y(4ni + 2n +m) = 
+Y(m) (mod X(n)). So Yk) = +Y¥(m) (mod X(n)). 

For the proof in the = direction, assume that Y(k) = +Y(m) (mod X(n)). 
Choose k' such that 0 <k’ <n and k = +k’ (mod2n). Choose m’ such that 
0 <m' <n and m= +m (mod2n). Then from the assumption and Lemma 2.26 
in the direction already proven, —, we have Y(k’) = +Y(m’) (mod X(n)). Hence 
it follows that X(n)|Y(k’) + Y(m’). Therefore k’ = m', because if we suppose 
k' # m', then we would have 0 < |Y(k') + Y(m’)| < |Y(kK.) + Y(m’)| < YM —- 1) 
+ Y(n) < X(n), by Lemma 2.24. From k’ = m' we deduce k = +m (mod 2n). 


LEMMA 2.27. For A > 1. In order that C = Y,(B), it is necessary and sufficient 
that there exist natural numbers D, E, F,G, H, I and i such that 


(1) D? - (A? = 1)C? 


1, (4) E=(i+1)2C?, (7) H=C (mod F), 


(2) F?-(A?-1)E?=1, (5) G=A(modF), (8) H=B(mod2C), 


(3) I2-(G?-1)H?=1, (6) G=1(mod2C), (9) B<C. 
Proof of Sufficiency: Suppose there exist D, E, F,G, H, I,i satisfying (1)-(9). 


Then equations (1)—(3), Pell equations, imply the existence of numbers p, g, and r 
such that 


D=X,(p), C=Yip), F=X,(4q), 
E=Y,(q), I=X,(r), H=Y,(r). 


We also have 0<p<C and0<B<C. Hence the idea is to show B = p, by 
proving B =r= +p (mod2C). We can suppose 0 < C. Using the First Step 
Down Lemma, together with (4), we have 


CE = Y2(p)|Y4(a) = Yi(p)la > Cla. 
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To show that B = r (mod 2C) we use (6), (8) and the Congruence Rule (2.16) to 
get 


B=H=Y,(r) =Y,\(r) =r (mod2C) = B=r(mod2C). (2.28) 
By the Second Step Down Lemma, the Congruence Rule (2.15), by (5) and by (7), 
¥,(r) = Yo(r) =H = C = ¥,(p) (mod X4(q)) = r= £p (mod2q). 
But Clg. Hence r = +p (mod 2C). This plus (2.28) implies B = +p (mod 2C). 


Proof of Necessity: Suppose C = Y,(B). Let D = X,(B). Then (1) and (9) hold. 
Put q=B-Y,B) F =X,2q) and E = Y,(2q). Then (2) holds. Let 
m = B-Y,B) in the First Step Down Lemma (2.20). Then the First Step Down 
Lemma says 


¥,(B)’|¥,(B + ¥,(B)). (2.29) 


Hence C*|Y,(q). The Double Angle Formula (2.14) says 2X,(q)- Y,(q)|Y,(2q). 
Hence 2C’|E. Therefore (4) can be satisfied. Put G = A + F?(F? — A). Then (5) 
holds. Also (2) and (4) together imply that F? = 1 (mod2C). Then G =A + 
F*(F? — A) implies (6). Put J = X,(B) and H = Y¥,(B). Then (3) holds. From 
(2.16) H = ¥,(B) = B (mod G — 1). So by (6) H = B (mod2C). Therefore (8) 
holds. From (2.15) we have H = Y,(B) = Y,(B) = C (mod G — A). This together 
with (5) implies H = C (mod F’). Hence (7) holds. 


§3. Exponential and Binomial Coefficient. By Lemma 2.27, the 3-place relation 
y = Y,(n) is diophantine. Hence we may use it in showing that other relations are 
diophantine. We can also use the 3-place relation x = X,(n) for this purpose. It is 
also diophantine. (That x = X,(n) is diophantine follows from Lemma 2.27 and 
equation (2.2).) We will use both y=Y,(n) and x = Xm) in defining the 
exponential relation. 


Lemma 3.1. Suppose 1 <n, 2 <k. Fora sufficiently large, a > Y,(n + 1) 
k" = rem(X,(n) —(a—k)Y,(n), 2ak —k? —1). 


Proof. From (2.17) we have k < k” < (2k — 1)" < Y,(n + 1) <a. From this it 
follows that k +1<a, and hencea<a-k<a‘k+k-1l1=ak+(k4+)k—- 
k*—-1<ak + ak —k* —1=2ak —k?—1. Therefore k” < 2ak — k? — 1, 
k” is smaller than the modulus. But by congruence (2.18), k” = Xn) — 
(a — k)Y,(n) (mod 2ak — k* — 1). Since k” is less than the modulus, it is the 
remainder. 

Lemmas 2.27, 3.1 and (1.7) imply that the exponential relation, m =k” is 
diophantine. By Lemma 3.1, m = k” if and only if there exists an a such that 


m= X,(n) — (a —k) -Y,(n) (mod 2ak — k* — 1), 
m<a a>yY,(n+ 1). (3.1) 


We show next that the binomial coefficient is diophantine. This relation was 
first proved exponential diophantine by Julia Robinson [1952]. The basic idea of 
the proof is that- the binomial coefficients are just the digits in the base u 
expansion of (1 + wu)”, when uw is large enough. 
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LEMMA 3.2. For 0 < k, nandu > 2", 


(7) = rem| (+ Dl 7 
A ju}. 


uF 
Proof. Expand (u + 1)” by the Binomial Theorem and divide by u* to get, 
(u +1)" 


= S (Feta (Z) +E (tu (3.2) 


u i=k+1 


From (3.2) together with the inequality u'~* < 1/u, which holds for i < k — 1, we 
get 
= = 1 2 2" 
Ewes E(Ne<tE(-S<_ 63) 
=(0 i=0 u i=0 u 
Hence 
(u+1)" 


uk 


= (1 (mod u). 


So Lemma 3.2 follows from (7. <2” <u. 


Lemmas 2.27, 3.1, 3.2, 1.7 and 1.8 imply that the binomial coefficient is 
diophantine. We have m = (‘.] iff there exist u, x, and y, such that 


(ut 1)" =y-ukt!4+m-u* +x, 2” <u, x<u* and m<u. 
(3.4) 


We now use this to prove that the following relation < (bit masking ) is diophan- 
tine. 


DEFINITION 3.5. Let r and s be natural numbers written in binary, (base 2). Then 
r < s means each binary digit of r is less than or equal to the corresponding binary 
digit of s. 


The masking relation <_ is closely related to the operation of taking the logical 
and of two numbers. One can define < from logical and by 
<beaNb=a. (3.6) 
Logical and is definable from < by 
anNb=c #cxXa and axat+b—ce. (3.7) 


Another property of the masking relation is that r < s implies r < s. The masking 
relation also satisfies the axioms for a partial ordering, reflexivity, anti-symmetry 
and transitivity. 

The masking relation < is also diophantine. For the proof we will use the 
Theorem of E. Lucas [1878] on binomial coefficients modulo a prime. 


THEOREM 3.8 (Lucas [1878]). If p is prime, r,, 5, the base p digits of r and s, then 


" | 7 alee (mod p). (3.8) 


ry 
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Proof. It is easy to see, by induction on n, that this general form of Lucas 
Theorem, Theorem 3.8, is implied by the following special case (H. Anton [1869], 
approximately n = 1), of Lucas’ Theorem: 


LEMMA 3.9. Let p be a prime. Suppose 0 < b < pand 0 <d < p. Then 


en (e) (4) (mod p). (3.9) 


c‘pt+ad C 


Proof. Working in the polynomial ring Z [x], consider the coefficient of the 


term x°?*@ in the polynomial (1 + x)*?*°. From the equality (x + y)? =x? + y? 
we get 
ap+b b 
y [v * |x" =(1+x)?%=(14+x)"%(14+x)? = (14+ x7)" (14x)? 
n=0 


(Elbe El) Z EG er 


The coefficient of x°? +“ must be the same in both polynomials. However because 
of the inequalities 0 <i < b < p and0 <d < p, the equation jp +i =cp + dcan 
hold only when i = d and j =c. Therefore, the coefficient of the term x°?*? in 
(1 +x)*?*° must therefore be (2}(%). Hence (°° r | = (<| (3) holds in Z [x]. 
c]\d cp +d Cc d P 
Now we can prove that the masking relation < is diophantine. This will follow 
from 


LEMMA 3.10. 
r<s @ (> } = 1(mod 2). 


Proof. Put p = 2 in Theorem 3.8. Then 3.10 follows from the trivial relations 


jen (en (Bn (Q)-0 


Lemma 3.10 is proved. From this Lemma together with Lemma 3.10, 3.2, (1.7) and 
(1.8), it follows that < is a diophantine relation. 


§4. Arithmetization of register machines. We now have enough diophantine 
relations to prove that every recursively enumerable set is diophantine. Recall the 
definition of r.e. set: A set A is r.e. if and only if A is empty or the range of a 
recursive function. For the definition of recursive function, we will use the register 
machine, a model of computation equivalent to the Turing machine. A register 
machine is a “machine” with a finite program and a finite number of separately 
addressable registers, R1, R2,..., Rr. The registers are assumed unbounded, each 
register can contain an arbitrarily large nonnegative integer. A subset of the 
registers, say R,, R,,..., R,, (k <r), is designated as input registers and a subset, 
say R,, R,,...,R,,, (m <1), as output registers. This is to handle functions of k 
variables whose values may be m-tuples. For a function of one variable of course 
k = 1. Normally also m = 1. Usually R, is thought of as the input-output register. 

A register machine is actually a program, a list of commands written on lines 
labelled L1, L2,..., Ll. The register machine’s commands are normally executed 
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in sequence; however, the register machine may execute commands which tell it to 
transfer to a different location and begin execution there. 

For the computation of all recursive functions, it is sufficient to assume that the 
register machine is capable of adding or subtracting 1 from a register and of 
transferring on zero. 

The subtraction command can cause a problem if one attempts to subtract 1 
from an already zero register. For this reason Minsky [1967] permitted the 
subtraction command to occur only after a test for zero. His subtraction command 
therefore required two lines 


Li IF Rj = 0, GOTO Lk, (4.1) 
li+1 ELSE R<R, - 1. (4.2) 
It is sufficient to assume that the program is written in such a way that subtraction 


from a zero register never occurs. We will assume that our machine has the 
following one line commands: 


COMMAND INTERPRETATION 
Li GOTO Lk Transfer to line Lk. (4,3) 
Li IF 0 < Rj GOTO Lk Conditional transfer to line Lk. (4.4) 
Li Rj <— Rj + 1, Increment register Rj. (4.5) 
Li Rj — Rj - 1, Decrement register Rj. (4.6) 


In fact because it shortens our example and does not complicate the proof, we 
will allow several commands of type (4.5) and (4.6) to be written on one line, in 
parallel, when they refer to different registers. Below is an example of a register 
machine. 


EXAMPLE 1. A register machine which computes the x'" Fibonacci number, F,. 


1 IF R1 =0, GOTO L20 

L2 R2<— R2+ 1, R3<— R3 +1 
L3 R1i<« R1-1 

L4 IF R1 = 0, GOTO L16 

D5 R1<-R1-1 

L6 R4<-— R44+1,R5<—R5+1 
L7 R3 <— R3- 1 

L8 IF 0 < R3, GOTO L6 

L9 R4<— R441, R2<— R2-1 
L10 IFO <R2, GOTO L9 

Til R3<-R34+1, R4< R4-1 
T12 JFO<R4, GOTO L11 
T13- R2<—R2+4+1,R5<—R5-1 
114 IFO<R5, GOTO L13 
L1i5 IFO<R1, GOTO LS 

L1i6 R3<R3-1 

Li7 JFO<R3, GOTO L16 
l18 R2<- R2-1, R1< R1 +1, 
Li9 JFO<R2, GOTO L18, 


A function is recursive (computable) if and only if it can be computed by a 
register machine (Minsky [1961] [1967], Melzak [1961], Lambek [1961], Shepherdson 
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and Sturgis [1963].) The machine M of Example 1 computes the function f(x) = F.,, 
where F, is the xth Fibonacci number, (Fibonacci number with index x in the 
enumeration F, = 0, F, = 1, F.,, =F, + F,,,). If M is started on line L1 with x 
in register Rl and zeros in the other registers, then M eventually halts with 
f(x) = F, in register R1 and zeros in the other registers. We can understand this 
as a general definition of computability. 

To arithmetize the work of an arbitrary register machine, we will use digits of 
numbers written to a base Q, where Q is power of 2. Consider the register 
machine M of Example 1 or more generally any register machine with r registers 
and / lines in its program. Assume that M computes the total function y = f(x). 
During the computation the contents of a register at time ¢ can never exceed 
x +t. We will use the fact that after s steps in the computation, the contents of 
each register Ri is <x +5. 

Suppose that after s steps the value y = f(x) is obtained by M. Let 7, , denote 
the contents of register Rj at time ¢ during the course of the computation. If Q is 
large enough, we will have r;, < Q. So the numbers r, , may be considered to be 
the digits of a number written to the base Q. But we will need more room. We will 
require 27, , < Q, (for the reason see the explanation following (4.19)). Hence we 
will require that QO satisfy: 


x+s<Q/2, (4.7) 

1+1<Q, (4.8) 

Q pow2. (4.9) 

For example the number Q = 2**5t!t! will be large enough. To describe the 


location of the machine in the code at each time, let /; , be 1 or 0 according as we 
execute, or do not execute the instruction(s) at location Lj at time t. When the 
machine M computes, it will then generate numbers R, and L; where 


s Q 
R,= Dr,,0' [0 << z}; (4.10) 
t=0 
L; — » l, ,Q' (0 < l; t < 1) (4.11) 
t=0 


Here the index /j runs from 1 to r, where r is the number of registers. The index i 
runs from 1 to /, where / is the number of Jines in the program. 

It will be useful also to have a number J which when written to the base Q has 
all digits equal to 1. The geometric series will allow us to obtain such a number /. 


Ss 
If 1+(Q-1)1=Q°*', then J= YQ". (4.12) 
t=0 

To visualize the arithmetization, consider the machine of Example 1 which has 
r=5 registers and / = 19 lines in its program. When it is started on x = 2 
as input (i.e. with x = 2 in register R1), the machine will be seen to halt after 
s = 23 steps, with 1 in register R1 and zeros in the other registers. (This is be- 
cause F,=1 is the second Fibonacci number in the standard enumeration 
0,1,1,2,3,5, 8, 13,... .) During the computation of F,, the machine will generate 
the following numbers R,, R,, R3, R4, Rs, representing successive contents of the 
registers, in the sense of (4.10). These numbers, when written in base Q, would 
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look as follows: 
R, = 11000000000000000001 1222, 
R, = 001111111000000111111100, 
R, = 000011222221100001111100, 
R, = 000000000001122111000000, 
R, = 000000000111111111000000. 
Similarly, numbers representing the sequence of program locations L,, L5,..., Ly; 


will be generated during the computation of F,. In base Q they will look as 
follows: 


L,, = 000000000000000000000001 , 
L, = 000000000000000000000010, 
L = 000000000000000000000100, 
L, = 000000000000000000001000, 
L = 000000000000000000010000, 
L. = 000000000000000000100000, 
L, = 000000000000000001000000, 
Le = 000000000000000010000000, 
Lo = 000000000000000100000000, 
Lo = 000000000000001000000000, 
L,, = 000000000001010000000000, 
L,» = 000000000010100000000000, 
L, = 000000000100000000000000, 
L144 = 000000001 000000000000000, 
Ls = 000000010000000000000000, 
L 16 = 000010100000000000000000, 
L, = 000101000000000000000000, 
Lg = 001000000000000000000000, 
Lo = 010000000000000000000000, 
Ly = 100000000000000000000000, 


Notice that the generated numbers L,L,,..., L5), R>, R3, Ry, Rs; contain the 
entire history of the computation of f(2) = F, = 1. 

Now we show how to write down diophantine conditions on arbitrary variables 
L,, L,..., Lo, Ry, Ro, R3, Ry, Rs, 8,Q, x, y sufficient to force them to be these 
particular values. That is, we give Diophantine conditions on these unknowns 
which are satisfiable if and only if, on input x, the machine M produces the output 
value y = f(x). The unknowns in these diophantine relations will be s,Q, 
I,R,,...,R,,L,,..., £141. The quantities r and / will be constants, (e.g. r = 5 
and / = 19 in the example). After the conditions have been written down, the 
methods of §1, §2 and §3 may be used to translate them into diophantine 
equations in these same unknowns, and more unknowns. The variables x and y 
will be parameters in the resulting equations. 

The first four conditions will be (4.7), (4.8), (4.9) and (4.12). Then to force an 
arbitrary natural number R, to have the form (4.10), we will use the masking 
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relation 


Q 
R,<[5-1} (j=1,...,r). (4.13) 
Condition (4.13) is implied by condition (4.10), because Q is a power of 2. We also 
need to force exactly one digit to be 1 in every column of the L matrix. Since 
1 < QO by (4.8), we can use for this purpose the following two conditions: 


I+1 


I= VL,, (4.14) 
i=1 
L,=<I (i=1,...,/4+ 1). (4.15) 
To start the machine on line L1, we stipulate as a starting condition that 
1<JL,. (4.16) 


By means of GOTO instructions we can suppose there is only one stop command, 
and that it is located at the end of the program, on line L,,,. Then the condition 
for stopping the machine after s steps can simply be 


Lin, = O. (4.17) 


To simulate each GOTO command (unconditional transfer), Li GOTO Lk, we 
include a condition 


OL, < Ly. (4.18) 


This forces the ¢ + 1st digit of L, to be 1 whenever the ¢th digit of L, is 1. Here 
1<k <1+1. (Line L/ + 1 is permitted to be the target of a GOTO.) This same 
idea, (4.18) can be used also for the conditional transfer command (4.4), Li IF 
0 < Rj, GOTO Lk. (Assume k # i + 1 so the command is not trivial.) Here we use 
two masking conditions: 


The first masking condition forces transfer toline Li or line Li + 1. The second 
decides which one it will be. A diagram explains how this works: 

6 Q> Q* Q? Q? Q! Q° 
00000100000 100000100000100000100000 1. 
00000100000 100000100000100000100000 0 
* kK OK OK QO * * * Ox xe x x QO ex x x * OQ, 


QO * * * * x * ek Qk KOK * 


Here Q = 2° and s = 5 and the idea is that subtraction of 2R ; from QI/ pulls a bit 
from the Q‘*' position of Q-I when 0 <r, ,. Thus (4.19) implies that 0 <r, , if 
and only if /, ,,, = 1. The reason why we needed 27, , < Q and why we divided Q 
by 2 in (4.13) can now be understood. The purpose was to create the zeros between 
adjacent blocks of digits in 2R,;. The reason we use QJ — 2R, and not J — 2R; is 
that we could not be sure that J — 2, would be nonnegative. 

The command Li IF Rj = 0, GOTO Lk may be treated analogously. This 
command is just the opposite of (4.4), so it can be simulated with a masking 
condition like (4.19) but with the second occurrence of L, replaced by L,, ,. This 
has the effect of causing the machine to go to Li + 1 instead of Lk, when Rj > 0. 
(Again we are supposing that k #i+1 so that Li IF Rj = 0, GOTO Lk is 
nontrivial.) 
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While commands of type Li Rj <— Rj + 1 are not usually thought of as transfer 
commands, there is an implied GOTO associated with them; GOTO the next line. 
So, for each occurrence of such a command (more precisely for each line Li 
containing occurrences of such commands), we need one corresponding masking 
condition: 


OL, < Lix1- (4.20) 


Finally it is necessary to include for each register an equation which ensures 
that the contents of the register is equal to the tth Q-ary digit of the correspond- 
ing number Rj at each time t. We call these register equations. For the example 
program they would be: 


R, + yO°*' = QR, +x+ LOL, — LOQL,;, (4.21) 
k i 
R,=QR,;+ LOL, - QL,, (for j =2,3,...,r). (4.22) 
k i 


The k-sums are over all k for which there is an instruction of type Lk Rj <— Rj + 1, 
on line Lk. The i-sums are over all i for which the program has a command of 
type Li Rj <— Rj — 1 on line Li. 

The register equation for register R1 is different from the register equations for 
the other registers because R1 is the input-output register (assuming functions of 
1 variable). At time t = 0, register R, contains x. At time s it contains y. The 
other registers contain zeros at these times. 

If more registers are designated as input-output registers, for example if 
m > 1, then it is necessary to add a term x to the right side of (4.22) and a term 
yQ**' to the left side of (4.22), for the corresponding register equation. 

From these equations and conditions, it is not difficult to show, by induction on 
t,, that if machine M computes function f, then for all x, y, conditions (4.7), (4.8), 
(4.9), (4.12)-(4.22) have a solution in the unknowns s, Q, J, R,,..., R,, L4,...5 Lys, 
if and only if f(x) =y. Methods of §1, §2, and §3 then permit us to write 
conditions (4.7), (4.8), (4.9), (4.12)—(4.22) as polynomial equations in the unknowns 


s,Q,1,R,,...,R,, L,,...,£,;, and more unknowns. After we rename the un- 
knowns x,,X,,...,%,, transpose all the terms in the equations to one side and 
sum squares as in (1.5), we obtain a _ single polynomial equation 
P(x, Y, X41, Xo,...,X,) = 0 with the property that (Ax,, x5,..., x,) 


[P(x, y, X1,X>,...,xX,) = Ol if and only if f(x) =y. 

To complete the proof of Theorem 1, let A be any r.e. set. Suppose A # ©. Let 
f be a recursive function whose range is A, so that y € A iff (dx) [f(x) = y]. Let 
M be a register machine computing f. When A is a 1-ary relation, A(a,), we have 
a,&A iff (Ax) [f(x) =a,]. Thus a, €A if and only if Gx, x1, x2,...,x,,) 
[P(x, a1, %1,%2,...,x,) = 0]. When A is an m-ary relation, A(a,,a,...,4a,,), 
then we have (a,,a5,...,a4,,) €A iff (Ax) [f(x) = (a, a,,...,a,,)]. So P must 
include m copies of register equation (4.21), with yQ**! replaced by a,Q**' 
G@=1,...,m). Then A(a,,a,,...,a,) holds iff (Ax, x,,..., x,) 
[ P(x, A1,..+5 A, X15-.+,X,) = O]. Theorem 1 is proved. 


§5. Hilbert’s 10th problem is unsolvable. We give here a proof that Theorem 1 


implies the recursive unsolvability of Hilbert’s Tenth Problem. To shorten the 
proof T. Rado’s game [1962] will be used. 
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THEOREM 2. There exists no algorithm to solve Hilbert’s Tenth Problem. 


Proof. Consider the problem of programming a register machine to return the 
largest possible number in register R1 and stop. Suppose the registers are initially 
set to 0. Let R() be the largest possible number generable in this way, by an / line 
register machine with registers set initially to 0. 

Restricting subtraction commands to the (safe) type (4.1)-(4.2), and disallowing 
parallelization of commands, R(/) is evidently a well defined function of /. A finite 
set has a greatest element. There are only a finite number of / line register 
machines. 

Evidently we have R(1) = 1 and R(@) = 2. Also R is an increasing function of J, 
RC) < RU + 1). This can be seen by adding a new line, R1 < R1 + 1, to the end 
of an / line program. 

Define a binary relation S by putting S = {(k,1): k < R()}. Then 


(k,l) © S = (As)[some / line machine halts in s steps with contents of R1 > k]. 


(5.1) 
From (5.1) it follows that the set S is r.e. This set S is related to the function R by 
R(1) = mink[7S(k +1,D)]. (5.2) 


Apply Theorem 1 to S. The result is a polynomial P(k,1,x,,...,x,) with the 
property that 


S(k,1) = (Ax,, X2,...,%,)[ P(K, 1, x1, X2,...,X,) = 0]. (5.3) 
From (5.2) and (5.3) it follows that 
R(1) = min k[>(4x,,x5,...,x,)[P(K + 1,1, x1, %2,...,%,) = O]]. (5.4) 


Hence if Hilbert’s Tenth Problem were solvable, then R would be computable. 
But the function R grows very rapidly. In fact it is noncomputable. 


LEMMA 5.5. Let f be any computable function. Then for I sufficiently large, 
f@D < RM. 


Proof. Without loss of generality we can suppose that f is an increasing 
function. Let N be a register machine which computes the function f. Suppose N 
has c lines in its program. Let F be the machine obtained by adding the 
instruction R1 <— R1 + 1 to the end of N. Then F has c + 1 lines and it computes 
the function f(x) + 1. 

Let D be a 5 line register machine with the property that when started with x 
in R2 places 2x in R1 and stops. D could be the machine, 


L1 IF R2=0, GOTO L6, 
R2< R2-1, 
R1< R1 +1, 
R1< R1 +1, 
GOTO L1, (5.6) 


For each x, let M, be a register machine with x lines and the property that 
when started with 0 in R2 produces x in R2. For example M, could be taken to 
be a straight line program consisting of x copies of R2 <— R2 + 1. 
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Now consider the register machine F(D(M,)). By the notation F(D(M,)) is 
meant M, followed by D followed by F, in that order. F(D(M,,)) is a register 
machine with x + 6+ c lines in its program. It has the property that for each x, 
when started with registers initially set to zero, it will produce the number 
f(2x) + 1in R1, then halt. The existence of this machine proves that 


f(2x) < R(x +640) (5.7) 
for each x. When x > 6+, we have x + 6+ c < 2x. Hence 
f(x+6+c) <f(2x), (5.8) 


because f is increasing. Now put / =x + 6+ c. Then for / > 12 + 2c, (5.7) and 
(5.8) imply f() < RW). Lemma 5.5 is proved. Hilbert’s Tenth Problem is unsolv- 
able. 


§6. Every computable function is a polynomial. Unsolvability is a negative 
result, but the solution of Hilbert’s Tenth Problem has many positive conse- 
quences. One is the existence of polynomials whose set of positive values coincides 
with any given r.e. set. This result is well known, so we shall prove a stronger form 
of it which at first glance appears even more surprising. 


THEOREM 3. Let f be any computable function. Then there exists a polynomial Q, 
with integer coefficients, such that for all nonnegative integers x and y 


f(*) =y © (Ax, X15---5 X_)[QO(%, Xo, Xy5---5 Xn) = YI. (6.1) 


Proof. We apply Theorem 1 to the graph of f (which is an r.e. set), and then 
use a trick due to H. Putnam [1960]. Starting from the polynomial P which 
Diophantine defines the graph of f, and using also the fact that f is nonnegative, 
(Vx) [ f(x) = OI, we get 


f(x) =y @ (4x,,...,%,)(P(%, y, x1,...,%,) = 0), 
= (Ax9,...,%,)(1 — P(X, Xo, X15---5%X,)° > Oand x, = y), 


= (Ax9,...5%,)((Xo + 1)(1 — P(X, Xo) X1y-005Xn) | =yt 1). 


In the <= direction we are using the fact that 1 — P(x, Xo, X4,.--5%X,)° > 0 
implies P(x, X9,xX,,...,%X,) = 0 which implies f(x) =x ). So we can put 
Q(x, X,-..,X,) = (%) + Dl — Px, Xo, x%1,...,%,)°] - 1. 

To illustrate Theorem 3, we could apply it to the function f(x) = F,, whose 
range is. the set of Fibonacci numbers. In this case the theorem would give us a 
polynomial Q(x, x,,..., x, such that for all x, y 


F, =y @ (AXxo,...,X,)[Q(%, Xo,---,X,) =YI- (6.3) 


This would be a Fibonacci number representing polynomial which gives the 
sequence in order, as a function of x. 

The proof of Theorem 1 shows how to construct such a polynomial. If we use 
this proof, the number n of unknowns will be rather excessive. By another 
construction, however, it can be shown that we need only nine unknowns. n = 9 
here and generally in Theorems 1 and 3. This result, the nine unknowns theorem, 
is a theorem of the second author. A proof can be found in a paper of the first 
author [1982]. 
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1. Introduction. Our goal is to show how some problems in quantum chemistry 
arise naturally as minimization problems involving N X N matrices in the Hilbert- 
Schmidt norm. These minimization problems have the added feature that the 
solutions remain solutions when the Hilbert-Schmidt norm is replaced by any other 
of the “usual” norms. One outgrowth of our discussions will be a scientifically 
motivated procedure for converting a basis into an orthogonal one; the Lowdin 
procedure will arise “naturally” and will have a nice property not shared by the 
better-known Gram-Schmidt method. 

In Section 2 we review some aspects of the theory of linear operators on a finite 
dimensional Hilbert space. This is the “linear algebra” part of this paper. Sections 
3 through 6 contain the main results. 


2. Finite dimensional operator theory. Let (¥,<-,- ), || - ||) be a finite dimen- 
sional complex Hilbert space. Let N be its dimension. We may suppose for 
concreteness that #= C”™. Let A(#) be all the linear operators on #; that is, 
BH) = M,C) consists of all N X N complex matrices. For A © @#) let 
|A| = (A*A)!/7. By the spectral theorem, 


N N 
IA|= Viaje ®e, = VA, :,e,¢e,, (2.1) 

n=1 n=1 
where A, > +: >A, > Oare the eigenvalues and e,,..., e,, are the correspond- 


ing normalized eigenvectors of |A|. Here e,, @ e,, is the orthogonal projection onto 
the span of e,. Because || Af ||’ = (Af, Af) = <A*Af, f) = I|IAIfII?, we can write 

A=UIA|, (2.2) 
where U is a unitary operator from the range of |A| to the range of A. The 
decomposition (2.2) is called the polar decomposition of A (cf., e.g. [7]). It reduces 


to z = e'®|z| in C’. When A is injective, U is unitary on #. But U can always be 
710 
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viewed as being a unitary operator on & by extending it in any way as a unitary 
operator from Ran(|A|)+ = Ker(|A|) to Ran(A)+ = Ker(A*). However, U is 
uniquely determined as a unitary operator on & only when Ais injective. 

Let J, be the largest index n such that A, > Oforn <J,.(Thus Jp = 0;J, =N 
if and only if A is injective; and 2 <J, < N — 1 inall other cases.) Using (2.1) and 
(2.2) we can write A uniquely in the form 

J J 
A — » Sn( A) Sh ® e,) — > S,( A)< ; Cn tn (2.3) 
n=1 n=1 
where s,(A) = A, is the nth singular number of A and f,, = Ue,. Replacing J, by 
N would make no difference since s,(A) = 0 for J, <n < N. The vector s(A) = 
(s,(A),..., 5,(A)) is called the singular sequence of A. 
Let E be C™ made into a Banach space under a norm v which satisfies 


V((X1,---5Xy)) = V(E Xa. - > ENX en) 
for every (x,,...,Xy) © C%, every permutation 7 of {1,..., N}, and every choice 


of «, © C such that |e,|| = 1 for 1 <j <N. Now make @(C%) into a Banach 
space, denoted by &,, under the norm 
Alle = »(s(A)). 
These spaces were introduced by Schatten and von Neumann (cf. [12]). When 
E =I1?, so that v((x,,..., x\)) = (L9_,|x,,/"}'/” or max, |x,|, according as 1 < p 
< © or p = %, the corresponding space @, is usually denoted by @,. Thus for 
1<p<®»®, 
Jy 1/p 


Alle, = | Y s,( A)? 


n=1 


and ||Allz, = s5,(A) = ||All = supf{llAflk Ilfll< 1}, which is the usual norm on 
BH). 


The norms in the spaces @, are all unitarily invariant in the sense that 
[UAV lle, = llAlle, 


whenever U and V are unitary. The space @, is a Hilbert space under the inner 
product 


(A, B)2 = tr( B*A), 


where “tr” stands for trace. The norm in @, is called the Hilbert-Schmidt norm. 
For A = (a,,) (with respect to a fixed orthonormal basis), 


N 
|Allz, = <A, Ad2 = ZY la,il’. 
i,j=1 


Finally we shall have occasion to refer to Mirsky’s inequality [11], which states 
that for 4, BE MAF), 


¥ |s,(A) — 5,(B)| < ¥|s,(A — B) 


whenever 1 < m < N. (For infinite dimensional generalizations involving compact 
operators, see Markus [10].) 
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3. A computational problem from quantum theory. Suppose that a chemist 
gives us an operator A in @(#H) (with H= C% as before). We are told that A is 
a finite dimensional approximation of a Hamiltonian and we are asked to find its 
ground state (or lowest) eigenvalue and eigenvector. We note that A is not 
self-adjoint. The chemist says “sorry,” this is a result of some small errors due to 
the computer, so we are instructed to replace A by the self-adjoint operator of 
which it is a small perturbation. But which operator is that? Seemingly we are to 
find the self-adjoint operator nearest to A in some reasonable sense. 

Picture our arbitrary operator A in @(#) as being represented by a point in 
the complex plane, and think of the real line as representing the self-adjoint 
operators on &. 


Re( A) 


The picture suggests that the nearest self-adjoint operator to A is Re(A) = 
(1/2 A + A*). This is in fact true if we measure the distance between operators 
by any of the unitarily invariant norms introduced in Section 2. The proof is as 
follows. If S = S* in @(H#), then 


4 — Re( A)ll4, =[2(4 — S) + 2(S — A*) lle, 
< WIA — Sle, + allS — Atle, = 114 — Slleg 
since ||B*llz, = IIBllz,. QED. 


When is Re(A) the unique solution? Answer: Whenever the norm in @, is 
strictly convex.’ Moreover, the above diagram is a truly correct picture in the 
Hilbert-Schmidt case. Think of &@, as a real Hilbert space of dimension 2N*. The 
self-adjoint operators on W form a real subspace ~ of @, of dimension N*. The 
orthogonal projection P , of &, onto .” is given by P_p= Re, ie., 

P(A) = H(A + AP). 
Thus P_, is the unique nearest point in .~ to A since the orthogonal projection is 
also the nearest point map. Thus the above picture is justified. 

Re(A) is the unique solution in , for 1 < p < », but uniqueness fails in 7%, 


and in & (= GA) with its “usual” norm). 
The above results generalize and simplify the discussion in [5]. 


‘A norm | - | is strictly convex whenever |x + y| = |x| + ly| implies that either x = ty or y = &x 
for some ¢t > 0. 
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4. The Lowdin problem. Theoretical chemistry considerations associated with 
basis set optimization and with the understanding of properties in terms of 
electron pair (chemical bond) transferability lead to the following mathematical 
problem." 

We are given a basis {f,,..., fy} of unit vectors for #. (In quantum mechanics 
all states are represented by unit vectors.) These vectors contain much information 
(namely, the information from the known subsystems). In order to do theoretical 
and computational studies it is necessary to orthogonalize the {f,,}. This is done by 
an orthogonalization, that is, an operator B € @(#) such that Bf, = e, satisfies 
(€,>€m) = Omn = 1 or 0 according as m =n or m #n, for 1 < m,n < N. But one 
wishes to change the {f,} as little as possible because changing the {f,} results in a 
loss of information. How can B be chosen in an optimal way? 

A natural approach to this problem is to choose B to minimize the least squares 
deviation 


N 
2 
DY Mfp - enll’: 
n=1 


Writing B = UL (with L = |B|) as the polar decomposition for the invertible 
operator B we have 


N N 
Y If, -— ell’ = ¥& Bote, - e, ll? 
n=1 n=1 
N 
= Y(t! - v)u~'e, |? = I|L-! — Ulla, 
n=1 
since 
N N 
|Allz, = tr(A*A) = ¥ (A*Ag,, 2,) = ¥ IlAg ll’ 
n=1 n=1 
holds for any orthonormal basis {g,,..., gy}. 


Replace B by another orthogonalization B, where Bf, =é, for all m and 
(é,,€,,) = 6,,, for all m,n. Since 
UL Vo. 
t, —_—? Cn id Cn 
and the linear map V from e, to é, for all n is a unitary operator, the invertible B 
is given by V(UL) =(VU)L, which is its unique polar decomposition. Thus 
L = |B|(= |B|) is independent of the orthogonalization B, and varying B = UL is 
equivalent to varying U over Y(N), the group of all unitary operators on C%. Thus 


N 
min (lf, —e,l’ = min ||L7!—- Ullz,. (4.1) 
B y=] UEWN) 

We now “solve” this minimization problem pictorially (as in Section 4). In the 
figure below, the unit circle represents the unitary operators in @(N) and the 
positive real axis represents the positive self-adjoint operators. Thus L™! is 
represented by one of the fat dots. Geometrically the point on the unit circle 


"The point is that real issues in quantum chemistry motivate our mathematical problem. For some 
of the motivation, see [3], [5], [8]. 
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closest to either dot is the point (1,0). Thus we conjecture that the minimum (4.1) 
is achieved for B = L or, equivalently, for U = J. This is in fact correct. 


IR 


There are several ways to prove this. Here is a calculus proof taken from [1], [2]. 
Let A = A* be a fixed positive self-adjoint operator on #= C”. Let 


F(U) =||A — UZ; 


F: @N) — [0,) is a smooth map on a compact manifold. If it has a local 
minimum at U, € @(N), then F’(U,) = 0. Let P be any rank one self-adjoint 
projection on # and let f(t) = F(e"”U,). Then since the smooth function f: 
R — R has a local minimum at t = 0, we must have f’(0) = 0, f"(0) > 0. Using 


d | | 
f(t) = a trl(A — ey) *(A — e-"?US)], 


and differentiating inside the trace symbol, a short calculation (cf. [1]) yields the 
following restrictions on the spectrum of U): 


f'(0) = 0 for all P implies o7(U,) CR; 
f"(0) = 0 for all p implies Re a7(U,) <[0,~). 


Thus, since U, is unitary, o(U,) = {1} and U, = I. 

Here is a recipe for finding L (or L~'). Recall that { fis..+> fry} is given. The 
Gram matrix is the N x N matrix G = (g,,) defined by g,, = ¢f;, f;). This is a 
positive definite matrix and L = G-'”* (or L7! = G'’”). 

The calculus method used above works for other norms as well. Here is a 
sample result. Let 1 < p < ~ and let 


F,(U) =||A— Ullp, 


where A is a given positive self-adjoint operator on C% and U varies over Y(N). 
Then 


minF,(U) = F,(1) 
U 
for 1 < p < ~. Moreover, when 1 < p < ~,U = J is the only local minimum of F, 


(but this is not the case when p € {1,} and N > 2). See [2] for more details and 
for infinite dimensional versions. See also Section 6 for additional results. 
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The minimization problem solved in this section goes back to the Swedish 
quantum chemist Per-Olof Lo6wdin [9]. That’s why we used the letter L. The 
Léwdin orthogonalization (.e., B = L) is a symmetric orthogonalization in two 
senses. First L is symmetric, i.e., self-adjoint. Next, if the given basis {f,,..., fay} is 
permuted to become {f_,,..., fa}, then the same B = L results, and the new 
“nearest” orthonormal basis we obtain is {e,,,...,e,,} where {e,,...,e,} is the 
original Lowdin orthonormal basis (i.e., e, = Lf, for all n). So the Léwdin 
orthogonalization procedure is symmetric with respect to permutations. This is a 
clear indication of how the Loéwdin procedure differs from the Gram-Schmidt 
orthogonalization procedure. 

As a pedagogical aside we mention that we advocate the teaching of the Lowdin 
orthogonalization method in linear algebra courses since (among other reasons) it 
is easy to motivate by physical considerations. 


5. A more general approximation problem. Let {f,,..., fy} be a given basis of 
unit vectors for #W= C%. It is desired to replace {f,} by an orthonormal basis 
{€,,...,@,} such that 

(i) {e,,..., ey} is close to {f,,..., fry}; 

(ii) {e,,..., €y} is close to a given orthonormal basis {g,,..., gy}. 

If constraint (ii) is missing, this reduces to the Léwdin problem stated and 
solved in the previous section. Condition (ii) may be present to test a hypothesis 
that some competing chemical theory is right (or wrong). For more background 
and a fuller discussion see [5]. 

Let 0<b<1. Let Bf, =e, for n = 1,...,N, where {e,} is an orthonormal 
basis. Define 


N N 
F,(B) =b YX |lf, —e, ll? + —5) ¥ lle, — ell’. 
n=1 n=1 


F,, is the Lowdin functional previously considered. Decreasing b makes constraint 
(ii) more important relative to (i). 

We want to minimize F,(@) as @ varies (with b fixed). For b = 0,1 we know 
the answer, although the problem is of no physical (or mathematical) interest when 
b = 0. As before write B= UL for the unique polar decomposition of the 
orthogonalization B. The particular orthogonalization sending f,, to g, for each n 
we denote by WL. Then 


N N 
F,(B) =b ¥ ||L~'U~'e, — e, |) + (1 + 6) & IIWLF, — ULF, II 


n=1 n=1 
_ y 

=bY |(L7! - v)u7e, I? + A —b) IW - UYLA, I 
n=1 n=1 


= b|L-' — ullz, + (1 — b)IlW — UZ. (5.1) 


Minimizing F,(B) as B varies is the same as minimizing the right-hand side of 
(5.1) as U varies over ®(N). In this case the positive self-adjoint operator L~! 
and the unitary operator W are given and fixed. 
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A more general problem can be formulated as follows. Given A,,..., A, in 
BH) and b,,..., by, in R such that b, > 0 and ©*_, b, = 1, minimize 


M 
GU) = ¥d,I|A, — UIl%, 


n—-1 


as U varies over Y@(U). The calculus approach of the previous section can be used 
(cf., [5]). Here is a simple direct Hilbert space argument. 
By the law of cosines, 


G,(U) 


M 
Y 4,{llA lz, + llUilz, — 2Retr(U*A,,)} 
n=1 


M M 
N+ ¥|,||A,IIZ, - 2 X¥ b, Re tr(U*A). 


n=1 n=1 


Thus minimizing G,(U) is equivalent to maximizing 
H,(U) = Retr(U*C), 


where C = ©™__b,A, (since G,(U) + 2H,(U) is a constant, independent of U). 

Let C = V|C| be the polar decomposition of C. (Note that V need not be uniquely 

determined as a unitary operator since C need not be injective.) The maximum 

value of H,(U) is clearly ||Cllz,; it is attained whenever U = V on the range of C. 

Thus the minimization problem for G, is solved, uniquely so when C is injective. 
In the context of F,(B) as in (5.1), C becomes 


C=bL~'+(1+))W. 


If LW = WL, then C is a normal operator with spectral decomposition 


N 
C= > Nt BA, 
n-1 
for some orthonormal basis {h,,...,,} and complex constants {n,,...,7,}. The 
minimizing U for F,(B) is given by 
Uh, = £,h, 


for n€ K ={neé ({1,..., N}: 7, #0}, where £,=7%,/l|n,|, while {Uh,: n € 
{1,..., N} \ K} can be an arbitrary orthonormal basis for {h,: n © K}~. 


6. The Arazy approach. In the fall of 1984, Jonathan Arazy (of Haifa, Israel) 
visited Tulane University and made a number of interesting observations [4] on 
Lowdin orthogonalization and related topics. Part of Section 5 is due to him; in 
particular he noted that the minimum value of F,(B) can be given explicitly in 
terms of @, and @, norms. 

More interestingly, Arazy noted that Mirsky’s inequality (see Section 2) implies 
that U =I gives the solution of Lowdin problem in every unitarily invariant norm 
Il: Ile 
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Let v, E be as in Section 2. Let A € @(C”) and let A = U|A| be its polar 
decomposition. For any V € @(N), 


|A -Vlle, = v(s(A —V)) _ by definition 

v(s(A) —s(V))_ by Mirsky’s inequality 

v((s,(A) — 1,...,5y(A) — 1)) since s(V) = (1,...,1) 
| [s(4) O O\| 


WV 


O s(4f LO tiie 
= ||A — Uplle,; (6.1) 


here, as before, {e,} is the orthonormal basis of eigenvectors of |A|, (f,} = {Ue,}, 
and the matrices in (6.1) are taken with respect to the row basis {f,} and the 
column basis {e,}. This way of writing A is nothing but a restatement of (2.3). Thus 


|A —Vle, > IIA — Uplle, 


holds for all Ve @(N) and for all U) © WN) which exchange the {e,} with the 
{f,}, in order, in the polar decomposition of A. When A is injective, U, is thus 
uniquely determined; in fact the polar decomposition of A is A = U,|A|. This 
shows that the Léwdin solution is valid for all the norms @, on M(H). 

When A is injective, is U, the unique solution of the minimization problem in 
the @, norm? This depends upon an analysis of when equality holds in Mirsky’s 
inequality. In particular, this depends on v. 

This approach via Mirsky’s inequality can be adapted to the problem of Section 
5 as well. For instance, suppose we want to minimize 


M 
Yb, FIA, — Ulle,) 
=1 


as U varies over YN). Here b, > 0,4*_,b, = 1, and F, is some suitable 
functional related to (E,v). For instance, when &, = @, for 1 <p <, take 
F.(s) = |s|’. When p = 2, this reduces to the problem of Section 6. For 1 < p < ~, 
we get a unique solution to the analogous problem (when C = ©”, b_A,, is 
injective), and thé @, solution coincides with the @, solution. 
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The Fifty-First William Lowell Putnam Mathematical Competition 


LEONARD F. Kxosinsk1, Santa Clara University 
GERALD L. ALEXANDERSON, Santa Clara University 
Loren C. Larson, St. Olaf College 


The following results of the fifty-first William Lowell Putnam Mathematical 
Competition, held on December 1, 1990, have been determined in accordance with 
the governing regulations. This annual contest is supported by the William Lowell 
Putnam Prize Fund for the Promotion of Scholarship, left by Mrs. Putnam in 
memory of her husband, and is held under the auspices of the Mathematical 
Association of America. 

The first prize, $5,000, was awarded to the Department of Mathematics of 
Harvard University. The members of the winning team were: Jordan S. Ellenberg, 
Raymond M. Sidney, and Eric K. Wepsic; each was awarded a prize of $250. 

The second prize, $2,500, was awarded to the Department of Mathematics of 
Duke University. The members of the winning team were Jeanne A. Nielsen, Will 
A. Schneeberger, and Jeffrey M. Vanderkam; each was awarded a prize of $200. 

The third prize, $1,500, was awarded to the Department of Mathematics of the 
University of Waterloo. The members of the winning team were Dorian Birsan, 
Daniel R. L. Brown, and Colin M. Springer; each was awarded a prize of $150. 

The fourth prize, $1,000, was awarded to the Department of Mathematics of 
Yale University. The members of the winning team were Thomas Zuwei Feng, 
Andrew H. Kresch, and Zhaoliang Zhu; each was awarded a prize of $100. 

The fifth prize, $500, was awarded to the Department of Mathematics of 
Washington University. The members of the winning team were William Chen, 
Adam M. Costello, and Jordan A. Samuels; each was awarded a prize of $50. 

The five highest ranking individual contestants, in alphabetical order, were 
Jordan S. Ellenberg, Harvard University; Jordan Lampe, University of California, 
Berkeley; Raymond M. Sidney, Harvard University; Ravi D. Vakil, University of 
Toronto; and Eric K. Wepsic, Harvard University. Each of these was designated a 
Putnam Fellow by the Mathematical Association of America and awarded a prize 
of $500 by the Putnam Prize Fund. 

The next seven highest ranking individuals, in alphabetical order, were Andrew 
H. Kresch, Yale University; Samuel A. Kutin, Harvard University; Royce Y. Peng, 
Harvard University; Eric M. Rains, Case Western Reserve University; Jeffrey M. 
Vanderkam, Duke University; Samuel K. Vandervelde, Swarthmore College; and 
Michael E. Zieve, Harvard University. Each was awarded a prize of $250. 

The following teams, named in alphabetical order, received honorable mention: 
California Institute of Technology, with team members Tien-Yee Chiu, Russell A. 
Manning, and Robert G. Southworth; the University of California, Berkeley, with 
team members Brian J. Birgen, Jordan Lampe, and Thomas S. Lumley; Mas- 
sachusetts Institute of Technology, with team members Christos Athanasiadis, 
Andrew Chou, and David B. Wilson; Stanford University, with team members, 
Daniel P. Cory, Gregory G. Martin, and Andras Vasy; and Swarthmore College, 
with team members Olaf A. Holt, Robert E. Marx, and Samuel K. Vandervelde. 

Honorable mention was achieved by the following thirty-five individuals named 
in alphabetical order: Eric M. Boesch, University of Maryland, College Park; 
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Hubert L. Bray, Rice University; Daniel R. L. Brown, University of Waterloo; 
Michael J. Callahan, Harvard University; David B. Carlton, Harvard University; 
Helmut R. Celina, Davidson College; Nickolai I. Chavdarov, Brandeis University; 
William Chen, Washington University, St. Louis; Mark T. Chrisman, University of 
California, Davis; Bryan F. Clair, University of California, Berkeley; Brian D. 
Conrad, Harvard University; Daniel B. Finn, University of Rochester; Joshua B. 
Fischman, Princeton University; Mikhail Grinberg, Massachusetts Institute of 
Technology; Alex Gurevich, University of Maryland, College Park; Richard S. Kiss, 
Simon Fraser University; John C. Loftin, Stanford University; Gregory G. Martin, 
Stanford University; David K. McKinnon, Harvard University; Jeanne A. Nielson, 
Duke University; David M. Patrick, Carnegie Mellon University; Alexander R. 
Pruss, University of Western Ontario; Jordan A. Samuels, Washington University, 
St. Louis; Will A. Schneeberger, Duke University; Lawren M. Smithline, Harvard 
University; Robert G. Southworth, California Institute of Technology; Colin M. 
Springer, University of Waterloo; Jun Teng, California Institute of Technology; 
Andras Vasy, Stanford University; Martin M. Wattenberg, Brown University; 
David B. Wilson, Massachusetts Institute of Technology; Michael P. Wolf, Harvard 
University; and John H. Woo, Harvard University. 

The other individuals who achieved ranks among the top 98, in alphabetical 
order of their schools, were: Boston University, Michael G. Szydlo; University of 
British Columbia, Gregory F. Wellman; University of Calgary, Geoffrey T. Falk; 
California Institute of Technology, Ian Agol, Alan I. Knutson, William M. Watson; 
University of California, Berkeley, Stephen P. Bard, Thomas S. Lumley, Max L. 
Shireson, Zheng Yin; Carleton University, Adam M. Logan; Carnegie Mellon 
University, Sanjay Khanna; University of Chicago, David J. Pollack, Adrian 
Tanner; Columbia University, Andrew Mogilyansky; Cornell University, Isaac J. 
Kuo; Harvard University, Daniel E. Gottesman, F. Dean Hildebrandt, Roger W. 
Lee, Andrew P. Lewis; Harvey Mudd College, Guy D. Moore; University of 
Illinois, Urbana-Champaign, David E. Bekman; University of Maryland, College 
Park, Lev Novik; Massachusetts Institute of Technology, Andrew Chou, Edward B. 
Hontz, Michael J. Lawler; Michigan State University, Thomas P. Hayes, Jacob R. 
Lorch; University of Minnesota, Twin Cities, Wei Shen; University of Missouri, 
Rolla, Xi Chen; Mount Allison University, Eugene Fink; New York University, 
Daniel J. Bernstein; State University of New York, Stony Brook, Jason Israel; 
Princeton University, Timothy Y. Chow, Gregory D. Landweber; Queens Univer- 
sity, Alex Grossman; Reed College, Nathaniel J. Thurston; Stanford University, 
Beesham A. Seecharan, Jay A. Shrauner, Garrett R. Vargas; Swarthmore College, 
Olaf A. Holt; University of Texas, Austin, Bryan W. Taylor; University of Toronto, 
Nima Arkani-Hamed; Trinity College, Hartford, Marshall A. Whittlesey; Washing- 
ton State University, Julie B. Kerr; Washington University, St. Louis, Peter H. 
Berman, Adam M. Costello, Jeremy T. Tyson; University of Waterloo, Michael A. 
Buckley, John Daniel Christensen; and Yale University, Thomas Zuwei Feng, 
Evan M. Gilbert, Zhaoliang Zhu. 

There were 2347 individual contestants from 380 colleges and universities in 
Canada and the United States in the competition of December 1, 1990. Teams 
were entered by 289 institutions. 

The Questions Committee for the fifty-first competition consisted of George E. 
Andrews, Paul R. Halmos (Chair), and Kenneth A. Stolarsky; they composed the 
problems listed below and were most prominent among those suggesting solutions. 
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PROBLEMS 
Problem A-1. Let 


and for n > 3, 
T,=(n+4)T,_, — 4nT,,_, + (4n — 8)T,,_3. 
The first few terms are 
2,3, 6, 14, 40, 152, 784, 5168, 40576, 363392. 


Find, with proof, a formula for T,, of the form T, = A,, + B,, where (A,,) and (B,,) 
are well-known sequences. 


3 3 
Problem A-2. Is V2 the limit of a sequence of numbers of the form Vn — Vm, 
(n,m =0,1,2,...)? 


Problem A-3. Prove that any convex pentagon whose vertices (no three of which 
are collinear) have integer coordinates must have area > 5/2. 


Problem A-4. Consider a paper punch that can be centered at any point of the 
plane and that, when operated, removes from the plane precisely those points 
whose distance from the center is irrational. How many punches are needed to 
remove every point? 


Problem A-5. If A and B are square matrices of the same size such that 
ABAB = 0, does it follow that BABA = 0? 


Problem A-6. If X is a finite set, let |X| denote the number of elements in X. 
Call an ordered pair (S, T) of subsets of {1,2,...,} admissible if s > |T| for each 
s € §, and t > |S| for each t € T. How many admissible ordered pairs of subsets 
of {1,2,...,10} are there? Prove your answer. 


Problem B-1. Find all real-valued continuously differentiable functions f on the 
real line such that for all x 


(F(x) = (FY + (F()’) ae + 1990. 


Problem B-2. Prove that for |x| < 1, |z| > 1, 
(1 —z)(1 -— zx)(1 -— zx?) +++ (1 -— zx!) 


(z—x)(z—x*)(z —x°) +++ (z- x!) 0. 


1+ } (14+!) 
j=l 


Problem B-3. Let S be a set of 2 X 2 integer matrices whose entries a,, (1) are 
all squares of integers, and, (2) satisfy a;; < 200. Show that if S has more than 
50387 (= 154 — 157 — 15 + 2) elements, then it has two elements that commute. 


Problem B-4. Let G be a finite group of order n generated by a and b. Prove or 
disprove: there is a sequence 


81> 82> 835--- §2n 
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such that 
(1) every element of G occurs exactly twice, and 
(2) g,,, equals g,a or g,b, for i = 1,2,...,2n. (Interpret g,,,,, as g,.) 


Problem B-5. Is there an infinite sequence dy, a@,,a5,... of nonzero real num- 
bers such that for n = 1,2,3,... the polynomial 
D,(X) = @y) + a,x + anx* + +++ +a,x" 
has exactly n distinct real roots? 


Problem B-6. Let S be a nonempty closed bounded convex set in the plane. Let 
K be a line and ¢ a positive number. Let L, and L, be support lines for S parallel 
to K, and let L be the line parallel to K and midway between L, and L,. Let 
B.(K, t) be the band of points whose distance from L is at most (t/2)w, where w 
is the distance between L, and L.,. What is the smallest ¢ such that 


SO ()B;(K,t) #@ 
K 


for all S? CK runs over all lines in the plane.) 


Support line L, / > a ay 
: meal = - 


SOLUTIONS 


In the 12-tuples (1,5, 9,..., 9, _,) following each problem number below, n, 
for 10 > 7 > 0 is the number of students among the top 199 contestants achieving 7 
points for the problem and n_, is the number of those not submitting solutions. 


A-1 (150, 9, 1, 0, 0, 0, 0, 0, 1, 1, 6, 33) 
Solution. The formula for T,, is 
T, = nit 2”. 


This can be verified by induction. Alternatively, set t, = n!+ 2”. Clearly tj = 2 = 
Ty, t, = 3 =T, and t, = 6 = T,. Also, 


t, —nt, , = 2" -—n2""'. 


Support line L, 


Now 2” and n2”7! are both solutions of the recurrence equation 
tn ~ 4fn—-1 + 4fn-2 = 9, (*) 
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which is easily shown by direct substitution. Therefore since t, — nt, _, is a linear 
combination of solutions to (*), it must also be a solution. Consequently, 


(t, _ nt, 1) _ A(t,—1 _ (n 7 1)t,, >) + A(t, 2 _ (n 7 2)t,—3) — 0, 
OT 
t,=(n+ 4)t,_, — 4nt,_. + (4n — 8)t,_3: 


Hence ¢t, = T,, because they are identical for n = 0,1,2 and satisfy the same 
third-order recurrence (*) for n > 3. 


A-2 (63, 25, 16, 4, 0, 0, 0, 4, 5, 6, 21, 57) 


Solution. Since 


Vin +1) + Yin + 1yn + Vn? 


3 3 
it follows that yn + 1 — Vn > 0 as n > ~, and hence that there are arbitrarily 
3 3 3 3 3 3 
small numbers of the form Vn — Vm. Since k(Vn — Vm) = Vki3n — Vkim, it 


follows that the set of numbers of that form is closed under multiplication by 
arbitrary positive integers. The preceding two sentences imply that the set of 
numbers of the form under consideration is dense, and hence that every real 
number is a limit of a sequence of such numbers. 


A-3 (4, 4, 4, 0, 0, 0, 0, 0, 22, 36, 76, 55) 


Solution. By Pick’s formula, the area is J + B/2 — 1, where J is the number of 
internal lattice points and B is the number on the boundary. Clearly, J > 0 and 
B>5. If 1 > 1 we are done. 

If J = 0, then separate the vertices v,,U,, U3, V4, U, into four classes according 
to the parity of their coordinates. At least one class must have at least two 
elements, say v, and v,. Hence the mid-point $(v, + v,), is also a lattice point; 
call it vy. Since J = 0, U_ is on the boundary of the pentagon. Now consider the 
five points {v,, Vz, U3, U4, U5}. The same reasoning produces a second lattice point 
v4 which is not v, (since vj is a mid-point) and not in the interior (since I = 0.) 
Thus we have a second new lattice point on the boundary. Therefore, B > 7, so 
again the area is > 5/2. 


A-4 (44, 7, 6, 6, 0, 0, 0, 0, 32, 15, 30, 61) 


Solution. The answer is certainly greater than 2. Reason: to any two distinct 
points there corresponds at least one point whose distance from each of the given 
ones is rational. Proof: draw circles centered at the given points with rational radii; 
if the circles are not chosen too carelessly, they will intersect. (Choose the radii to 
be more than half the given distance but less than the whole.) 

Three punches are enough. Indeed: punch twice, at distinct centers. Since each 
punch leaves countably many circles, the two punches leave their intersections, a 
countable set. Consider all circles centered at points of that set, with rational radii; 
their intersections with an arbitrary line form a countable set. A point of that line 
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not in that countable set is at an irrational distance from all remaining points; 
apply the punch there. 


A-5 (29, 5, 0, 0, 0, 0, 0, 0, 1, 0, 58, 108) 


Solution. The answer is yes for 2 X 2 matrices and no in all other cases. Indeed: 
since ABAB = 0, it follows that B(ABAB)A = 0, and hence that BA is nilpotent. If 
a 2 X 2 matrix M is nilpotent, then M7 = 0 (because the characteristic equation of 
M has degree 2 or less). 

A counterexample for 3 X 3 (and therefore, just by enlargement by 0’s, for any 


size) is to take 
0 O 1 0 O 1 
0 O Of, B=i{1 0 OJ. 


0 1 O 0 0 O 


A= 


A-6 (6, 6, 54, 1, 0, 0, 0, 0, 4, 0, 45, 85) 


Solution. Let A, denote the number of admissible ordered pairs of subsets of 
{1,2,..., nm}. Clearly 


Define 


while 


aN 
= 
| 
es 
= 
| 
I l 
a a 
a) a) 
| an 
| —, 
_ a) 
ee” 
a 
x Set 
Set — 
ee 
— + 
~~ we) 
+ 
— 


O<i,j<sn-1 


= A,-1 + 1. 
Hence we immediately verify by induction that 
A, = Fons; B, = Fon43 — I. 


Hence, A,) = Fy, = 17711. CF; is the i-th Fibonacci number, defined by Fy = 0, 
F, = 1, and for n > 2, F, = F,_, + F,_)-) 
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B-1 (114, 2, 52, 0, 0, 0, 0, 11, 5, 3, 10, 4) 


Solution. There are two such functions, namely f(x) = y1990e*, and f(x) = 
— y¥1990 e*. To see this, suppose that the identity holds. Differentiating each side 
gives 


2F (x) f(x) = (FO) + (F'QODY, 

or equivalently, 
(f(x) -f'(2))=0, f(x) = fC), 

logif(x)| =x+C, — If(x)l = ee’. 
But f is continuous and f(0) = + 71990, and this implies that f(x) = + ¥1990e”. 
B-2 (23, 5, 4, 9, 0, 0, 0, 3, 0, 0, 32, 125) 

Solution. Let Sy = 1, and for n = 1, let 
n _(l-z)(l -— zx)(1 — zx?) +++ (1 = zx") 
St +e) CG ee w) 
It is easy to check that S, = (1 — zx)/(z -—x), S,=( — zx) — zx?)/ 
(z —x)z — x’), and by induction, 
(1 — zx)(1 — zx?) +++ (1 - zx”) 


To complete the proof, we need to prove that lim, _,., 8, = 0. To see this, we note 


that 
1 — "tl 
Sn+i = or Se 


S 


Asn > ©, 1 — zx"*! goes to 1 and z — x”*! goes to z. Thus, there exist positive 
numbers N and « such that 


1—z"*! 1 ; 
——_—-|< — +e< 
Zz — xn! |z| é 
for all integers n > N. It follows that 
1 
IS,a,1 <{— + e]lS,| 
Iz| 


and the result follows. 
B-3 (97, 7, 4, 2, 0, 0, 0, 0, 12, 2, 54, 23) 


Solution. Let U be all such 2 X 2 matrices, D the diagonal ones, and J those 
that are multiples of (! ‘). Note that (i) any two from D commute, (ii) any two 


from J commute, and (iii) (: | and (: ‘) commute. Clearly, 
lUN(DUJ)‘| = |U| — |D| — Vl + ID as = 154 — 152 — 15 + 1. 
Suppose that no two elements of S commute, and write 
S=(SA(DUJ))U(SN(DUJ)’). 
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Clearly, |S 1 (D UJ)| < 2 and (by (iii) above) 
ISA (DUS) | <|UN(DUJ)‘| < 154 — 152 — 15 +1. 


The result follows. (Here X° denotes the complement of X.) 

The number 50387 is far from the best possible and there are many potential 
solutions. Jiugiang Liu and Allen J. Schwenk from Western Michigan University 
have shown, using an inclusion-exclusion argument, that the maximum number of 
elements in U in which no two elements commute is 32390. 


B-4 (9, 5, 2, 1, 0, 0, 0, 0, 3, 2, 63, 116) 


Solution. Construct a graph whose vertices are labeled by the elements of G, so 
that for each vertex g, there are two “out” arcs, one to vertex ga and one to vertex 
gb (and consequently, each vertex g has two “in” arcs coming to it, one from ga™! 
and one from gb~'). The resulting graph is connected and each vertex has 
outdegree 2 and indegree 2. Therefore there is an Eulerian path which traverses 
the arcs, once and only once, and returns to the beginning. We get the desired 
sequence by listing the group elements associated with the vertices as we follow 
this path. 


B-5 (16, 11, 15, 12, 0, 0, 0, 11, 5, 5, 48, 78) 


Solution. Take a, = 1, a, = —1, and proceed by induction. Say p,(x) has the 
property, and also p,(x) - © or —% as x — » depending upon whether n is even 
or odd. Then 


(—x)"*? 
M 


has a sign change arbitrarily close to every root of p,(x) for M sufficiently large, 
and also the same sign as p,(x) at x* + 1 where x* is the largest root of p,(x). 
But now (M is already fixed) for x sufficiently large p,,,(x) has another sign 
change. Since p,, ,(x) has at most n + 1 roots, the result follows. 


B-6 (5, 0, 0, 0, 0, 0, 0, 0, 38, 6, 29, 123) 


Pn+i(*) = p,( x) + 


Solution. Consider the dissection of the equilateral triangle (as shown) into 9 
similar equilateral triangles. 


It shows that any t < 1/3 produces an empty intersection. 
We now show that the intersection is nonempty for ¢ > 1/3, since it always 
contains the centroid of S. Think of L, as the upper support line (see sketch). 
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Pi 


QO; P, Q, 


Let P, be a point of contact of L, with S, and P, a point of contact of: L, with S. 
Extend two lines from P, to points Q,,Q, on L, so that A,, the region ‘‘above” 
P,Q, and in § has area equal to that of B,, the region “below” P,Q, above L, 
and outside of S. If S is perturbed by replacing A, by B, (and similarly A, by B,) 
the new S (it is a triangle) will have a “lower” centroid. But this new centroid is 
still 1/3 of the way above L, (on the way to L,). Hence if B is a band comprising 
the middle 1/3 of the strip between L, and L.,, it contains the centroid of S. 
Hence for t = 1/3, the intersection is nonempty. Hence t = 1/3 is the smallest 
such value. 


NOTES 
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Branching Processes That Grow Faster Than Binary Splitting 


F. M. DEKKING 
Department of Mathematics and Information, Delft University of Technology, 
Julianalaan 132, 2628 BL Delft, The Netherlands! 


A drastic qualitative change in a physical system occurring after an arbitrarily 
small change of a parameter (like temperature) is usually called a critical phe- 
nomenon. One of the most thoroughly studied mathematical models of such 
phenomena is that of percolation which paraphrases the passage of a fluid through 
a porous medium consisting of very many small channels ((2]). The qualitative 
property of interest is whether or not the fluid will percolate the medium 
macroscopically if it passes each small channel with probability p (the parameter 
of the process). Typically this has probability zero for all values of p smaller than a 
critical value p,, and strictly positive probability for p larger than p.. 

For most percolation models it is very difficult to determine the exact value of 
p., and even more difficult to describe the behavior of the probability of percola- 
tion near p,. These difficulties are mainly caused by the interconnectivity structure 
of the small channels in the medium. In this note we present a model which has 
the same flavor as percolation models (where the interconnectivity structure is that 
of a tree), in which the above mentioned quantities can be exactly computed in an 
elementary way. The relation to existing models is in fact very close: our model has 
been derived from the analysis of Mandelbrot’s percolation process in [1]. 


1. Branching processes and family trees. The theory of branching processes is 
often said to have its birth with the solution by H. W. Watson of the problem of 
“the decay of the families of men” posed by F. Galton (see, e.g. [3]). This problem 
can be phrased as follows. We are given an offspring distribution Z,, i.e. a random 
variable taking non-negative integer values k with probability p, >0, k= 
0,1,...Q22_ 9p, = 1). At time 0, there is one particle, the ancestor (Z, = 1). This 
particle gives rise to a certain number of particles at time 1, according to the 
distribution Z,. Each of the Z, particles at time 1 independently gives rise to a 
certain number of particles at time 2, each according to the offspring distribution 
Z,. At time n there are Z, particles each reproducing independently of all 
particles at the same or previous time instants, all according to the same offspring 
distribution Z,, giving rise to a total of Z,,, particles at time n + 1. Here it is 
understood that Z,,, = 0 if Z, = 0. The Galton-Watson problem is to determine 
the probability that the process (Z,,) becomes extinct, i.e. that Z, = 0, for some 
n> 1. 


‘Work performed during a stay at the Université de Provence, Marseille, France 
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With a branching process (Z,,) we may associate a (random) family tree: there is 
a distinguished node called the root associated to the ancestor, which is connected 
to Z, first level nodes. In general there are Z, n™ level nodes, and each n" level 
node is connected to the n + 1™ level nodes associated with the offspring of the 


Fic. 1 


corresponding particle at time mn. As an example, consider the (1-or-3)-process 
defined by p, = 1 — p and p, = p for some p between 0 and 1. See Figure 1 for a 
realization of the first four levels of the family tree of such a process. Note that the 
family tree associated to a branching process is finite if and only if the process 
becomes extinct, and that the extinction problem can now be interpreted as the 
question ‘with what probability does the family tree contain the unary tree as a 
subtree, rooted at the root of the family tree?” Here the unary tree is the family 
tree associated to the (deterministic) branching process where each particle gives 
rise to exactly one particle (the offspring distribution is determined by p, = 1). 

We shall be concerned with the question “‘with what probability does the family 
tree of a branching process contain the binary tree as a subtree, rooted at the root 
of the family tree?’ Here the binary tree is the family tree associated to the 
branching process with offspring distribution given by p, = 1 (each particle gives 
rise to exactly two particles). If this probability is positive, then we say that the 
branching process grows faster than binary splitting, or has the binary splitting 


property. 


2. A fixed point equation. The solution of the extinction problem makes use of 
the generating function f of the offspring distribution defined by 


f(x) = L Dx". 
k=0 


In fact, let 56, be the probability that Z, = 0. Then one finds, intersecting the 
event [Z,,, = 0] with the events [Z, = k] that 


On+1 = Po + > ONDE = f(6,)- 
k=1 
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Hence 6 = lim 6, satisfies the equation 


f(x) =x. (2.1) 


Putting 6) = 0, one sees that 6 is the smallest root in [0,1] of (2.1). Then 
(excluding the trivial case where p, = 1, in which case 5 = 0) using that f is a 
convex function with f(1) = 1 and f’(1) = EZ,, the mean of the offspring distribu- 
tion, one proves the well-known extinction criterion for a nontrivial branching 
process (Z,,): 


S=1 iff EZ, <1. (2.2) 


As we will see later, there is no hope for such a simple criterion for the binary 
splitting property. First we prove an analogue of (2.1). 


THEOREM 1. Let (Z,,) be a branching process with generating function f, and let 
qa be the probability that (Z,,) grows faster than binary splitting. Then 1 — 7m is the 
smallest root in [0,1] of the equation 


f(x) + A -x)f'(x) =x. (2.3) 


Proof. For n = 1,2,...let y, = 1— 7, be the probability that the family tree 
associated to (Z,,) does not contain a binary subtree of height » rooted at the root 
of the family tree. If the family tree has k nodes at level 1 (i.e., Z, = k), then it 
does not contain a binary subtree of height n + 1 iff kK = 0 or 1, or either all or all 
but one of the k subtrees rooted at these nodes do not contain a binary subtree of 
height n. Therefore one has 


Ynt1 = Po t+ Py + L (yn + kyr "(1 — Yn) ) Pk 
k=2 


=Po+Pi+f(%m) — Po - Pin + 1 = Yn) (F'(%n) = Pi) 
and hence 


Yn+1 — G(¥); (2.4) 


where G(x) = f(x) + 1 —x)f"x). Therefore lim, ,..y, = y = 1-7 satisfies 
(2.3). Furthermore, if we put y, = 0, then (2.4) is also true for n = 0. Since 
G(x) = (1 — x) f"(x) is non-negative, G(x) is increasing and it follows that y = 
lim, .. G”(0) is the smallest root in [0,1] of (2.3). & 


A first simple consequence of this result (which is also obvious directly) is that 
a7 = 1 if and only if py) + p, = 0. There is no simple expression involving the 
moments of the offspring distribution which yields whether 7 = O or not. We shall 
return to this remark in the next section, giving now an example to illustrate this. 
Let (Z,,) be any branching process with offspring distribution given by (p,);_; 
and let (Z',) be the branching process with offspring distribution (p/,)7_, given by 


Po=9,  PL=PotPi, Pe=D, fork >2. 


Then it is clear Gif not by comparing the associated family trees, then because they 
yield the same equation (2.3)), that a is the same for both processes. However, if 
Po > 0, then all moments of the first offspring distribution are strictly smaller than 
those of the second. 
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3. A critical phenomenon. Here we shall complement the remark made in the 
previous section by giving an example which shows that the binary splitting 
probability 7 can not be a continuous function of the moments of the offspring 
distribution, nor of any other parameter of the process which depends continu- 
ously on the p,, k = 0,1,2,.... 


THEOREM 2. For each p in [0,1] let (Z,,) be the branching process determined by 
P,=1-pD,p,; =p, and let w(p) be the binary splitting probability of (Z,,). Then 


0 forO <p<— 
T(P)=13 3 ; 8 8 ; 

— 4 —4/1- — ~<p< 

44 D 9 NP 


Proof. The generating function of the offspring distribution is f(x) = (1 — p)x 
+ px°. According to Theorem 1, 1 — 7(:p) is the smallest root in [0, 1] of 


1—p+ 3px* — 2px? =x, (3.1) 
which is equivalent to 
1 —x = p(2x° — 3x*+4+ 1). 


Dividing by (1 — x) we obtain 1 = —p(2x* — x — 1), which has roots given by 


Hence, if p < s then the smallest root of (3.1) is necessarily x = 1, and therefore 
w(p)=0; if p> s then the smallest root of (3.1) is 


1 1 ; 3 3 1 9 
— — — — = — + — — — e || 


We end this note by remarking that Theorem 1 and 2 can obviously be 
generalized. It is an easy exercise to compute 7(p) for the branching process with 
geometric offspring distribution (p, = (1 — p)p* for k > 0). The critical value of 
p here is =. See [4] for generalizations to N-ary subtrees. 


REFERENCES 


1. J. T. Chayes, L. Chayes, and R. Durrett, Connectivity properties of Mandelbrot’s percolation 
process. Probab. Th. Rel. Fields, 77 (1988) 307-324. 

2. G.R. Grimmett, Percolation, Springer Verlag, Berlin, 1989. 

3. T.E. Harris, The Theory of Branching Processes, Springer Verlag, Berlin, 1963. 

A. G. Pakes and F. M. Dekking, On family trees and subtrees of simple branching processes, to 

appear in J. Theoret. Probab. 4 (1991). 


> 


1991] NOTES 733 


by 


ok = Di—‘[g]/(j -i)! forl <i<j<n+1 
ij < 


0 forl<j<i<n+1 
The article [2] referring to the approach in [1] includes the following: 


THEOREM. The map g — g* is a homomorphism from the algebra of functions 
continuously differentiable on a neighborhood N to a commutative algebra of 
upper-triangular matrices. 


Hence we have the following statements: 


(1) (f+g)* =f" + 8%, 

(2) (cf )* =c-f*, where c is a real constant, 

(3) (f-g)*=f* st =e -f*, 

(4) (1/g)* =(g*) ' if g(x) #0, 

(5) (f/g)* =f* + (g*) ' =(g*) '-f* if g(x) #0. 


A similar result for the map g — G, was discovered independently in the 
process of this research but can be nicely proved from the above theorem by 
noticing that G, = (Dg*D~')', where D = diag{0!,1/1!,1/2!,...,1/n!}. For ex- 
ample, a proof of the statement comparable to (4) above would be the following: 


Gije — [ D(1/g)*D-"]' 
- | D(g*) 'D-]’ _ |(De*D-") *] _ [(De*D-1)'] _ Gp}. 


As a matter of fact, any mapping of the form g > Ve*V~! or g > (Ve*V 'Y, 
where V is a constant, nonsingular matrix, is an algebraic homomorphism as 
above. 

Thus we have, 


1/g 0 0 
D[1/g] I/g 0 0 
Go'=G,,,=|D*[1/s] 2D[1/8] 1/g 0 0 
D*[1/g] 3D7[1/g] 3D[1/g] 1/¢ 0 

D"[1/g] nD*~"[1/g] se Ag 


A second approach to G, ! is by way of the Cayley-Hamilton Theorem of linear 
algebra. Here we have the obvious extension of the theorem in view of the fact that 
G, is a matrix of functions defined on some real neighborhood. 


THEOREM (Cayley-Hamilton). For each value x in the domain of g, the matrix G, 
satisfies its characteristic equation for that value of x. 


But for a given value of x the characteristic equation of G, has the form 


[g(x) — A(x)]"*" = 0, 


where A(x) represents an eigenvalue of G, for this specific value of x. 
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Thus, for each x, we have the matrix equation 


[g-1-G,]"" =0. 


& 
Using the binomial theorem, one can expand this expression to obtain 
an n+1 k 
x k Jer tK( —1)"Gf = 0. 


where it is understood that G; = J, With some algebraic manipulation one obtains 


n+1 


G, >> ("; + ') n+1—k(_1)"Gkol _ —gnrtl . I. 
Hence we have the formula 
n 1 
+] k 
- © (-'{" tt )aaGt 
Ln k + 1 gktl g 


But thankfully we can write G* as G,« by the homomorphism, so that the solution 
to the earlier system of equations represented as G, - X = E is 


‘ +1) 1 
X= YoY ti} — Gx E. 


The last component of this vector is D”[1/g], from the prior approach, so that we 
have the following result: 


THEOREM. 


n 1 
p'ti/el= E(-*(" tt) a'let]. 
k=1 & 
[Note that the sum starts with k = 1 since for k = 0 and n > 1, D"[g*]=01] 


With this formula in hand one can try some applications. Choose your favorite 
function, g, and try it! The problem, of course, is that the formula requires the nth 
derivatives of g* for k = 1,2,...,n. Nevertheless, there are a few cases in which 
it is easy and some interesting combinatorial results arise! 

With g(x) = e* one obtains: 


y= Doty ie 
With g(x) =x”, for any real number m, one obtains 
(-1)"(m+n-1)---(m+1)m 
= Lewy 1) mk) (rn ~1)+-+(mk —n +1). 
If m is a positive integer, then dividing by n! yields the identity: 


orm = Eves ym) 
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If m is a negative integer we note the following: 
0 forn > -m 


(-1)"n!( 7") forn < —m. 


(m+n—1)-+:(m+1)m= 


Thus we have the two additional cases. If nm > —m then 


o= Deve (me 4) 


whereas if n < —m then 


j= Eee ome) 
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1. Introduction. Ramanujan introduced the nested radicals 


Vi+2vl+3vlt4 (1) 


and 


V6 + 2V7 + 3y8+4°° (2) 


in [1] (see also [4, p. 327]) and gave solutions without any rigorous proof. It is not 
obvious that such expressions have an unambiguous meaning. In [3, p. 32] nested 
radicals of the form 


a + a+Vat-.: (3) 


are considered and convergence is shown to a root of the appropriate quad- 
ratic equation. A more difficult example with alternating signs is considered in 
[5, p. 299]. A solution had been given by Ramanujan and is to be found in 
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om EOE eo 
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on taking d := k*/“?—, Set A(k) = |f,(k) — f,(k)|. By (4) and (7) 
A(k) < A(k +a)/p 
so that 
0 < A(k) < A(k +na)/p”. 


Thus any two solutions satisfying (6) agree for k. Hence by (4) they agree for all 
points of the form k+ Na. & 


COROLLARY. Suppose that f satisfies (4) and (6) for all positive k and that a 
solution f* to (4) is obtained by iteration as above commencing with 0 < f, <f. 
Then f* = f. 

Proof. From (5) we have that f, < f implies f,,, <f, so f* <f and satisfies 
(6). By the Theorem f* =f. & 


3. Examples. I. Take p := 2, a := 1 and g(k) := (A — a)k + A? (A = a). Then 
the resulting functional equation 


f(kK) = VA -ayk +X +kf(k +a) 


has solution f(k) =k + 4X. 
Further specialization to A = a produces a family of nested radicals including 
(1) which comes on setting a = 1. That is 


f(2) = V1 + 2y1+3vV14+-°° 


and so the value of (1) is 3, on using the Corollary and on defining the value of a 
nested radical such as (1) or (2) to be that provided by the iteration (5). 
Similarly letting A = a + 1 produces the functional equation 


f(k) = Vk + (a+ 1) +kf(k +a) 


with solution f(k) =k +a+1. Then a = 1, k := 2 gives the value of 4 for the 
expression (2), agreeing with Ramanujan. 


II. Take p := 3,a:= 2,a:= 2b > 0, and g(k) := b° + 3b7k. Then (4) becomes 


3 
f(k) = Vb? + 3b2k + K2f(k + 2b) 
with solution f(k):=k +b. So iterating from any f, with 0<f,(k) < 
(b° + 3b*k)'”?, f,(k) will converge to k + b. In particular k := b := 1 gives 


4+ 10 + 97/16 + 25 722 + 49528 + (8) 


III. Similarly, expansion of (kK + a)* for a > 0 gives the solution to the func- 


tional equation 
f(k) = Va(k) + k°f(k + 3a), 


where g(k) := a4 + 6a7k? + 4a°k, as f(k) = k + a and expressions similar to (8) 
follow on iteration from fy := a. 


IV. In (5) take, for k > 1 (by translation), a = 0, p:= z,a:= land g(k) =k. 
Starting with f, := 1 we see easily that f,(k) < k + 1 for all k = 1 and that f,(k) 
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converges to the solution of f(k) = y(k + f(k + 1)). Then 


fY=-yitye2eBr . 


This is the kind of “continued square root” considered in [7]. 


V. In (4) take a :=0, a:=0 and g(k):=k, for p>1 to get f(k)= 
(k + f(k))'/””. This is solved by the iteration f, := 1 and f,,,(k) = (k + f,(k))'/””. 
Then f,(k) < (p — 1)k + 1 holds inductively. Consequently we have convergence 


to 
p D, p 
p=Ve+Vk +k + vee 
and £ is finite. It follows that B is the positive root of x? —x —k = 0. 


4. A two variable extension. For simplicity we consider only square roots and 
wish to solve the functional equation 


f(x,y) = ya(x) +y*f(xt+c,y +d) (9) 


for a nonnegative function f where x,y >1, c,d,a>0, g is a nonnegative 
function and y := inf{g(x): x > 1} > 0. The results here are less satisfactory as no 
explicit solution of (9) is at hand. Again, we solve iteratively with f, := 1, and 


frai(%s¥) = s(x) +y*f, (4 t+e,y +d). 
Whence 
fe(%,y) — ,-1(%, Y) =y"| f(x +c,y+d)—f,-(x« +e,y + d)| (10) 


and f(x, y) = y(g(x) + y*) = 1 = f(x, y). So, by (10) f, = f,,_, for all n and f, 
increases to some f* satisfying (9) and possibly taking infinite values. 


UNIQUENESS. Suppose f and A satisfy (9). Then 
f?(x,y) —h*(x,y) =y*[f(xtec,y td) —h(x+c,y +d). 
So if A(x, y) = |f(x, y) — A(x, y)I, it follows that 
ye 
h(x,y) + f(x,y) 


As before, we obtain from (9) for f (or h), f(x, y) = y'/? and substituting back in 
(9) we obtain in the limit f(x, y) => y*. Thus 


A(x,y) <2 "A(x +nc,y + nd) 
and, assuming a growth condition on f (and h): f(x + nc, y + nd) = 0(2”), we get 
f=h. 
EXAMPLE. Set g(x) := (A — a)x + AX(A = a), c= d= a, a ‘= 1. We may de- 
mand that y := x and obtain Example I above. 


A(x,y) = A(x+c,y+d). 


5. Other solutions. The functional equation (4) can give rise to solutions which 
do not meet the growth condition (6). Consider (4) with p := 2, g(k) = la =a 
:= 1. Then we have the simple functional equation 


f(k) = yl t+kf(k +1). (11) 
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Let g(k) be any non-trivial nonnegative C® function with support in [.25, .75] so 
that g and all its derivatives vanish at those endpoints. Extend g to be 0 
elsewhere. Then let f(kK) := k + 1 + g(k) on [0,1) and use (11) to extend f to 
[0,). Then f is a nonnegative C” solution to (11) but badly violates (6). 

Similarly, consider the function f given by f(k) =k + 1 for k #1,2,3..., and 
set f(1) := 2 +t, t > 0. Then (11) determines f on the remaining natural numbers 
and we have produced a finite discontinuous solution to (11). Moreover, from (11) 
we easily obtain, for an integer m > 1, 


f(m) >m+14 2t, 


which violates the growth condition (6). 

It is reasonable to conjecture that any analytic solution to (11) must satisfy (6) 
and so be k + 1. Anecdotal evidence for this is provided by the fact that the 
corresponding Riccati differential equation 


dy ; 1 
xX— = —_— —_— 
y Xy 


has solutions y, = (x + 1) and y, = (x + 1)+ x71 — x — Ae™*)™*. 
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The integers and positive integers are denoted by Y and 9", respectively. By b 
(mod a) we denote the arithmetic sequence of integers {na + b: n € Y}, where 
a © Y" is the modulus and b, with0O < b < a, is the residue of a. 

A system .“= {b; (mod a;): i € I} of arithmetic sequences that forms a parti- 
tion of Y is called a disjoint covering system (abbreviated DCS). Here I = 
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{1,..., NM} is a finite index set, or J = {1,2,...} is countably infinite; “ is said to 
be finite or infinite accordingly. A system ~ is called incongruent if i # j > a; # 
a,;. It is well known that a DCS with 1 < |J| < © cannot be incongruent. See e.g. 
[8]. Thus, a finite DCS will always have at least two sequences with the same 
modulus (unless |/| = 1). Hence we shall speak about the multiset of moduli; the 
term multiset generalizes that of a set by allowing the membership of a specified 
number of copies of each element. 

A major open problem in the study of DCS’s is: given a multi-set of moduli, 
determine if it is possible to choose corresponding residues so as to obtain a DCS. 
Considering the finite case, Znam [8] gave a necessary condition on the multi-set of 
moduli for this to be possible, and conjectured that his condition was also 
sufficient. Recently Simpson [5] found a counterexample to Znam’s conjecture, 
and gave a stronger necessary condition, in the hope that it might be sufficient; 
Zeilberger [7] frustrated this hope as well. 

Attempts have also been made to characterize infinite incongruent DCS’s 
(abbreviated IIDCS’s). Early work on this usually appeared together with work on 
finite DCS’s, such as in Stein [6], who asked whether the moduli of every nontrivial 
IIDCS are necessarily all powers of 2, and Krukenberg [4] who answered this in the 
negative. See also Dewar [2] and the survey article by Erddés [3]. Recently Beebee 
[1] raised renewed interest in the subject by asking about the existence of an 
IIDCS whose set of moduli was specified or was required to have certain 
properties. 

This note is a contribution towards the unification of these two directions of 
research. We strengthen Znam’s necessary condition while at the same time 
extending it to apply to infinite DCS’s as well. As an application, we give a 
negative answer to one of Beebee’s questions and to two similar questions. 

The concept of the density d of a subset of Y is central to our argument, so we 
start by collecting a few well-known or easy facts about it. A subset S Cc Y is 
periodic if there exists t € Y*, called a period of S, such that $ + t = S. The 
density of a periodic set S is defined as d($) = |S 1 B,| /t, where ¢t is any period 
of S and B, is any block of t consecutive integers. This definition is consistent with 
the more general definition of asymptotic density, which we do not require here. It 
is easy to verify that the family FY of periodic subsets of Y is closed under finite 
unions, and d(U/_,S,) = X*_,d(S;) whenever S,,...,S, are periodic and pair- 
wise disjoint. This property, called finite additivity, does not extend to infinite 
unions: First, U;_,S; need not be periodic even though each of the S$; is periodic; 
second, even if S$, € FY (i = 1,2,...) are pairwise disjoint and U?_,S; € F, it 
need not be the case that d(U?_,S;) = L7_,d(S;). What is true, however, under 
these assumptions is that d(U?_,S;) = X?_,d(S;); this is seen by observing that 
d(U_,S,) > d(U%,S,;) = XU ,d(S,), and letting N — «. We shall refer to this 
property of d as superadditivity of the density. 

Clearly d(b (mod a)) = 1/a, which suggests that the reciprocals of the moduli 
of a DCS must satisfy certain conditions. For a multiset of positive integers A, we 
denote r(A) = XL, < ,1/a. Superadditivity of the density implies that if A is the 
multiset of moduli of a DCS then r(A) < 1; we say that the DCS is saturated if 
r(A) = 1. Examples of both saturated and unsaturated IIDCS’s were given in 
Stein [6]. We restrict attention here to saturated DCS’s, because unsaturated ones 
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are, in a sense, too easy to obtain. Finite additivity of the density guarantees that 
every finite DCS is saturated. 
As usual, denote by (a, a’) the greatest common divisor of a and a’. 


THEOREM. Let A be the multiset of moduli of a saturated DCS. Then for every 
prime p there exists a partition of A into multisets Ay, A,,..., A p Such that: 

(i) a <A, if and only if p + a. 

(ii) r(A,) = (1 — r(A,))/p fori =0,...,p — 1. 

(iii) For each i € {0,..., p — 1} with |A,| > 1, if a,a' © A; then (a, a’) > p. 


Proof. Define the partition of A as follows. For every sequence b (mod a) in 
the DCS with pla, put a copy of a into that A,, i € {0,...,p — 1}, for which 
b € i(mod p); put those a € A with p + a into A p: Observe that each sequence b 
(mod a) with a € A,, i < p, is contained in i (mod p). If two such sequences (with 
the same i) had (a, a’) = p they would have to intersect; hence (iii) holds. Next, 
observe that each sequence with a € A, is split by the residue classes (mod p) into 
p sequences with modulus pa each. Hence, the superadditivity of the density, 
applied to the partition of the periodic set i (mod p) induced by the DCS, yields 


1 1 
(Ai) + or(A,) <7» i=0,...,p—1. 


Summing these p inequalities gives r(4) < 1. By assumption r(A) = 1, so each of 
the p weak inequalities must actually be an equality. This establishes (ii). 


REMARKS. 


1. Znam’s necessary condition, stated for finite DCS’s, was the existence, for 
every prime p, of a partition of A into Ao, A,,..., A, satisfying (i) and 
(ii). 

2. Our necessary condition is not sufficient; examples in Simpson [5] or 
Zeilberger [7] can be used to show this. 

3. Denote A(p) = {a € A: pla}. It is a corollary of the Theorem that if 
a € A(p) then r(A(p)) > p/a. Indeed, let i € {0,..., p — 1} be such that 
a €A,. Then r(A(p)) = 1 — r(A,) = pr(A,) = p/a. 

4. The Theorem was stated for multisets to accommodate both the finite and 
the infinite case. In the following applications, however, we speak of IIDCS’s, 
so the multisets of moduli are actually sets. 


Pp 


APPLICATIONS. 


1. It is easy to see that the sum of the reciprocals of the numbers 
2'3/, 2'5*, 3/5*, 2'3/5* (i, j,k € Y*) is 1. Beebee [1] asked whether there 
exists a saturated IIDCS whose moduli are these numbers. To have conve- 
nient notation, we let D,, for m € 9", denote the set of positive integers 
that are multiples of m and all of whose prime divisors divide m. With this 
notation, Beebee’s set of moduli is 


and r(D,,) = 3,4, %,% for m = 6, 10, 15, 30, respectively. Suppose A is the 
set of moduli of a (necessarily saturated) DCS. Using the Theorem for p = 2, 
we get a partition of A into Ay, A,, A>. Assume w.l.o.g. that 10 € Ay. By 
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(iii) De AN Ay = ©, so Dg C A,. But r(D,) = 3 while r(A,) < 3 by (ii). Thus 
the answer to Beebee’s question is negative. 
2. The following are two other candidates to be the set of moduli of a saturated 


IIDCS: 
A* = De U Dis UV Day U Dag U Dag U Digs U Dygo, 
A®™™ = De U Day U Dog U Day U Dag U Dsy U Digg U Doio. 


Each of these candidates can be rejected using Remark 3 above (for example, take 
p = 7 and a = 42 for A*, and take p = 13 and a = 52 for A**). 


Note. Through correspondence we learned that John Selfridge has shown that 
Beebee’s question has a negative answer by considering the residues mod 2 and 
mod 3. The same conclusion has been reached by Jamie Simpson, using a density 
argument. John Beebee has shown that our two additional systems of moduli 
cannot form a saturated ITDCS by using Selfridge’s argument, and Doron 
Zeilberger has reached the same conclusion by a density argument. 
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A Moment Convergence Theorem 


Lasos TAKACS 
Department of Mathematics and Statistics, Case Western Reserve University, Cleveland, OH 44106 


In the theory of probability we frequently encounter the problem of finding the 
limit distribution of a sequence of real random variables. If the random variables 
have finite moments and their limits exist, then the following theorem seems 
appropriate. 


THEOREM 1. Let {F(x)} be a sequence of distribution functions for which the 
moments 


M,(n) = fx" dF,(x) (1) 
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(iii) De N Ay = @, so Dg C A,. But r(D,) = 5 while r(A,) < 4 by (ii). Thus 
the answer to Beebee’s question is negative. 
2. The following are two other candidates to be the set of moduli of a saturated 
IIDCS: 


A®*™ = De U Dis UV Dy U D3 U Day U Digs VU Diao; 
A®™™* = Dg U Day U Dog U Day U Day U Dsy U Digg U Dio. 


Each of these candidates can be rejected using Remark 3 above (for example, take 
p =7 and a = 42 for A*, and take p = 13 and a = 52 for A**). 


Note. Through correspondence we learned that John Selfridge has shown that 
Beebee’s question has a negative answer by considering the residues mod 2 and 
mod 3. The same conclusion has been reached by Jamie Simpson, using a density 
argument. John Beebee has shown that our two additional systems of moduli 
cannot form a saturated IJDCS by using Selfridge’s argument, and Doron 
Zeilberger has reached the same conclusion by a density argument. 


REFERENCES 


1. J. Beebee, Examples of infinite, incongruent exact covers, Amer. Math. Monthly, 95 (1988) 
121-123. 

2. J. Dewar, On finite and infinite covering sets, in: Proc. Washington State Univ. Conf. on Number 
Theory (Washington State Univ., Pullman, Washington, 1971), pp. 201-206. 

3. P. Erdos, Problems and results on combinatorial number theory III, in: Number Theory Day, Proc. 
Conf. Rockefeller Univ. 1976, Lecture Notes in Math., vol. 626, pp. 43-49, Springer, Berlin, 1977. 

4. C.E. Krukenberg, Covering sets of the integers, Ph.D. Thesis, Univ. of Ill., Urbana-Champaign, IL, 
1971. 

5. R.J. Simpson, Disjoint covering systems of congruences, Amer. Math. Monthly 94 (1987) 865-868. 

6. S. K. Stein, Unions of arithmetic sequences, Math. Annalen, 134 (1958) 289-294. 

7. D. Zeilberger, On a conjecture of R. J. Simpson about exact covering congruences, Amer. Math. 


Monthly 96 (1989) 243. 
8. S. Znam, A survey of covering systems of congruences, Acta Mathematica Universitatis Comenianae, 


40-41 (1982) 59-79. 


A Moment Convergence Theorem 


Lasos TAKACS 
Department of Mathematics and Statistics, Case Western Reserve University, Cleveland, OH 44106 


In the theory of probability we frequently encounter the problem of finding the 
limit distribution of a sequence of real random variables. If the random variables 
have finite moments and their limits exist, then the following theorem seems 
appropriate. 


THEOREM 1. Let {F(x)} be a sequence of distribution functions for which the 
moments 


M,(n) = fx" dF,(x) (1) 
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exist for all r =0,1,2,.... Furthermore, let F(x) be a distribution function for 
which the moments 
M,=[ x’ dF(x) (2) 
exist for allr = 0,1,2,.... If 
lim M(n) = M, (3) 
n— oo 
for allr = 0,1,2,..., and if F(x) is uniquely determined by the sequence of moments 
M,, M,, M2,..., then 
lim F(x) = F(x) (4) 
n-©o 


in every continuity point of F(x). 


In applying this theorem, we should determine the limits (3), prove that {M,} is 
a sequence of moments of a distribution function F(x), and prove that F(x) is 
uniquely determined by (2). In 1920 H. Hamburger [7] gave a necessary and 
sufficient condition for a sequence {M,} to be a moment sequence, and we have 
various sufficient conditions for the uniqueness of the moment problem. By a 
result of G. Pélya [16], F(x) is uniquely determined by (2) if 


Maen 


lim sup 


mae 


< %, (5) 


and by a result of T. Carleman [2], F(x) is uniquely determined by (2) if 
°° 1 
LL im = © (6) 
m=1 M3? 
See also J. A. Shohat and J. D. Tamarkin [17]. Even if F(x) is uniquely determined 
by (2), we have to find ways to recover F(x) from (2). There is no general method 
of finding F(x). One possible way is to try to determine 


a(t) = fe dF (x), (7) 


the Fourier-Stieltjes transform of F(x), by {M,}, and invert (7). 

Theorem 1 has a long history. In the particular case when F(x) = ®(x), the 
normal distribution function, in 1887 P. L. Chebyshev [3] proved that ®(x) is 
uniquely determined by its sequence of moments and used the above theorem in 
his investigations in the theory of probability. However, he did not prove this 
theorem. For F(x) = ®(x), the theorem was proved only in 1898 by A. A. Markov 
[13]. See also A. A. Markov [14]. In 1920 G. Polya [16] proved Theorem 1 in the 
case when (5) is satisfied. In the stated general case Theorem 1 was proved in 1931 
by M. Fréchet and J. Shohat [6]. Their proof is based on some general results of E. 
Helly [9]. 

In many applications we encounter a sequence of discrete random variables 
€,,&5,...,&,,-.. which take on only nonnegative integer values and we want to 
determine the limit distribution lim, _,,.P{é, = k}, k =0,1,2,...,if it exists. In 
principle, we can apply Theorem 1, but we have to overcome all the difficulties 
mentioned earlier. On several occasions I used the method of binomial moments in 
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proving such limit theorems. See references [19], [20], [21], [22], [25], and [26]. This 
method has led to a general limit theorem whose proof is elementary and 
surprisingly simple. In addition it yields an explicit formula for the limit distribu- 
tion. This note contains this theorem and its proof. 

If € is a discrete random variable which takes on only nonnegative integer 
values, then its rth binomial moment is defined by 


B= e{(£)\ = > (K)Pte= 4) (8) 


k=r 
for r = 0,1,2,.... The rth factorial moment of € is defined as r!B, for r= 
0,1,2,.... It seems binomial moments were introduced by G. F. Hardy [8 p. 59] 


for the first time in 1909 for the practical purpose of calculating moments simply 
by repeated additions. Factorial moments were used in 1921 by R. v. Mises [15] 
and in 1923 by J. F. Steffensen [18]. Steffensen proved that if B. = 0 for r > m, 
then 


m UY 
P(E=k} = D(-1)"(Z)B, (9) 
r=k 
for k = 0,1,...,m. The wide dissemination of the use of binomial moments is due 
to K. Jordan [10], [11]. 
THEOREM 2. Let &,,&5,...,&,,-... be a sequence of discrete random variables 


taking on nonnegative integers only. Define P(n) = Pté,, = j}. Let us suppose that 
the r-th binomial moment of €,,, 


B,(n) = e{(&)\ = 5 (4) Pn), (10) 
jer 
is finite for allr => 0 and n = 1. If the limit 
lim B,(n) = B, (11) 
exists for all r > j = 0, and if the limit 
lim S(-1)""(})B, =P (12) 
r=j 


exists for j = 0, then 
lim P(n) =P. (13) 


Proof. Let us define a set of m+ 1 polynomials of degree m in x by the 


following formulas 
muted = Bor [ = (FJ Eeo- 
_p qym-il * x-j-1 _[*\[m-x 
ones Gz) 0 


for j = 0,1,...,m and m2=0O. The third equality can be proved simply by 
mathematical induction on m. We shall consider these polynomials only for 


r 
j 
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x = 0,1,2,.... Let us define 
1 forx=yJ, 


o(4) = f for x #j. (15) 


Since f,, (x) = 6(x) if 0<x<m and 0 <j <™m, and (-1)"f,, (x) 2 0 if 
x >j and m = j, we have the inequalities 
fps2s41,j(*%) S 6,(%) < fisrs j(%) (16) 
whenever x, j, and s are nonnegative integers. 
Putting x = €, in (16) and forming the expectation of each member, we obtain 
the following inequalities 
j+2st1 Up j+2s Up 
EY YF) se) s D(-Y7(F)B@) a7 
r=j r=j 
for j => 0 and s => 0. These inequalities are the generalizations of Bonferroni’s 
inequalities. (See C. E. Bonferroni [1] or W. Feller [5, p. 110].) If we let n — © in 
(17), by (11) we obtain that 
j+2s+1 P j+2s uy 
y (-1'"(5)B, < lim inf P(n) < lim supP(n)< ¥ (-1)'(5 JB, (18) 
r=j n-- co n--, co r=j 
holds for all s > 0. If s > ~, then by (12), the extreme members in (18) tend to P, 
and this proves that the limit (13) exists and is equal to P.. 
We note that if in Theorem 2, B, = a’/r! for r = 0,1,2,... where 0 <a < ~, 
then the random variables é, (n = 1,2,...) have a limiting Poisson distribution, 
that is, 


P.=e “al /j! (19) 


for j > 0. This result was proved by R. v. Mises [15] in 1921, and P. Erd6és and 
A. Rényi [4, p. 27] in 1960 by using a more complicated approach than Theorem 2. 
Finally, we note that if in Theorem 2, instead of (12), we assume only that the 
binomial moments {B,} uniquely determine the distribution {P}, then (13) remains 
valid, but (12) should be replaced by a more general formula. 
According to a result of A. Lenard [12] the distribution {P} is uniquely 
determined by the sequence {B_} if 


ae | 
Ls =: (20) 
r=1 B,/ 
If 
p = limsup(B,)'”” < , (21) 


rae 


then (20) is satisfied and {P} is uniquely determined by the sequence of binomial 
moments B. (r = 0,1,2,...); however, (12) should be replaced by the more 
general formula 


oO 


q’'B. 
r=j J 


l 


(22) 


in which g is any nonnegative parameter satisfying the inequality g > p* — 1. In 
some particular cases we can choose qg > (p — 1)/2. If p < 1, then the limit (12) 
exists and consequently (13) holds. In this case we can choose gq = 0 and (22) 
reduces to (12). The above result is based on L. Takacs [23], [24]. 
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Why the Method of Undetermined Coefficients Works 


CiLay C. Ross, Jr. 
Department of Mathematics and Computer Science, University of the South, Sewanee, TN 37375 


Introduction. As presented in many introductory differential equations books, 
the method of undetermined coefficients is somewhat mysterious. Zill [1] shows 
clearly how to use annihilators to solve undetermined coefficients problems, 
although he does not say why his procedure works. Hildebrand [2] hints at a similar 
method for difference equations. This note presents a simple theorem which 
explains why the method of undetermined coefficients works, both for differential 
equations and difference equations. 

For our purposes, an mth order linear differential operator has the form 
L(y) =y” + f,_a)y@—P + +++ +fCx)y’ + fo(x)y, and is defined on all C” 
functions over some interval I on which the coefficients are smooth (C®), and an 
nth order linear difference operator has the form M(y,) = y,4, + 4,-©K)Yran-1 
+++ +ak)y,,, + ao(k)y,, where a,(k) # 0, and y and every a, is defined for 
all k on some interval I of nonnegative integers. 


THEOREM. If L and A are both linear differential or linear difference operators 
of positive order, and f is a function annihilated by A, i.e., such that ACf ) = 0, then 

(a) every solution of L(y) = f belongs to Ker(AL); 

(b) in every subspace V of Ker(AL) complementary to Ker(L) (i.e., in every V 
such that Ker(L) ® V = Ker(AL)), there is a unique solution to L(y) = f. 


Proof. Suppose for definiteness that L is of positive order n and A is of positive 
order m. Then AL is of order n + m. The parts of the theorem are proved as 
follows: 

(a) Apply A to L(y) = f: AL(y) = A(f) = 0. 

(b) Let {y,, y>,..., y,} be a basis for Ker(L). Since AL is of order n + m, extend 
this to a basis {y,, ¥5,---5 Vys Z1,+++5 Zmt for Ker(AL), so that {z,,..., z,,} is a basis 
for V. Then L(z,),...,L(z,,,) all belong to Ker(A), since z,,...,Z,, all belong to 
Ker(AL). Moreover, L(z,),..., L(z,,) are linearly independent, for if La,;L(z;) = 0, 
then L(Xa;z;) = 0. This implies that La;z; is in Ker(L). But Ya;z, is also in V, and 
hence is 0 since Ker(L) and V are complementary. Thus a; = 0 for i = 1,...,m. 

Since Ker(A) has dimension m, {L(z,),...,L(z,,)} is a basis for Ker(A). Thus f 
in Ker(A) implies f has the unique representation f = Yc,;L(z;). Thus y = Yc,z; 
in V is a solution to L(y) = f, since L(y) = Xc;L(z;) = f. The coefficients c, are 
unique, so there is no other solution in V. O 


Coro.iary. If L and A commute then Ker(A) is a subspace of Ker(AL). If, in 
addition, Ker(L) and Ker(A) have only 0 in common, then in Ker(A) there is a 
unique solution to L(y) = f. 

747 
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Proof. Suppose L and A commute and z is in Ker(A) then AL(z) = LA(z) = 
L(O) = 0, so Ker(A) is a subspace of Ker(AL). The rest follows from (b) of the 
theorem since now one can choose V = Ker(A). O 


Observations. If L and A have constant coefficients, then L and A commute. 
This motivates the standard practice of undetermined coefficients; however, the 
complement of Ker(L) in Ker(AL) need have very little similarity to Ker(A), 
depending on just how large Ker(L) \ Ker(A) is, and this is what often leaves 
students mystified. 

It is not the fact that L and A are differential or difference operators which 
makes the theorem work, but the fact that for operators of these types, 
dim(Ker(AL)) = dim(Ker(A)) + dim(Ker(L)). 

The theorem explains exactly which nonhomogeneous linear differential or 
difference equations permit finding a particular solution by the method of undeter- 
mined coefficients: the right-hand side must be annihilated by some linear differ- 
ential or difference operator of positive order. In the constant coefficients case for 
either topic, it is clear from the theory how to obtain A and all of the basis 
functions specified by the theorem. In fact, the bases for Ker(L) and Ker(AL) so 
obtained will always give a basis for Ker(L) as a subset of a basis for Ker(AL). This 
means that one can immediately identify a basis for a subspace complementary to 
Ker(L) as those basis elements in Ker(AL) which were not in Ker(L). One then 
knows the form of a particular solution since the existence of a unique solution in 
this complementary subspace is guaranteed. These ideas are illustrated by the 
following examples. 


Examples. Consider L(y) = (D — 2)(D — 4)(D + 3)*y = 5x%e7**, where D = 
d/dx. Then A = (D + 3)4 is an annihilator of e~**, xe~**, and x7e~ >", as well as 


x%e~3*, and the various bases described in the theorem are: 

(Al ): {e?*, et e >. xe >*, x*e7 3%, xee*, xte7 3, x°e >} 
(L) {e2*, et e >. xe >*} 

(A): {e~>*, xe >*, x*e7 >, x°e7>*} 

(V): {x*e7 >, xee >, xte7 >, xe >} 


Now assume a particular solution of the form y, = c,x*e~** + c,x°e~** + 
c,x4e~** + c,x°e~°*. Note that the form of the assumed solution differs markedly 
from the form of the basis for Ker(A) because dim(Ker(L) M Ker(A)) = 2. 

Analogously, let L(y,) = (E — 2)(E — 4)E + 3)’y, = 5k°(1/3)*, where E‘y, 
=y,,,- Then A = (E — 1/3)* is an annihilator of (1/3)*, k(1/3)*, and k7(1/3)*, 
as well as k°(1/3)*, and the various bases described in the theorem are: 


(AL): {2*, 4*, (1/3)", k(1/3)", k7(1/3)", k3(1/3)", k4(1/3)", k5(1/3)"} 
(L): {2*,4*, (1/3)", (1/3) "} 

(A): {(1/3)", k(1/3)", k2(1/3)", k3(1/3)"} 

(V): {k?(1/3)", k3(1/3)", k4(1/3)", k°(1/3)"} 
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Now assume a particular solution of the form y, = c,k7(1/3)* + c,k°(1/3)* + 
c,k*(1/3)* + c,k°(1/3)*. Note again that the form of the assumed solutions 
differs from the form of the basis for Ker(A) because dim(Ker(L) M Ker(A)) = 2. 


Acknowledgement. The author expresses his appreciation to Stephen E. Puckette, whose incisive 
understanding at a critical moment made everything come together. 
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A Freshman Calculus Proof that Real Quartics Factor 


JOHN IsBELL 
Department of Mathematics, State University of New York, Buffalo, NY 14214 


This note gives a proof that is technically within the grasp of second semester 
freshmen, and as natural (I think) as the usual proofs that lim, _,,) sin x/x = 1, 
that a fourth degree real polynomial factors into two real quadratic factors. (So, by 
the quadratic formula, it has four complex roots.) The required calculus is the 
intermediate value theorem, in the form: If f(x) is a polynomial with positive 
leading coefficient and f(0) < 0, then there is r > 0 such that f(r) = 0. 

Two further selling points. (1) The idea is rather like completing the square. 
Just as every real quadratic x* + ax + b is I(x)* + c’, where /(x) is linear and c is 
constant, so a quartic x* + ax? + bx? +x 4+ dd is g(x)? — I(x)’, where q(x) is 
quadratic and /(x) is linear. (2) The proof turns on associated cubics and quadrat- 
ics. To get the formula for the solution of the quartic would be much harder, 
because this proof branches into three cases; but it certainly shows that any 
particular quartic equation has roots expressible in terms of roots of explicitly 
given cubics and quadratics (only one cubic). 


1. Derivation. Consider a real quartic polynomial Q(x) which we write as 
x* + 2Ax? + Bx? + Cx + D. We shall show that Q(x) = g(x)? — h(x)’, where 
e(x) =x? +Ax4+t and h(x) =7rx +5, for some real numbers r,s, t. This will 
give the factorization Q(x) = (g(x) + h(x))(g(x) — h(x)). Equating coefficients 
in g(x)? — h(x)? = Q(x) gives 

A*+2t-—r?=B 
2At —2rs=C 
t?-—s? =D. (1) 
Solving the first of these for f, 
t=35(r°+B-A’). (T) 


This is equivalent to the first equation of (1), so we may as well substitute in the 
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from a movie showing the complex roots of Q,(x) for t = 1,2,3, 4. Q,(x) has roots 
+i,1,2. The imaginaries move to 0, giving Q,(x) roots 0,0,1,2. Then the roots 
stay real and three of them stay constant while one grows, giving Q,(x) roots 
0,1,1,2. Finally the double root separates to 1 + i and 1 — i, which with 0 and 2 
are the roots of Q,(x). Q,(x) and Q,(x) are uniquely factorable into two real 
quadratics, and the factors of Q,(x) can’t be changed continuously into those of 
Q(x) by way of real quadratic factors of Q,(x), for t in (1, 4). 


On the Right-Hand Derivative of a Certain Integral Function 


JOHN KLIPPERT 
Department of Mathematics and Computer Science, James Madison University, Harrisonburg, VA 22807 


Suppose f is a real-valued function defined on the interval [0, 1] such that 

a) f is bounded on [0, 1], 

b) f is continuous on (0, 1], 
and 

c) lim, _, 9+ f(x) does not exist. 

Since f is bounded and has a single discontinuity on [0,1], it is Riemann 
integrable there. Let F(x) = /{f(t) dt, x € [0,1]. Since F is continuous on [0, 1] 
and differentiable on (0, 1] (see [2, p. 147]), a natural (and interesting) question for 
the student of “‘advanced calculus” to investigate is: 

Does 


F’.(0) = tim 2 FO) tim 


fof (t) at 
Ot x —0 x-ot X 


also exist? 
At first, the student is tempted to misapply I’Hospital’s Rule and conclude that 
the limit does not exist. To show that this need not be the case, we can take 
f(x) = sin1/x for x € (0,1] with f(0) = 0 (any other value for f at 0 will do just 
as well), and appeal to the elegant argument given by Steve Ricci in [1] that 
_ Jof(t) at 
lim ———— = 
x—0* Xx 


The argument, which relies upon integration by parts, is well within the grasp of 
the student. Based upon this example, the student may then wonder whether or 
not the limit in question always exists. Providing an example such that (a)—(c) hold, 
but lim, _, 9+/¢ f(t) dt/x does not exist affords the student a nice challenge. Such 
an example (which again appeals to integration by parts) is offered here. 

Let us put g(x) = sin(ln x) for x € (0,1] with g(O) = 0. Note that (a)-(c) hold 
for g. Using integration by parts, we find that the function 


= (sin(In x) — cos(Inx)), x <€(0, 1 
0, = 0 


F(x) = 
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> ot x —0 x-ot X 
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f(x) = sin1/x for x € (0, 1] with f(O) = 0 (any other value for f at 0 will do just 

as well), and appeal to the elegant argument given by Steve Ricci in [1] that 

_ Jof(t) at 

m _ 


li 
x-0t x 


The argument, which relies upon integration by parts, is well within the grasp of 
the student. Based upon this example, the student may then wonder whether or 
not the limit in question always exists. Providing an example such that (a)—(c) hold, 
but lim, _,9+/9 f(t) dt/x does not exist affords the student a nice challenge. Such 
an example (which again appeals to integration by parts) is offered here. 

Let us put g(x) = sin(in x) for x € (0, 1] with g(O) = 0. Note that (a)-(c) hold 
for g. Using integration by parts, we find that the function 


x 
5 (sin(In x) — cos(Inx)), x €(0,1] 
0, x=0 


F(x) = 
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is an antiderivative of g on (0,1]. Thus, since F is continuous on [0,1] (and g is 
integrable of [0,1]), we have by the Fundamental Theorem of Calculus (for 
x € (0, 1)] that {ig (t) dt = F(x) — F(O) = F(x) and the equation holds trivially 


for x = 0. Putting x, = e~"", n € N, we find that although lim, _,,, x, = 0, 
_ Jo"8(t) dt sin(—n) — cos(—n7) cos(nm) | (-1)""3 
lim ——~——_ = a en nr = lim ————— 


n 


which does not exist and, hence, lim, _, )+/jg(t) dt/x does not exist. 
In comparing f with g, notice that the intervals between consecutive zeros of g 
are exponentially smaller than those for f. For f, the length of the interval 


1 1 
(n+ 1)7’ nt 


is 1. (f) = 1/n(n + 1)a, while for g, the length of the interval [1/e“*?7, 1/e”7] is 
L(g) = (e™ — 1)/e"*" and we observe that lim, ,,, 1,(g)/l,(f) = 0. Thus the 
more highly oscillatory nature of g appears to prevent the right differentiability of 
its integral function at the origin. 

Finally, in light of these (and other) examples, the conjecture can be made 
(although the author is by no means certain of its validity) that if in addition to 
conditions (a)—-(c), the zeros of f form a null sequence of distinct terms whose 
associated sequence of distances between consecutive zeros is asymptotically 
proportional to 1/n* for some a > 1, then F’,(0) = 0. 
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Geographical Boundary Extrema 
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H. Thurston [1] has given Florida as an example of a geographical entity that 
has its highest elevation on the boundary. However, upon checking a detailed map 
[2] one finds, contrary to the claim, that the highest point in Florida is not on the 
border with Alabama, but rather is located approximately half a mile south of the 
border (just south of the village of Lakewood, Walton County; T. 6N, R. 20W, 
section 30). In [3, p. 4] the maximum elevation is given as 345 feet whereas the 
values at the border appear from the map to be bounded above by 340 feet. 
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For a valid example consider the province of Alberta. Since the B.C.—Alberta 
border in the vicinity of 12,294-foot Mt. Columbia is determined by the watershed 
line [4], and since Alberta has no interior points, or other boundary points, that 
high, Mt. Columbia’s summit is a boundary maximum for Alberta. 

British Columbia itself presents an interesting example that emphasizes the 
importance of the domain of definition in extremal problems. Following the 
decisions of the 1903 Alaska Boundary Tribunal the International Boundary 
Commission published a report [5] dealing with the survey and demarcation of the 
boundary between B.C. and Alaska. Since the placing of a permanent marker on 
the peak of Mt. Fairweather (approximate height 15,230 feet) was not feasible, the 
report [5, p. 175, boundary point 164] states only: “Station mark: the highest part 
of the snow cap as it was in 1907.” This statement provides the present legal 
description of the fixed boundary point. Coordinates of this snow cap had been 
determined from surveys, but since even for the key boundary point at Mt. St. 
Elias the calculations of two surveying parties differ by about 200 feet [5, p. 134], 
the actual location of the boundary point at Mt. Fairweather must be considered, 
in surveying terms, to be only very approximately known. 

Because snow accumulation since 1907 has surely changed the location of the 
highest part of the snow cap, the station mark no longer exists in a physical sense. 
Further, because of the inaccurate survey, we cannot state from a practical 
viewpoint—unless the drift has been very large—whether the present summit of 
Mt. Fairweather is legally in B.C. or in Alaska. In the former case we would have 
an interior absolute maximum for B.C.; in the latter case the present legal absolute 
maximum for B.C. is somewhere on Fairweather’s slopes. It should be noted that 
because of the watershed definition, this situation cannot occur on the B.C.- 
Alberta border and indeed the border has been recently redrawn after it was 
noticed that a body of water was flowing the “wrong way!”’ 


Acknowledgements. The authors wish to thank the staff at Canada’s Ministry of Energy, Mines and 
Resources, and the referee. 
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E 3460. Proposed by E. Ehrhart, University of Strasbourg, France. 


(a) Suppose we have n mutually perpendicular chords through a point P 
interior to a sphere S$ in n-dimensional Euclidean space. Prove that the sum of the 
squares of the lengths of these chords depends only on the radius r of the sphere 
and the distance d from P to the center of the sphere. 

(b) More generally suppose 1<k <n. Each set of k of the n mutually 
perpendicular chords through P given in (a) determines a k-dimensional affine 
subspace. Prove that the sum of the (2/k)-th powers of the k-dimensional 
measures of the cross-sections of S made by these (i affine subspaces depends 


k 
only on r and d. 


E 3461. Proposed by David Callan, University of Wisconsin, Madison, WI. 


Suppose r is a rational number but not an integer. It is known that tan(r7/2) is 
an algebraic number. (Cf. Ivan Niven, Irrational Numbers, Carus Mathematical 
Monographs No. 11, pp. 37-41.) Find the smallest positive integer k, such that 
k tan(r7/2) is an algebraic integer. 


E 3462. Proposed by J. J. Rotman, University of Illinois at Urbana-Champaign. 


Prove that any connected simple graph with an even number of edges has an 
orientation (assignment of direction to each edge) such that the number of edges 
leaving each vertex is even. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Extremal Morley Triangles 


E 3307 [1989, 154]. Proposed by Peter Andrews, Wilfrid Laurier University, 
Waterloo, Ontario, Murray Klamkin, University of Alberta, Edmonton, Alberta, and 
Edward T. H. Wang, Wilfrid Laurier University. 


The celebrated Morley triangle of a given triangle ABC is the equilateral 
triangle whose vertices are the intersections of adjacent pairs of internal angle 
trisectors of ABC. If s, R,r,F and s,,, Ry,ry, My are the semiperimeter, the 
circumradius, the inradius, and the area, respectively, of ABC and its Morley 
triangle, determine the maximum of (i) s,,/s, (ii) R,,/R, Gii) r,,/r, and (iv) Fy, /F. 


Editorial remark. The first, second, and fourth ratios considered in the problem 
achieve their maxima when A = B = C = 77/3 and have greatest lower bound 
zero. The third ratio, r,,/r, achieves a positive minimum when A = B = C = 77/3 
and has least upper bound equal to 2/9. The calculations of these extrema are 
given in their approximate order of difficulty. 


Solution of (ii) by the proposers. We prove that 
Ry/R < (8/V3 )sin>(7/9), 


with equality only when the original triangle is equilateral. We require the known 
fact that the Morley triangle has side length 8R sin(A /3)sin(B /3)sin(C /3). (Cf. 
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[1], [2], or [3] in the list of references.) Since the circumradius of an equilateral 
triangle is 1/ V3 times its side-length, we have 


Ry/R = (8/V3 )sin( 4 /3)sin( B/3)sin(C/3). 


Now if g,(x) = logsin(x/3), then g4(x) = —{3sin(x/3)}~*. Since 9%(x) < 0 on 
(0, 7), we have by concavity 


82(A) + 82(B) + 8(C) < 3g,({A + B+ C}/3) = 3g,(7/3), 
or 
sin( A /3)sin( B/3)sin(C/3) < sin?(7/9), 
with equality only if A = B = C = 7/3. Thus the claimed result follows. 
Solution of (iii) by the proposers. We prove that 
(8/V3 )sin3(7/9) <ry/t < 2/9, 


with equality on the left only if the original triangle is equilateral. We begin by 
using the formulas r,, = R,,/2 and r = 4R sin( A /2)sin(B /2)sin(C /2) to obtain 


ry/t = (1/8)(Ry/R)/{sin(A/2)sin( B/2)sin(C/2)} 
or, by the solution of part (ii), 
ru 1 sin( 4/3)sin( B/3)sin(C/3) 
r _- ¥3. sin( A/2)sin( B/2)sin(C/2) © 
Now if g3(x) = log{sin(x/3)/sin(x/2)}, then 

g3(x) = —{3sin(x/3)} “7 4 {2 sin(x/2)}° 
_ {sin(x/3)/(«/3)}° — {sin(x/2)/(x/2)}° 
7 {6x~' sin(x/3)sin( x/2)}° | 


Since (sin t)/t is strictly decreasing on (0,7 /2), it follows that g3(x) > 0 for 
0 <x <7. Hence by convexity 


g3(A) + 83(B) + 83(C) > 3g3,({A + B + C}/3) = 383(7/3) 
or 
sin( A /3)sin( B/3)sin(C/3) sin( 77/9) 
sin(.A/2)sin( B/2)sin(C/2) ~ fre 
Thus we have the lower bound r,,/r > (8/ V3 )sin*(7r/9), with equality only if 
A=B=C=~7/3. 


To get an upper bound we note that if0 <A <C <B<~7 and if « is a small 
positive number, then 


g,(A —«) + 8,(B + €) — 83(A) — 83(B) 


= [ {93(B +t) —8s(A — dj} at > 0, 


3 
= 8sin°(7/9). 


since g’, is strictly increasing on (0,7). Thus r,,/r increases if we increase the 
largest angle of the triangle and decrease another angle by the same amount. 
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Since gi(x) > 0 for 0 <x < 7, we have 
sin(A/3) — sin((m — 0,)/3) 
cos( A /2) < cos((ar — 0,)/2) < 70, 
sin( B/3) sin( 77/6) 
cos(B/2) — cos(77/4) 
sin(C/3) — sin((ma — 20,)/3) 
cos(C/2) < cos((7 — 20,)/2) < 087. 


It follows that in case (b) we have s,,/s < .06, a bound smaller than that found in 
case (a). 
Finally suppose that (c) holds. Say A > 6, and B,C < @,. Since g{(x) <0 on 
(0, 6,), we have by concavity 
§(B) + 8(C) < 28\((B + C)/2) 


with strict inequality for B # C. Thus we may assume B = C. Since A > 4,, 


= 2/2, 


1 1 
B= >(m—A) <>(m—- 6,) = 81275... 


If we set f(x) = 2g,(x) + g,(m — 2x), we have s,,/s = 3 exp f,(B). Now fi (x) = 
2 g(x) — 2g)(a — 2x) and we claim that f;(x) > 0 for 0 <x < (m7 — 6,)/2. In- 
deed, 


1 x 1 x x 1-x7/18 1 
(x) = < tan = + cot = > — + ———— > 
gi(2) = 5 tan + 30 > G+ Gas) 7 


and 
1 1 qT 2X 1 1 qT 2X 
gi(7 — 2x) = 5 cotx + = tan 6° 3 <5, 7 3 fan 6° 3 
For 0 <x < .82 we have 
1 2 T 2x 1 2 T 
fi(x) > — -— = tan{— + —]| > —~ —- = tan{ — + —— |] > .0007. 
x 3 6 3 82 3 6 3 
Thus, in this case 
T 0, 
Sy/S < 3exp f 77 = 16976... < .18479..., 


and hence the maximum of s,,/s can occur only in case (a) with A = B = C. Thus 
the claimed result s,,/s < (8/V3 )sin>(77/9) is established. 


Solution of (iv) by the editors. We prove that 
F,,/F < (64/3)sin®(7/9) = 0.034148..., 


with equality iff ABC is equilateral. Our argument is similar to that for (i), but is a 
little more complicated in case (c). 
We have 


F = abc/(4R) = 2R’ sin Asin B sinC 
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and 
Fy = (v3 /4){8R sin( 4/3)sin( B/3)sin(C/3)}’, 
so that 
Fy. ~ {sin( A/3)sin(B/3)sin(C/3)}° 
-F 8V3 sin A sin B sin C 


If g,(x) = log{sin*(x/3)/sin x}, then g(x) = (2/3)cot(x/3) — cot x and 
g(x) = {sin x} ~7 — 2{3sin(x/3)} 7. 


Now g/(x) is negative for 
0 <x < 3arcsin({3(2 — v2)/8}'”") = 1.46342... = 4, 


and is positive for 0, <x <7. We treat three cases. 
(a) If each of A, B,C < 6,, then, since g7(x) < 0 for 0 <x < 0@,, we have by 
concavity 


84(A) + 8,(B) + 84(C) < 38,((4A + B+ C)/3) = 38,(7/3) 
with equality iff A = B = C. Thus in this case 
Fy/F < 8v3 exp{3g,(7/3)} = (64/3)sin°(7/9). 

(b) If A>B> 0, then A <7 — 0,, B < 7/2, and C < 7 — 20,. Now gi(x) 
>0 for 0<x <7/2, since tan(x/3) < (1/3)tanx there, and g/(x) > 0 for 
17/2 <x <1, since cot(x/3) > 0 and —cotx > 0 there. It follows that in this case 
{sin((ar — 0,) /3)sin(7/6)sin((a — 20,)/3)}° 

sin(7 — 6,)sin(7/2)sin( a — 26,) 
= 023552... . 
(c) If A > 0, > B,C, then, since g4(x) < 0 for 0 <x < 04, we have by concavity 
84(B) + 84(C) < 28,((B + C)/2) 


with equality iff B = C. Thus we may assume that B = C and B = (17 — A)/2 < 
(7 — 0,)/2 = 83908... . Let f,(x) = 2g,(x) + g,(@ — 2x). We shall show that 


84(x) > 84(7 — 2x), O<x<(7— O,)/2, (*) 
and conclude that f, is increasing on this range. 
Suppose first that x € J = (.7,(a — 04)/2]. Since g/, is decreasing for 0 < x < 
6, and increasing for 6, <x < 7, we have 
g(x) > g4((7 — 6,)/2) = 1.42306... 
> 1.18884... = g(7 — 1.4) = g'(7 — 2x), 


so (*) holds for x € J. 
Next, suppose that x € J = (0,.7]. We begin by showing that 


Fy/F < 8v3 


3/2 sin(x/3) <sin(2x), x EJ. 
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Indeed 


Sin(2x) = 2eos| + 2cos x + 2c0s( 5 | 
sin( x /3) 3 3 
is clearly decreasing on J and we have 
sin(2x) /sin(x/3) > sin1.4/sin(.7/3) = 4.26192... > 3y2. 


Now g/(x) > (1/3)cot(x/3), since tan(x/3) < (tan x)/3 for x € J, and also we 
have 


2 T 2x 
g(a — 2x) = 3 tan{ 7 + =| + cot(2x). 


6 3 
Let h(x) = (1/3)cot(x /3) — cot(2x). We prove (*) for x € J by showing that 
h(x) > (2/3)tan(7/6 + 2x/3) ( * *) 


holds on this range. We have seen above that 
1 x v2 \° 1 ? 
h'(x) = 2csce*(2x) - = esc?( =| = |— — | ———— } <0 
9 3 sin 2x 3 sin( x/3) 
for x € J. Thus h(x) > hC7) = 1.23007... . On the other hand, if x € J, we have 
(2/3) tan(7/6 + 2x/3) < (2/3)tan(7/6 + 1.4/3) = 1.01637... , 


so that (* *) and hence (*) holds for x in J. 
It follows that f,(x) is increasing for 0 <x < (a — 0,)/2, so that in case (c) we 
have 


Fy,/F < 8v3 exp f,((7 — 0,)/2) = .032120..., 


which is smaller than the estimate found in (a). Thus the claimed result F,,/F < 
(64 /3)sin°(7/9) is established. 
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No other correct solutions were received. 
A Card Game with a Positive Payoff 


E 3330 [1989, 446]. Proposed by Charles Vanden Eynden, Illinois State Univer- 
sity, Normal, IL. 


In Littlewood’s A Mathematician’s Miscellany the author describes a game 
played with an infinite number of cards, each containing n on one side and n + 1 
on the other, with n = 1,2,.... A referee chooses a card at random and holds it 
so that the players see opposite sides. Each player may either play or pass. If both 
play, the player seeing the lower number must pay the other $1.00. Littlewood 
argues that the players will always pass. 
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This will be positive if n) > 7. 

In the case of a nonrandomized strategy a clever opponent can force negative 
expected winnings by choosing gq, = 0 and q, =p,_, for n > 1. (A clever oppo- 
nent knows the strategy of the person he is playing against.) For then, since 


P,»_-1 = Pe_1 =Pn_1In> We have 


Pry ~ Pn- 1Pn+1 
W=—+— + - 
80 7 5 ae 4m — gntl x 
Pi | PiPs = 1 ~ PnPn+2 
— 80 + 8 -— + L Flee gn+i — gna |P nv arr Qnte 
_ TLIO +d; —1.7 + Drs 
= Di) 8 nt 


Thus, since the coefficients of each p, are negative, we have W < 0. 


Editorial comment. Another recipe for foiling a nonrandomized strategy is to 
choose q,,, = 1 if r is the smallest natural number with p, = 1 and q; = 0 if 
L#rt+1. 


Solved also by P. Griffin, B. Kibler, I. Kozma Usrael), H. Lipman, and the proposer. Two incorrect 
solutions were received. 


A Monotonic Function Connected with the Error Function 
E 3351 [1989, 838]. Proposed by Jet Wimp and Um Bencharit, Drexel University, 
Philadelphia, PA. 
Suppose a > 0, and let f be the real-valued function on (1,0) defined by 
f(x) = [1 + erf(ax—!/?)]" + [1 — erf(ax~'/?)|", 


where 
erf(u) = 277!” fee dt. 
0 


Show that f is monotonic on (1, ©). 


Solution by Richard Stong, University of California at Los Angeles. We will 
establish that f is actually strictly increasing. Let h(x) = erf(ax~'/7), g(x) =x, 
U(x) = 1 + h(x))&™, and V(x) = A — A(x))®™. Then by elementary calculus, 


U'=U-[g'In(1+h)+gh'/1 +h) 
=(1+h)*'[g'(1 + A)In(1 + A) + gh’), 

V'=V-[g'In(1 — A) - gh'/(1 — h)] 
=(1-h)* ‘[g'(1 — A)In(1 — hh) — gh']. 


Since g’ > 0, we can use In(1 + z) > z/(1 +z) and In. — z) > —z/(1 — z) for 
z€[0,1) to obtain U> +h) [oh + gh] =1+h)* (ehyY and V's 
(1 — h)®~'[-g'h — gh'] = -—(1 — h)®&"'(gh). Therefore, f’ = U'+ V' > 
(1 + h)®-!-— 1 —h)®~!](ghy. Since h(x) € (0,1) and g — 1 > 0, the expression 
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in brackets is positive, and it suffices to show (gh) > 0. By a change of variable, 
the expression for {g(x)h(x)}’ can be written as (2a/vi)f{de~*’ ” 
[1 /2)x~'/? + a*t*x~3/*] dt, which is clearly positive for x > 0. 


Editorial comment. The proposers asserted the more general statement that 
[1 + h(x)]®™ + [1 — h(x)]&™ is strictly increasing on (1,0) as long as g,h are 
differentiable, 0 < h(x) < 1 < g(x), g’ > 0, and (gh) > 0, all on (1, ©), which can 
be proved as above. 

They noted that the problem arises in evaluating the signal to noise ratio in the 
so-called polarity threshold algorithm. The fact that f(x) is strictly increasing 
means that the signal to noise ratio increases as the number of sample points 
increases. 


Solved also by R. A. Agnew and the proposer. Three incorrect solutions were received. 


A Unique (2k + 1)-th Root of a Matrix? 


E 3362 [1990, 63]. Proposed by Bjorn Poonen, Winchester, MA. 


Given a positive integer n, let M denote the set of n by n matrices with 
complex entries. If A <€ M, let A* denote the conjugate transpose of A. Prove 
that for each positive integer k and each B © M there exists a unique A © M 
such that A(A*A)* = B. 


Solution by Stephen Pierce, San Diego State Univ., San Diego, CA. To show 
existence, let B = UH, where U is unitary and H is positive semidefinite. Let P 
be the unique positive semidefinite (2k + 1)st root of H, so that B = UP?**!. If 
we set A = UP, then A*A = P’, and the result follows. 

If B = A(A*A)* = C(C*C)*, then B*B = (A*A)?**! = (C*C)***1, Since B*B 
is positive semidefinite, it follows that A*A = C*C. Thus the null spaces of A, C, 
C*C, and A*A are all the same. Hence A and C agree on any null vector of A*A. 
If A is a nonzero eigenvalue of A*A = C*C corresponding to an eigenvector v, 
then Bu = A*Av = A*Cv, so that Av = Cv. Thus A and C agree on any eigenvec- 
tor of A*A. Since A*A has a basis of eigenvectors, A = C. 


Solved also by D. Callan, R. J. Chapman (U.K.), D. Z. Dokovié (Canada), M. Golomb, C. Hill, A. A. 
Jagers (Netherlands), F. Kittaneh (Kuwait), A. Kresch (student), O. P. Lossers (Netherlands), J.-M. 
Monier (France), T. S. Norfolk, B. N. Partlett, A. Pedersen (Denmark), D. Ruch, R. Stong, A. Tissier 
(France), University of Wyoming Problem Circle, and the proposer. Nine partially incorrect or 
incomplete solutions were received. 


Convergence of Some Recursively Defined Series 


E 3368 [1990, 150]. Proposed by I. E. Leonard, University of Alberta, Edmonton, 
Alberta, Canada. 


Suppose a > 0. Let us define a sequence a,,a,,... by taking a, as an arbitrary 
positive number and putting a,,,, = a, exp(—a®%) for n = 1,2,... . Find the values 
of a for which the following series converge: 

(i) La, (ii) Lat, (iii) Dale 
n=1 n=1 


n=1 
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Composite solution by Terence Shore and Douglas B. Tyler, California State Univer- 
sity—Dominguez Hills, Carson, CA, and by John Henry Steelman, Indiana University 
of Pennsylvania, Indiana, PA. (i) converges if and only if a < 1, (ii) diverges for all 
a, and (iii) converges if and only if 0 < a < 1/2. These results follow immediately 
from the more general result that for all real 8, the series Na? converges if and 
only if a < B. 

To prove this result, first note that the recurrence yields a,, = a, exp(— L7=/a®) 
for all n> 1. If Y_,a% =S <m, then lima, = a,exp(—S) > 0. This implies 
lim a* > 0, which contradicts Ua® converging. Therefore La® must diverge, which 
also implies lim a, = 0. 

Now suppose 8 < a. Since lim a, = 0, we have 0 < a, < 1 for sufficiently large 
n. For such n, we have a® > a®, and hence La® diverges by the comparison test. 

Now suppose 8 > a, so that B/a > 1. If we can prove a® < [a(n — 1)]- 87%, 
then we obtain the convergence of Xa? by the comparison test. To verify the 
desired inequality, it suffices to prove a,* > a(n — 1) for all n > 1 by induction. 
The inequality is trivial for n = 1. If we assume a,°% > a(n — 1) for a particular 
value of n, then a, ?, =a, exp(aa?) > a, °*(1 + aa’) =a, % + a > an, the last 
step by the inductive assumption. This completes the induction and the proof. 


Editorial comment. Shore and Tyler observed that on the other hand a,* < 
(a + 1)n + C for some constant C depending on a, and a, so that Lat is 
comparable to the harmonic series. 

Solved also by A. Fischer, E. A. Herman, P. W. Lindstrom, O. P. Lossers (Netherlands), J.-M. 


Monier (France), T. S. Norfolk, A. Pedersen (Denmark), A. J. Rosenthal, R. Stong, P. Tracy, University 
of Alberta Problem Solving Group, and the proposer. One incorrect solution was received. 


Folding an Equilateral Triangle Into a Large Hexagon 


E 3369 [1990, 150]. Proposed by Jiro Fukuta, Aichi University, Aichi, Japan. 


Suppose we are given a piece of paper in the shape of an equilateral triangle 
ABC. Suppose P is a point in the intersection of the three open circular discs with 
diameters AB, BC, CA. If we fold the three corners of the paper in such a way that 
the vertices coincide with P, we get a hexagon three sides of which are the creases 
formed and the other three sides of which are portions of the sides of triangle 
ABC. 

Prove that the area and perimeter of this hexagon are both maximized when P 
is the centroid of ABC. 


Solution by O. P. Lossers, University of Technology, Eindhoven, The Netherlands. 
The hexagon 7 can be formed by cutting off suitable triangles a = A,A,A, 
B = B,B,B, and y = C,C,C from the equilateral triangle 7 = ABC. Consider the 
triangle a; it has angles A,, A,, and A with A = 60°. By the folding structure, we 
have Area a = (1/8) AP)*(cot A, + cot A,), which, by the convexity of cot on 
(0, 7/2), is at least 


1 1 1 
—(AP)’ cot —(A, +. A,) = — V3 . 
g (AP) cot 5 (Ai + Az) 55 (AP) V3 


1991] PROBLEMS AND SOLUTIONS 765 


Applying the same argument to B and y, we have 


Area (a UB Uy) > V3 ((AP)* + (BP)? + (CPY’) 


1 
> 75 V3 (( AQ)’ + (BQ)’ + (CQ)’), 


where Q is the centroid of +r. Hence Area 7 < Area 7 — (1/4)V¥3 (AQ) = 
(2/3) Area 7, with equality achieved when P = Q. 

For the perimeter, let A, = AA, + AA, — A, A). By the folding structure, we 
have 


1 1—cosA, 1-cosA, | ep - - 
——_— + ————| = ~AP|tan —A, + tan = 
a 9 sin A, sin A, in ar a ss 
which, by the convexity of tan on (0, 77/2), is at least AP tan(1/4)(A, + A,) = 
(1/ 3) APY3 . Applying the same argument to B and y, we have A, + A g + A,> 
(V3 /3).AP + BP + CP) > (v3 /3X AQ + BO + CQ), since the sum of the dis- 
tances of a point from the vertices of an equilateral triangle is smallest when the 
point is the centroid of the triangle (cf. Ch. IX of [1] or §3.5 of [2]). Hence 
Perimeter 1 = Perimeter 7 — (A, + A, + A,) < 3AB — V3 AQ = 2AB, with 
equality achieved when P = Q. 


REFERENCES 


1. G. Pélya, Mathematics and Plausible Reasoning, Vol. I, Princeton, 1954. 
2. Nicholas D. Kazarinoff, Geometric Inequalities, New Mathematical Library, No. 4, 1961. 


Editorial comment. The result for area follows from that for perimeter, since a 
regular hexagon has area at least equal to that of any other hexagon of equal or 
smaller perimeter. 


Solved also by J. Dou (Spain), G. D. Chakerian and V. Schindler (Germany). Partially solved by 
H. Lipman and by the proposer. One incorrect solution was received. 


Iterating a Polygon on the Unit Circle 


E 3377 [1990, 240]. Proposed by Farhoud Pouryoussefi (student), Sharif Univer- 
sity of Technology, Tehran, Iran. 


Suppose we consider the polygon with vertices e’®', e/82,..., e!®» in the complex 

plane, where 
0<0,<0,< ++: <0,<27. 
Suppose a and £# are given positive numbers with a + B = 1. Define 0 = 6, for 
i= 1,2,...,m and 
A) = aA) + Bast? 
l l 

for k = 1,2,..., where subscripts are taken modulo m and the angles 9%) are 
taken modulo 27r. Prove that 


lim (0(9 — 6{) = 2a/n 


fori = 1,2,...,n. 
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Editorial remark. In the first line of the problem as published there was a 
spurious factor 27 in the exponents. 


Solution by Allan Pedersen, S@borg, Denmark. Let ¢; = 0;,, — 9; for i= 
1,2,...,.2 —land ¢, = 27+ 6, — 6,, so that £7_,¢, = 27, ¢, > 0. Define 6 
= ¢,; for i = 1,2,...,n and 


6” = adK—- + BAKZIY (i = 1,2,...,n) 


for k = 1,2,..., where the subscript n + 1 is interpreted as 1. For any fixed k the 
numbers $,...,¢% are positive real numbers with sum 27. Since ¢{* = 
6), — 9) modulo 27, it suffices to show that lim, ,,.¢% =27/n for i = 
1,2,...,”. Clearly 


(4%, ...,d)" = M($-,..., oH -YY’, 


where M is the n by n matrix whose nonzero entries are M,; , = a fori = 1,...,n, 
M; 54, = 6 fori = 1,2,...,n — 1, and M, , = 8. The matrix M is doubly stochas- 
tic and for k >n-—1 the powers M* have all entries positive. By the 
Perron—Frobenius theory (see D. Ruelle, Statistical Mechanics: Rigorous Results, 
Benjamin, 1969, p. 136), it follows that 


. (k) (k)\F _ 4: k( 4(0) (0)\7 
lim ({9,.... 69) = lim M*(o?,...,6) 


k—-o 


[n- Fay? 
i=1 


= (27/n)(1,...,1)’, 
which was to be proved. (The eigenvalues of M are a + pB, where p runs over the 
n-th roots of unity, and so the largest eigenvalue is a + B = 1, with eigenvector 


(,..., 1%.) 


Solved also by D. Callan, R. J. Chapman (United Kingdom), R. High, H. Kappus (Switzerland), 
M. E. Kuczma (Poland), J. G. Merickel (student), J. H. Lindsey II, O. P. Lossers (The Netherlands), 
A. Muller (Switzerland), M. Riazi, R. Stong, M. Vowe (Switzerland), Western Maryland College 
Problems Group, and the proposer. 


ADVANCED PROBLEMS 


6667. Proposed by George Baloglou and Phil Tracy, State University of New 
York, College at Oswego. 


If a,,a,...,a,,5,,5b5,...,5, are positive numbers such that a,a,...a, = 
b,b,...6, and 


> la, —_ a,| < >; |b; — bj, 
1<i<j<n 1<i<j<n 
(i) prove that 
n 
Yia;<(n-1)¥0b, and 
i=1 i=1 


(ii) show that the factor n — 1 cannot be replaced by a smaller one. 
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Editorial remark. In the first line of the problem as published there was a 
spurious factor 277 in the exponents. 


Solution by Allan Pedersen, Sgborg, Denmark. Let $; = 6;,, — 9; for i= 
1,2,...,2 —land ¢, = 27+ 6, — 6,, so that £"_,¢,; = 27, ¢, > 0. Define 6° 
= ¢,; for i = 1,2,...,n and 


PP = ag) + BON” (i= 1,2,...,0) 


for k = 1,2,..., where the subscript nm + 1 is interpreted as 1. For any fixed k the 
numbers $,...,¢% are positive real numbers with sum 27. Since #{* = 
6), — 0) modulo 27, it suffices to show that lim, ,,, 6% =27/n for i = 
1,2,...,n. Clearly 


(,...,6)) = M(%-Y,..., APY’, 


where M is the n by n matrix whose nonzero entries are M; ; = a fori = 1,...,n, 
M; ;4, = 8 fori = 1,2,...,n — 1, and M, , = B. The matrix M is doubly stochas- 
tic and for k>n-—1 the powers M* have all entries positive. By the 
Perron—Frobenius theory (see D. Ruelle, Statistical Mechanics: Rigorous Results, 
Benjamin, 1969, p. 136), it follows that 


Fim (49,549) = fim MEAP... 42) 


[n- en nee 
i=1 


= (27/n)(1,...,1)’, 
which was to be proved. (The eigenvalues of M are a + pB, where p runs over the 
n-th roots of unity, and so the largest eigenvalue is a + B = 1, with eigenvector 


(,..., D7.) 


Solved also by D. Callan, R. J. Chapman (United Kingdom), R. High, H. Kappus (Switzerland), 
M. E. Kuczma (Poland), J. G. Merickel (student), J. H. Lindsey IJ, O. P. Lossers (The Netherlands), 
A. Muller (Switzerland), M. Riazi, R. Stong, M. Vowe (Switzerland), Western Maryland College 
Problems Group, and the proposer. 


ADVANCED PROBLEMS 


6667. Proposed by George Baloglou and Phil Tracy, State University of New 
York, College at Oswego. 


If a,,a5,...,a,,6,,b5,...,b, are positive numbers such that a,a,...a, = 
b,b,...b, and 


> la, —_ a,| < >; |b; _ bl, 
1<i<j<n 1<i<j<n 
(i) prove that 
n 
Yia;<(n-1))¥0b,, and 
i=] i=] 


(ii) show that the factor n — 1 cannot be replaced by a smaller one. 
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6668. Proposed by Dragomir Z. Dokovié, University of Waterloo, Ontario, 
Canada. 


If n is even, show that there exists A € GL,(C) which cannot be represented in 
the form A = QS with Q € SO,(C) and S symmetric. (For every n it is known 
that every A € GL,(C) can be represented as A = QS with Q © OC) and S$ 
symmetric. ) 


6669. Proposed by Paul Erdos, Hungarian Academy of Sciences, Budapest. 


Prove that there is a constant c such that if n!= a!b! with 1 <a <b <n, then 
n—b<cloglogn. 
(It has been conjectured, but never proved, that if n > 10 and n!=a!b! with 


1<a<b<n, then b=n-—1 and n= al!. For n = 10 there is the exception 
10! = 6!7!.) 


SOLUTIONS OF ADVANCED PROBLEMS 


The Sum of the Prime Factors of a Positive Integer 


6599 [1989, 366]. Proposed by Nick MacKinnon, Winchester College, Winchester, 
Great Britain. 


Let m(n) denote the sum of the prime factors of n, repetitions counting. For 
example m(12) = 2 + 2 + 3 = 7. What is the asymptotic behavior of 
N 
Li m(n) 
n=1 
for large N? 
Solution by Aleksandar Ivic, University of Belgrade, Yugoslavia. For any fixed 
integer M > 1 we have 
d,N? d,N* dy N? 0 N? ' 
= +— — +°:' +—— + O| —— — ], 
log N log? N log” N log’ t! N (1) 
where d, = 7*/12 and dy,..., d,, are suitable constants. Formula (1) is proved in 
[2], where further references are given; a simpler proof is given in [3]. Here is a 
brief version of the proof. 


Let B(n) denote the sum of the distinct prime factors of n. If p runs over the 
primes and i, j over positive integers, we have 


Lmn)- Vibn)= LY Le (i-l)p 


L m(n) 
n=1 


n<N n<N n<N p'|n, p't!tn 
= YY p\n™| 
pi<N,j>2 
=N > p-' + of > p) + of > Np} 
p<N}/72 p<N}/?2 p<N'}/3 


= N loglog N+ O(N). 
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(A sharper formula for Lm(n) — XB(n) may be found in [4].) Thus it suffices to 
prove (1) with m replaced by B. 


Now let S(x) =, .,p. Then if w(x) denotes the number of primes not 
exceeding x and 


A(x) = a(x) — f (loge)! dt, 
2 
the estimate A(x) = O(x(log x)~“~') of de la Vallée Poussin gives 
x tat x 
S(x) = i lo? * i tdA(t) 


d 
=f a + xA(x) -f A(t) dt 


Hence, 


Mo 
DR 
a—~ 
a) 
— 
I 
As) 
I 
~ 
~~ 
= 
™~ 
3 
~— 


Njm tdt N? 
f + Ol ai 
2 logt m* log™ **(N/m) 


du N? 
=N? ¥ —— + O| —ym7 ws J- 
m<N/2°m UU log( N/u) log N 


I 
3 
IN 
= 
~\ 
N 
eT 


since 
N? N2 
en ae -o( |, 
man/2™ log (N/m), cyt vias logh*" N 
Thus 
n/2 |u| du ° 
= ———— + —_— . 
2 Bm) J u° log( N/u) | log’ *! =| 


If we now use 


1 - 1 M-1/ logy \* logu \™" 
log( N/u) ~ log N » Loew | +0 ead 


the desired result follows on writing [/* = /? - [x 2 and noting that 


olu| du 00 — 1 r 1 
J wo 1D 
Editorial comment. The assertion 
N 
y m(n) = 7°N*/(12log N) + O(N*/log* N) 
n=1 


given by the special case M = 1 of (1) occurs as formula (6.13) on page 150 of [1]. 
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REFERENCES 


1. J.-M. De Koninck and A. Ivi¢é, Topics in Arithmetical Functions, North-Holland Publishing Co., 
1980. 
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Solved also by Jean Anglesio (France), Joachim Herzog (Germany), Kee-Wai Lau (Hong Kong), 
Enoch K. S. Lee, Reiner Martin, Thomas Maxsein & Paul R. Smith (Germany), Ondrej Such 
(Czechoslovakia), and two referees. Two incomplete solutions were received. 


A Nondecomposition Theorem 


6620 [1990, 76]. Proposed by W. Allegretto and A. Meir, University of Alberta, 
Edmonton, Canada. 


Suppose f is locally integrable on R, but 
[ f(s) lax =. 
R 


Show that for every p > 0 there exists ¢ € L?(R) such that it is impossible to have 


f(x)O(x) =8(x)h(x) (xe R), 
where g € L/(R) and h € L?(R). 


Solution by Adam Riese, Wright State University, Dayton, OH. It is enough to do 
this for non-negative f that vanish for x < 0. 
Define k,, inductively by k, = 0 and 


[O° fe) ae=1 (n = 1,2,...). 


Let e = p/2 and for n > 1 let 


n 


r(x) =~, k,-1 <x <k,. 


We now show that $(x) = r(x)¢*®/?f(x)'/” is a function of the desired sort. 
Since 


f(x)’ dx = for "F) dx = y i" hx) dx < %, 


it is in L?. Now assume f(x)d(x) = g(x)h(x), where g € L' and h € L?. Let A 
be the set of all x > 0 on which g(x) > r(x) f(x), and let B be the complement of 
this set. For x in B we have h(x) > 6(x)/r(x), so that 


h(x)? > r(x)" ? f(x) > r(x) f(x). 
But 


f a(x) de + fh(xy? de> [ r(x) f(x) ae = rs. 


a contradiction to g € L' and h € L?. 


Solved also by Jaime E. Munoz Rivera (Brazil), Kenneth Schilling, and the proposers. 
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No Such Function Exists 
6622 [1990, 161]. Proposed by Robert C. Dalang, Ecole Polytechnique Fédérale, 
Lausanne, Switzerland. 


Does there exist a Lebesgue measurable function f: [0,1] — [0,1] which is 
positive on a set of positive measure and which satisfies 


oo gr-l_4 1/2 
27+1)/2" 2j7+2)/2" 
yy Lb [pore f(x) dx [OY f(x) ax} < +0? 
n=1 j=0 25/2" (2j7+1)/2” 


Solution by Marcin E. Kuczma, Institute of Mathematics, University of Warsaw, 
Poland. No. Assume such an f exists. Clearly f(x) > c > 0 ona set S of positive 
measure. There is an interval J (in fact, one of the form [ p2~%,(p + 1)2~%] with 
positive integers p and q) for which the measure inequality 


m(I NS) > gm(1) (1) 


is valid. This follows from the Lebesgue density theorem (or more elementarily, 
from the outer regularity and translation invariance of Lebesgue measure). 

Fix n > q and partition J into 2"~4~ + equal nonoverlapping intervals J, each of 
length 2-"*! GG = 1,...,2"-47!). We easily see that at least half of them satisfy 


mI; S) > am(J;). (2) 
Otherwise, 

mU;\ S) > m(U;)/4 
would hold for more than half of them, and by summation 

m(I\ S) > m(I)/8, 


contrary to (1). 

Hence we may choose 2”~4~ intervals from the I, each satisfying (2); call them 
J, iG =1,...,2"-4-*). Now subdivide each J, into two equal halves H, and K,,. 
Then 


mH, S) >m(J;N 8S) — m(K;) 
> gm(J;) — ym(J;) = 2", 


and similarly for m(K; N S). Hence 
[ff f> C727 en 2 1 < i < 224-2. 
H, °K, 


and the sum of the square roots of these products is at least 
27-"-19"-4~2-6 = 2-94-36 > 07, 
Thus all terms of the outer series with n > q + 1 exceed a fixed positive constant. 


Solved also by Duane W. Bailey, Richard A. Stong, and the proposer. 
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Riemann Integrability and Continuity 


6626 [1990, 252]. Proposed by Bruce G. Dearden and Michael B. Gregory, 
University of North Dakota, Grand Forks, ND. 


It is well known that if @ is a continuous function on R, then the composition 
go f is Riemann integrable on [a, b] whenever f is Riemann integrable on [a, b]. 
Is the converse true? That is, suppose @: R > R has the property that dc f is 
Riemann integrable on [a, b] whenever f is Riemann integrable on [a, b]. Does it 
follow that @ is continuous? 


Solution I by James R. Munkres, MIT, Cambridge, MA. Yes. This is a conse- 
quence of a stronger result. 


THEOREM. Let @: R — R have the property that ¢° f is Riemann-integrable on 
[a, b] whenever f is continuous on (a, b]. Then @ is continuous. 


Proof. Let C be a closed subset of [a,b] that contains no non-trivial open 
interval but has positive measure (see, e.g., Problem 3-11 of [2]; such sets are 
sometimes called ‘“‘fat Cantor sets’). Let g: R — R be a continuous function that is 
positive on (a, b) \ C and zero elsewhere. Given p € R define another continuous 
function by 


f(x) =p + a(x). 
By Lebesgue’s theorem (f is Riemann integrable if and only if f is bounded and 
the set of points at which f is discontinuous has Lebesgue measure zero) the 
function ¢° f must be continuous at some point c of (a, b) belonging to C. We 
claim that ¢@ is continuous from the right at p. Given e > 0, choose 6 > 0 such 
that 


lo( f(x)) -— o(f(c))| <e€ for |x -el <6. 
Choose d so 0 < d —c <6 and g(d) > 0. For 


y=[f(c), f(d)] =[p, f(d)] = [p,p +. 8(d)] 


there is an x €[c, d] with f(x) = y. Hence, 


ld(y) — 6(p)| =|6(y) — o(p + 8(c))| =|6(F(%)) - o(f(c))| <e. 


Since a similar argument with f(x) = p — g(x) shows that @ is continuous from 
the left at p, the result follows. 

The theorem is actually valid with the hypothesis “f is of class C®.”’ To prove 
this fact, replace the g(x) above by a C® function defined as follows. Write the 
complement of C relative to (a, b) as the union of a countable number of open 
intervals U,. Let g,; be a C™ function positive on U; and zero elsewhere. Let M, be 
the maximum of the absolute values of g, and its derivatives of order at most i. 
Finally, set 


Solution IIT by Lee Mohler, St. Martin’s College, Lacey, WA. Yes; we prove the 
contrapositive. Suppose ¢ is discontinuous at some x, € R. We will construct a 
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Riemann integrable function f on the closed interval [0,1] such that go f is not 
Riemann integrable. Moreover, a somewhat more elaborate construction provides 
a continuous such f. 

There is a real sequence {x,} such that x, > x, but y(x;) — y where y # ¢(x)); 
the possibility that y = + does not change the argument. Let qg,,q,,... be an 
enumeration of the set @ of rational numbers in [0,1]. Let P be the set of 
irrationals. Define f on [0, 1] by setting f(q,;) =x; and f(x) = x, for all x in P. It 
is straightforward to check that f is continuous on P. (This f is similar to the 
well-known example of a function continuous on P and discontinuous on Q. See, 
e.g. [1, p. 27].) The discontinuities of f are countable, so f is Riemann integrable. 

Given x in P there is a subsequence {q,,,} of {q,} that converges to x. Observe 
that 


lim 9° f(4uay) _ lim (2) =y F(X) =gef(x). 


Hence go f is discontinuous on P and so is not Riemann integrable. 

We now sketch a somewhat more elaborate construction that provides a 
continuous f. Recall that the classical Cantor set in [0, 1] is obtained by recursively 
deleting a sequence of “open middle thirds” from a sequence of closed intervals 
starting with [0, 1]. If somewhat smaller open intervals are removed at each stage 
of the construction, we obtain a set homeomorphic to the Cantor set, but with 
positive Lebesgue measure [1, p. 88]. Let P, denote this set, and Q, the set of 
midpoints of the deleted intervals. Let q,,q,,... be an enumeration of Q,. Define 
f as above on the set P) U Qj. Just as before, f is continuous on P,. Since each 
point of Q, is isolated in Py) U Qo, the function f is continuous on Q,. Moreover, 
P, U Qo is closed. Hence by the Tietze extension theorem, f can be extended 
continuously to all of [0,1]. But again as before, go f is discontinuous at every 
point of Py, a set with positive Lebesgue measure. Therefore go f is not Riemann 
integrable. 


REFERENCES 


1. B. Gelbaum and J. Olmsted, Counterexamples in Analysis, Holden-Day, 1964. 
2. M. Spivak, Calculus on Manifolds, Benjamin, 1965. 


Solved also by William C. Allen, D. R. Arterburn, Duane W. Bailey, Michael W. Botsko, Central 
Michigan University Problem Group, Frank J. Curtis, Lawrence E. Flynn, Michael B. Gregory, Don 
Hadwin, Chris Hill, Marcin E. Kuczma (Poland), Francisco Garcia Mazario (Spain), James G. Merickel, 
Mark D. Meyerson, Harry I. Miller (Yugoslavia), Jean-Marie Monier (France), Howard and Vicky 
Morris, Adam Riese, Kenneth A. Ross, Giinter Rote (Austria), Kenneth Schilling, Richard Stong, Dale 
Varberg, Xianggen Xia, and the proposers. One incorrect solution was received. 


Triangles With Integer Sides Whose Area Is the Square of an Integer 


6628 [1990, 350]. Proposed by Robert A. Melter, Long Island University, 
Southampton, NY. 


Call a triangle a Heron Triangle if it has integer sides and integer area. Fermat 
showed that there does not exist a Heron right triangle whose area is a perfect 
square. However, the triangle with sides 9, 10, 17 has area 36. 

Prove that there are infinitely many Heron triangles whose sides have no 
common factor and whose area is a perfect square. 


772 PROBLEMS AND SOLUTIONS [October 


Riemann integrable function f on the closed interval [0,1] such that go f is not 
Riemann integrable. Moreover, a somewhat more elaborate construction provides 
a continuous such f. 

There is a real sequence {x,} such that x, > x, but o(x;) — y where y # 9(x)); 
the possibility that y = + does not change the argument. Let q,,q,,... be an 
enumeration of the set Q of rational numbers in [0,1]. Let P be the set of 
irrationals. Define f on [0, 1] by setting f(q,) = x; and f(x) = x, for all x in P. It 
is straightforward to check that f is continuous on P. (This f is similar to the 
well-known example of a function continuous on P and discontinuous on Q. See, 
e.g. [1, p. 27].) The discontinuities of f are countable, so f is Riemann integrable. 

Given x in P there is a subsequence {q,,,)} of {q,} that converges to x. Observe 
that 


lim 9 ° f( doc) — lim (2) =y # P( Xo) — peo f(x). 


Hence go f is discontinuous on P and so is not Riemann integrable. 

We now sketch a somewhat more elaborate construction that provides a 
continuous f. Recall that the classical Cantor set in [0, 1] is obtained by recursively 
deleting a sequence of “open middle thirds” from a sequence of closed intervals 
starting with [0,1]. If somewhat smaller open intervals are removed at each stage 
of the construction, we obtain a set homeomorphic to the Cantor set, but with 
positive Lebesgue measure [1, p. 88]. Let P, denote this set, and Q, the set of 
midpoints of the deleted intervals. Let g,,q,,... be an enumeration of Q,. Define 
f as above on the set P, U Q). Just as before, f is continuous on P). Since each 
point of Q, is isolated in Py) U Qo, the function f is continuous on Q). Moreover, 
P, U Qo is closed. Hence by the Tietze extension theorem, f can be extended 
continuously to all of [0,1]. But again as before, go f is discontinuous at every 
point of P,, a set with positive Lebesgue measure. Therefore yo f is not Riemann 
integrable. 


REFERENCES 


1. B. Gelbaum and J. Olmsted, Counterexamples in Analysis, Holden-Day, 1964. 
2. M. Spivak, Calculus on Manifolds, Benjamin, 1965. 


Solved also by William C. Allen, D. R. Arterburn, Duane W. Bailey, Michael W. Botsko, Central 
Michigan University Problem Group, Frank J. Curtis, Lawrence E. Flynn, Michael B. Gregory, Don 
Hadwin, Chris Hill, Marcin E. Kuczma (Poland), Francisco Garcia Mazario (Spain), James G. Merickel, 
Mark D. Meyerson, Harry I. Miller (Yugoslavia), Jean-Marie Monier (France), Howard and Vicky 
Morris, Adam Riese, Kenneth A. Ross, Ginter Rote (Austria), Kenneth Schilling, Richard Stong, Dale 
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Triangles With Integer Sides Whose Area Is the Square of an Integer 


6628 [1990, 350]. Proposed by Robert A. Melter, Long Island University, 
Southampton, NY. 


Call a triangle a Heron Triangle if it has integer sides and integer area. Fermat 
showed that there does not exist a Heron right triangle whose area is a perfect 
square. However, the triangle with sides 9, 10, 17 has area 36. 

Prove that there are infinitely many Heron triangles whose sides have no 
common factor and whose area is a perfect square. 
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Solution I by Charles R. Maderer, Indiana University of Pennsylvania, Indiana, 
PA. For each positive integer k define 


a(k) =20k4+ 4k? +1,  b(k) = 8k° — 4k4 — 2k? +1, 

c(k) = 8k° + 8k* + 10k’. 
Clearly 

max{a(k),b(k)} <c(k) < a(k) + b(k), 
so that a(k), b(k), and c(k) are the sides of a triangle with longest side c(k). By 
Heron’s formula this triangle has area 
{(2k)(2k? — 1)(2k? + 1)". 
Since 
2 


a(k) + b(k) — c(k) =2(2k2-1)°, ak) + B(k) + c(k) = 2(2k? + 1), 


it is immediate that a(k), b(k), and c(k) have no common factor greater than one. 
The assertion follows. 


Solution IT by John Henry Steelman, Indiana University of Pennsylvania, Indiana, 
PA. An infinite family of such Heron triangles may be constructed inductively. 
Define a, = 9 and a, ,, = (2a, — 1)? for n > 1. Since 2a, — 2 = 4? and 


2An41 ~2= 4a,(2a, — 2), 
it is clear that both a, and 2a, — 2 are perfect squares for all n > 1. Now the 
numbers 
An+is (ay a L)an41 + I, Anant — 1 
are easily seen to have no common factor, and to be the side lengths of a triangle. 
This triangle has semiperimeter a,a, ,,, So by Heron’s formula its area is 
Anansy1” (a,, — L)an41 (Ay 41 a 1) ‘l= An +14,(24,, — 2), 
a perfect square. 


Solution IIT by Jim Buddenhagen, Southwestern Bell Telephone Co., St. Louis, 
MO. We show that there are infinitely many pairs of Heron triangles which share 
the same square area. To see this suppose m is an odd positive integer greater 
than 1 such that (m? — 1)/2 is a perfect square. From the theory of the Pell 
equation this means that m has the form 

m = (3 + 2v2)'/2 + (3 — 2v2)'/2 
for some positive integer j. Then the triangle with sides 
(m? + m’) /2 — 1,(m3 — m?)/2 +1, m? 
and the triangle with sides 
m> — (m—1)/2,m*> — (m+ 1)/2,m 


both have area m*(m* — 1)/2. For example if j = 1 and m = 3, the Heron 
triangle with sides 3, 25, 26 has the same area 36 as the Heron triangle with sides 
9, 10, 17. 
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It is not clear if there is an absolute bound on the number of Heron triangles 
that can share the same square area; the triangles with sides (113, 3137, 3150), 
(800, 1241, 2009), and (245, 1443, 1448) all have area 4207. 


Editorial comment. In the article On A* + B* + C* = D*, Math. Comp., 51 
(1988) 825-835, Noam Elkies proved that there are infinitely many quadruples of 
positive integers A, B, C, D such that 


gcd( A, B,C, D) = 1, At + B*+C*=D*. 


Since fourth powers are congruent to 0 or 1 modulo 16, in such quadruples exactly 
one of A, B,C is odd. Several solvers remarked that if we choose 

a = B*4+C7%, b =A*4+ C4, c= A*t+ Bt, 
then gcd(a, b,c) = 1 and a, b, and c form a triangle with area (ABCD)?. This was 
the only solution of which the editors were aware when the problem was pub- 
lished. 

Several solvers used the theory of elliptic curves (cf. §5.7 of I. Niven, H. S. 
Zuckerman, and H. L. Montgomery, An Introduction to the Theory of Numbers, 
Wiley, 1991 or J. H. Silverman, The Arithmetic of Elliptic Curves, Springer, 1986). 
For example, Jim Buddenhagen noted that a Heron triangle whose sides have no 
common factor is uniquely determined by its angles, or equivalently by the 
tangents ¢,,t,,¢, of its half-angles. The formulas 


t,=F/{s(s—a)}, t,=F/{s(s—b)}, t= F/{s(s —)}, 
where F is the area and s is the semi-perimeter of the triangle, show that for a 
Heron triangle ¢,, t,, t; are rational numbers. For example, the triangle with sides 
9, 10, 17 and area 36 has (t,,t,,t;) = (2/9, 1/4,2). The formula ¢,t,t, = F/s* 
shows that the area of a Heron triangle is a perfect square if and only if ¢,t,t, is 


the square of a rational number. Since the half-angles of a triangle add up to 7/2 
and thus ¢t, = (1 — ¢,t,)/(t, + t,), we are led to the family of elliptic curves 


u> = tytp(ty + t2)(1 — tifa), (*) 
where f,, say, is a parameter and ¢, and wu are variables. We need not be 
concerned whether ¢, is positive or negative, since if (¢,,u) is on (*) so is 
(—1/t,,u/t?). To complete the solution by this method we need only show that 
for some particular value of t,, for example t, = 1/4, the elliptic curve (*) has 
positive rank. 
Nathan J. Fine pointed out that formula (26) of his paper, “On rational 
triangles,” this MONTHLY, 83 (1976) 517-521, provides a solution along the same 
lines as Solution I. 


Solved also by Kevin Brown and Daniel Dufresne, Nathan J. Fine, Lorraine L. Foster, Richard K. 
Guy (Canada), Friend H. Kierstead, Jr., L. E. Mattics, Les Reid, David G. Robinson, Raul A. Simon 
(Chile), and Ivan Vidav (Yugoslavia). One anonymous solution was also received. 
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How Does One Cut A Triangle? By Alexander Soifer, Center for Excellence in 
Mathematical Education, 885 Red Mesa Drive, Colorado Springs, CO 80906, 
1990, xiii + 138 pp. 


Murray S. KLAMKIN 
Department of Mathematics, University of Alberta, Edmonton, Alberta, Canada T6G 2G1 


It is some thirty years since Dieudonné’s Royaumont Seminar Address [1] in 
which he proclaimed “Euclid Must Go” and that it should be replaced by 
two-dimensional linear algebra. Dieudonné’s quarrel was not with the purpose of 
geometry, which he considered important, but with the method of teaching 
geometry. Dieudonné’s reasons were that Euclid as taught was a “Process Fantas- 
tically Laborious”’ as well as having “the affine and metric properties hopelessly 
mixed up.” While there are many who would agree with the latter reasons, there 
are many who do not agree with the linear algebra replacement as proposed. For a 
good late survey of this, see Burn [2] and references therein. Burn notes that 
Freudenthal [3] in his chapter on “The Case of Geometry” offers a profound 
critique of Dieudonné and recommends a piecemeal school geometry program 
with strong intuitive spatial content. As to ‘““What is Geometry?,” see Freudenthal 
[3], Chern [4,5], Yaglom [6] and Kelly [7]. 

Although Dieudonné’s message was not heeded in the U.S. and Canada, it now 
appears that in our secondary schools Euclid and other geometry is essentially 
gone and has not been replaced. This treatment of geometry in our schools should 
be contrasted with that in the U.S.S.R., Hungary and Romania, for example, in 
which geometry is covered repeatedly in the different grades. They have many 
good books with challenging problems, e.g., [8—17] (fortunately, these have been 
translated into English) unlike most of the sterile ones used in the U.S. and 
Canada which take great pains to prove very obvious theorems but not unobvious 
attractive theorems and insist that proofs be done in “two columns.’ 

Some time ago Poincaré addressed the downgrading of geometry in his classic 
book The Foundation of Science [18] which I strongly believe should be required 
reading for all mathematical educators and teachers. I quote: 


It looks as if geometry could contain nothing which is not already included in.algebra or analysis; 
that geometrical facts are only algebraic or analytic facts expressed in another language. It might 
then be thought that after our review there would remain nothing more for us to say relating 
specially to geometry. This would be to fail to recognize the importance of well-constructed 
language, not to comprehend what is added to the things themselves by the method of expressing 
these things and consequently of grouping them. 

First the geometric considerations lead us to set ourselves new problems; these may be, if you 
choose, analytic problems, but such as we never would have set ourselves in connection with 


"Apparently in the 1990 geometry syllabus in Alberta, all proofs have been removed. 
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analysis. Analysis profits by them however, as it profits by those it has to solve and satisfy the 
needs of physics. 

A great advantage of geometry lies in the fact that in it the senses can come to the aid of 
thought, and help find the path to follow, and many minds prefer to put the problems of analysis 
into geometric form. Unhappily our senses can not carry us very far, and they desert us when we 
wish to soar beyond the classical three dimensions. Does this mean that, beyond the restricted 
domain wherein they seem to wish to imprison us, we should rely only on pure analysis and that 
all geometry of more than three dimensions is vain and object-less? The greatest masters of a 
preceding generation would have answered ‘yes’; today we are so familiarized with this notion 
that we can speak of it, even in a university course, without arousing too much astonishment. 

But what good is it? That is easy to see: First it gives us a very convenient terminology, which 
expresses concisely what the ordinary analytic language would say in prolix phrases. Moreover, 
this language makes us call like things by the same name and emphasize analogies it will never 
again let us forget. It enables us therefore still to find our way in this space which is too big for 
us and which we can not see, always recalling visible space, which is only an imperfect image of it 
doubtless, but which is nevertheless an image. Here, again, as in all the preceding examples, it is 
analogy with the simple which enables us to comprehend the complex. 


The downgrading of geometry also affects many courses down the line. In the 
last few years, there has been lots of activity about improving our calculus courses 
[19-20]. No doubt there are improvements to be made in our textbooks and 
teaching methods but another improvement should be in the students’ secondary 
school preparation. In particular, I and a number of my colleagues have found that 
wherever there is any geometry involved in the calculus, and there is quite a bit, 
students have all kinds of difficulties. In this regard, I quote from the beginning of 
the late N. Steenrod’s first lecture at a geometry conference [21] which addresses 
this point and which is highly in tune with Pioncaré (above). 


In this series of lectures I am going to discuss the geometric content of the freshman and 
sophomore mathematics courses. I shall criticize what we as teachers are now doing and suggest 
what we might do. Let me begin with what I believe to be the chief criticisms. 

(1) Although geometry pervades all of mathematics and is present at every stage of a 
development, too often do we fail to point this out to our students. We rely on analytical 
formulations since we realize that they are complete and we are in a hurry to get on to other 
material. We do not take time to look at geometric formulations. 

(2) We are too greatly impressed by the rigor of analysis. We seem to feel that geometry is 
not rigorous, or at least that the background needed for rigor is not avaialble. We feel that it is 
better not to do anything that is not rigorous. I think we are buffaloed too much by this. 

(3) When we do present geometry, it is too often the instructor who does the geometry while 
the student is merely a passive recipient. We present the geometry to him in order to explain the 
analysis, but then we require him to do only the analysis—no geometry. 

(4) We tend to avoid geometric formulations of questions in examinations. Questions are 
difficult to formulate geometrically. Almost any time you try such a question, you find that a 
large group of students misinterpret it. Such questions are hard to grade because the answers are 
so varied. The absence of geometric questions on final exams tends to degrade the geometric 
content of the course, and leads to its neglect. 

Now that I have listed the main criticisms, let me take them up one at a time and fill in some 
details. The pervasiveness of geometry is an idea that goes back to Descartes, for a coordinate 
system in the plane or in space sets up an equivalence between geometry and algebra-analysis. 
Every geometric proposition can be translated into its algebraic-analytic analog and vice versa. I 
am not proposing that we lead the student through the details of the formal isomorphism 
between these two systems, but I am trying to remind you that the geometry is always there and 
to keep in mind that the geometric language for the conversion is always at hand. For example, 
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here are a few geometric terms and their algebraic-analytic counterparts: 


Geometric language Algebraic-Analytic language 
point, vector number triple (x, y, z) 
projection coordinate, variable 
surface equation 

plane linear equation 

region system of inequalities 
mapping, transformation function 

neighborhood £,0 

limit (using deleted neighborhoods) limit (using e, 6) 

tangent derivative 


One might ask, in view of this equivalence, why bother with the geometry at all? The answer is 
clear to all of us. The first main reason is that many applications of calculus are to problems of 
conics, oscillating systems, the 2-body problem. A complete solution of such a problem has three 
main steps: The reformulation in analytic terms, the derivation of an analytic solution, and the 
interpretation of the solution in geometric terms. 


Geometry Analysis 
Problem _——— 
Solution __— - 


A second reason is psychological. Two views of the same thing reinforce one another. Most of us 
are able to remember the multitudinous formulas of analysis mainly because we attach to each a 
geometric picture that keeps us from going astray. Even better than that, the geometric view of a 
problem helps us to focus on the invariants and to weed out the irrelevant details. For example, 
a poor choice of a coordinate system may lead to a horrible mess in the analytic formulation, but 
with some geometric insight, we may be able to choose a much better coordinate system. 


Since students are not getting enough geometry in school, one of the avenues 
left is self-study. Any new attractive books in this field are indeed very welcome. 
How does one cut a triangle? is one of these and is mostly combinatorial geometry 
that is accessible to bright high school and college students even without too much 
prior knowledge. Also, there is quite a number of unsolved problems which will 
challenge professional mathematicians and other scientists. There are various 
monetary awards for the first received acceptable solutions of some of these latter 
problems. Like the challenge of climbing mountains because they are, there is a 
mental challenge of solving the problems not only because they are here (in this 
book), but also because they are simple to comprehend but still challenging. 

Two of the opening problems are “Grand Problem I. Find all positive integers 
n, such that every triangle can be cut into n triangles congruent to each other,” 
and “Grand Problem II. Find all positive integers n, such that every triangle can 
be cut into n triangles similar to each other.” Even deciding which of these two 
problems is harder is a problem. Although Dieudonné is strongly opposed to 
“triangles,” he may be attracted to the solutions of these problems since one of 
them requires quite a bit of linear algebra. However, I think Soifer’s excursion into 
the linear algebra necessary in the solution is perhaps too brief. 

Another set of problems deals with the problem of given a certain number of 
points within or on a triangle to determine the maximum of the minimum area 
formed by three of the points. In particular, three proofs are given to show that for 
any five points in a triangle of area 1, there are three points that form a triangle of 
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area not exceeding 1/4. A neater solution appeared in 1980 by Yang and Zhang as 
a lemma [22]*. With respect to points in a square, see also [23], [24]; with respect to 
a disk, see “100 Research Problems in Discrete Geometry,” 1986, #7 (privately 
printed by W. O. Moser, McGill University. This is an excellent reference source). 

The final set of problems and conjectures deal with extremal properties of 
convex sets. A number of these are due to Paul Erd6ds. One of the interesting 
functions defined here is S$ (CF). If 0 < @ < 1 and F is a convex figure of unit area, 
then S,(F) is the smallest n > 3 such that if n points lie within or on the 
boundary of F, there are three of these points which form a triangle of area < a. 
Andy Liu notes that a more informative function would be a,(n) = the maximum 
of the minimum area triangle formed from 3 of the n points. 

I like the style of the book. In this regard, I agree with Cecil Rousseau who says 
“it is an adventure story, complete with interesting characters, moments of 
exhilaration, examples of serendipity, and unanswered questions.” Although I 
gather from this book that Soifer is an enthusiastic teacher, which is a big plus in 
teaching, I would have preferred him being a little more modest in his writing. 

Finally, I would have preferred to see the book more moderately priced since its 
main target is “young, talented mathematicians who are still in high school or 
college” and also since it is paperbound of modest length. 
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Concrete Mathematics: A Foundation for Computer Science. By Ronald Lewis 
Graham, Donald Ervin Knuth, and Oren Patashnik, Addison-Wesley, 1989, 
xili + 625 pp. 


EDWARD A. BENDER 
Center for Communications Research, San Diego, CA 92121 


Most current calculus texts could almost have been created from one another 
with scissors, tape, and a color Xerox machine. Attempted innovations fall by the 
wayside. Rather than being due to the perfection of the standard calculus course, 
this state of affairs seems to me to be a sequence of the post-Sputnik push for 
more engineers and for improved science education. This led to a huge demand for 
“service courses” having a fairly standard list of ‘“must-cover” topics. Many of the 
intended students had little interest in mathematics for its own sake. These 
evolutionary pressures shaped the standard calculus text into a massive tome, 
designed to program uninterested students in the symbol-recognition and rote- 
manipulation skills required by various science courses. 

The proliferation of computers has led to a demand for computer scientists 
dwarfing the post-Sputnik demand for engineers. There is a list of must-cover 
mathematical topics for computer science students that has been drawn from 
combinatorics, logic, mathematical linguistics, and probability theory. Many com- 
puter science majors are not interested in mathematics per se. The setting is 
familiar. Is the basic mathematics course for computer scientists doomed to the 
same fate as the basic calculus course for engineers and scientists? The spate of 
new books on ‘‘discrete mathematics” suggests that it is. 

There is another strategy for teaching the mathematics need in computer 
science: Instead of relying only on high school mathematic training, build on the 
mathematical maturity attained by surviving a basic college calculus sequence. The 
mathematics can then be covered in the depth needed to do computer science 
instead of at the more superficial level needed to understand computer science 
textbooks. Graham, Knuth, and Patashnik apparently subscribe to this strategy. 

The list of must-cover topics is, however, too long for in-depth coverage in a 
single course. One obvious approach to this problem is to break the courses down 
by mathematical discipline; another is to give a brief introductory course and then 
teach the remaining mathematics as needed in the computer science courses. 

What approach is taken in the vaguely titled Concrete Mathematics: A Founda- 
tion for Computer Science? Neither. The text crosses mathematical disciplines, 
including material from combinatorics, probability theory, and number theory; the 
material is not integrated with computer science—an extended discussion of 
hashing is the only direct applications—and the level is deeper than a brief 
introductory course. Instead, the manipulative skills and associated concepts that 
are useful in doing computer science provide the unifying theme. The text grew out 
of a course initiated by Don Knuth in order to teach computer science students 
“what he wished somebody had taught him” and is primarily a much more 
leisurely treatment of material in Section 1.2 of the Art of Computer Programming 
[1]. A more informative title might be Concrete Mathematics: The Computational 
Foundations of Computer Science. 
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The approach in Concrete Mathematics is successful. 

Certain otherwise strange features of the text can be understood in light of the 
computational approach. To mention a few: The emphasis on computational skills 
as well as concepts leads to rather long chapters. Although some enumerative 
combinatorics is discussed, graph theory is not, presumably because its concepts 
are not inherently computational but instead tend to rely on other areas of 
mathematics for computational tools. On the other hand, a chapter is devoted to 
number theory (which plays less of a role in computer science than does graph 
theory) because some computational skills in number theory are important in 
doing computer science. 

I was saddened to find that the combinatorial understanding of generating 
functions is left at the Polya picture-writing stage—one may want to use Wilf’s 
Generatingfunctionology [2] as a supplementary text. Unfortunately, much of the 
ninety-page chapter that is devoted to manipulating binomial coefficients will need 
revision in light of the recent work of Zeilberger and Wilf. I was surprised to learn 
in Section 8.2 that Chebyshev’s inequality is the main reason for computing 
variance—I think that the heuristic result given by assuming asymptotic normality 
is aS important as the hard bounds given by Chebyshev. In contrast to these 
quibbles, there are many important pleasant surprises in this well-written text. For 
example: 


¢ Careful attention is given to imparting the ability to usefully manipulate the 
sorts of expressions that arise in the analysis of algorithms. A student who has 
to do some manipulations would profit from the relevant chapters. (“Student” 
is too limited—I learned some interesting things.) Particularly noteworthy are 
the chapters on sums and integer functions because they contain important 
ideas and techniques that I haven’t seen in any other text. 

e There is an introduction to asymptotic analysis which should make students 
reasonably comfortable with big-oh notation and teach them techniques for 
estimating simple sums. (Perhaps a future edition will include some discussion 
of obtaining estimates from generating functions.) 

¢ There is a nice selection of problems with ratings from routine to research 
level, with solutions and /or literature references included. Many of these are 
not the boring crank-turning type that flood the standard calculus course. 

e A bibliography with over 300 entries is a useful resource for the student or 
instructor who wishes to pursue a subject further. 


Even if you are not interested in considering this book for a course or in reading it 
yourself, I suggest you at least look at the marginal notes. Many have been written 
by students, and these range from complaints to witticisms to helpful hints for 
other students. This idea might liven up some calculus texts more than any number 
of colored pictures, historical notes or “applications.” 

In view of the broad mathematical basis of computer science, I hope some of 
you will regard this text as a challenge to help provide a diversity of well-written 
texts on the mathematical foundations for doing computer science. 


REFERENCES 


1. D.E. Knuth, The Art of Computer Programming, volume 1: Fundamental Algorithms, 2nd edition, 
Addison-Wesley, 1973. 

2. H.S. Wilf, Generatingfunctionology, Academic Press, 1990. Reviewed in this MONTHLY, 97 (1990) 
864-866. 
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Mathematics Appreciation, T(13-14: 1). The 
Mathematical Palette. Ronald Staszkow, Robert 
Bradshaw. Saunders College, 1991, xiii + 534 pp, 
$32. [ISBN: 0-03-033274-5] A visually attractive, 
lively, and appealing introduction to mathematics 
as a liberal arts subject, emphasizing historical and 
practical approaches. Assumes some command of 
symbolic algebra (through elementary trigonome- 
try), but most content is reviewed in the text. Calcu- 
lator use—together with a short history of mechan- 
ical calculation—is reviewed in an appendix. Con- 
tents include liberal arts perspectives on algebra, ge- 
ometry and trigonometry, logic, finance, probabil- 
ity and statistics, and calculus. Basic ideas, rather 
than detailed calculations, are stressed. Chapters 
include suggested study projects, with suggested ref- 
erences. PZ 


Precalculus, T(13). Precalculus, Second Edition. 
Jerome E. Kaufmann. PWS-Kent, 1991, xv + 640 pp, 
$35. [ISBN: 0-534-92474-3] Covers traditional top- 
ics with emphasis on equations and inequalities, solv- 
ing problems, graphing techniques, and functions; 
this edition includes two new sections and minor 
changes to several others. Supplements include an- 
swer key, test banks (printed and computerized), and 
a TrueBASIC precalculus package. JNC 


Precalculus, T(13: 1). Algebra for College Stu- 
dents, Third Edition. Bernard Kolman, Arnold 
Shapiro. Harcourt Brace Jovanovich, 1991, xv + 675 
pp, $32. [ISBN: 0-15-502162-1] Retains structure 
of earlier editions while including new emphasis on 
calculator usage and mathematical modeling, earlier 
introduction of exponent notation and linear equa- 
tions and inequalities, and added sections on linear 
programming and probability. JNC 


Precalculus, T(13: 1). College Algebra, Second 
Edition. Thomas W. Hungerford, Richard Mercer. 
Saunders College, 1991, xv + 570 pp, $34. [ISBN: 
0-03-054243-X] This major rewriting includes ear- 
lier coverage of complex numbers, easier method 
of graphing polynomial functions, more material on 
conic sections and new sections on variation, par- 
tial fractions, matrix algebra, systems of inequali- 


ties, linear programming, sequences, infinite series, 
and probability. A Student Solutions Manual with 
Graphing Calculator Supplement is available. (First 
Edition, TR, August-September 1982.) JNC 


Education, T*, S, P. Geometry for the Classroom. 
C. Herbert Clemens, Michael A. Clemens. Springer- 
Verlag, 1991, xiii + 335 pp, $29.95 (P). [ISBN: 0-387- 
97564-0]; Geometry for the Classroom: Ezercises and 
Solutions, vi + 167 pp, $19.95 (P). [ISBN: 0-387- 
97565-9] An innovative text in workbook format; 
presents concepts intuitively, then more rigorously in 
the construction, proof (stresses rigid motions) and 
computer program (uses LOGO) sections. Appro- 
priate for high school students, but would also pre- 
pare elementary teachers to implement recommended 
changes in geometry teaching. JNC 


Foundations, S, L*. Philosophy and Mathemat- 
ics: From Plato to the Present. Robert J. Baum. 
Freeman Cooper, 1973, x + 320 pp, $31.25 (P). 
[ISBN: 0-87735-514-2] Re-issue of a useful anthol- 
ogy of classic sources of philosophical commentary 
on things mathematical, including relevant excerpts 
from Plato, Aristotle, Descartes, Hobbes, ..., Kant, 
Mill, Frege. First published in 1973 (TR, May 
1974). LAS 


Foundations, T*(13-14: 1), L. Introduction to 
Abstract Mathematics, Second Edition. John F. Lu- 
cas. Ardsley House, 1990, x + 382 pp, $41.95. [ISBN: 
0-912675-73-X] Well-written text for introduction to 
proof or discrete mathematics class. Covers logic, 
proof strategies, sets and cardinality, relations, func- 
tions. Some examples and exercises assume knowl- 
edge of first-semester calculus. More emphasis on 
formal logic than most comparable texts. KES 


Number Theory, P, L. Cryptology and Computa- 
tional Number Theory. Ed: Carl Pomerance. Proc. 
of Symposia in Appl. Math., V. 42. AMS, 1990, 
xi + 171 pp, $57. [ISBN: 0-8218-0155-4] Proceed- 
ings of an AMS short course. Contains excellent 
articles on primality testing (A.K. Lenstra), factor- 
ing (Pomerance), discrete logarithms (McCurley), 
knapsack systems (Odlyzko), provably secure sys- 
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tems (Goldwasser), and pseudorandom number gen- 
erators (Lagarias). SG 


Linear Algebra, T(14: 1). Elementary Linear Al- 
gebra, Fourth Edition. Stanley I. Grossman. Saun- 
ders College, 1991, xvii + 588 pp, $42.50. [ISBN: 
0-03-031193-4] Includes new material on graph the- 
ory and least squares applications; additional figures, 
historical essays, and geometric interpretations; new 
chapter summaries; introduction of all systems of 
equations material before matrices; and recurring ap- 
pearances of a “Summing Up Theorem.” The Ap- 
plications Supplement also contains new examples 
and exercises. (First Edition, TR, April 1980; Sec- 
ond Edition, TR, January 1985; Third Edition, TR, 
August-September 1987.) JNC 


Linear Algebra, T*(13: 1). Elementary Linear 
Algebra, Sizth Edition. Howard Anton. Wiley, 1991, 
xv + 570 pp, $49.95, [ISBN: 0-471-50900-0]. Ele- 
mentary Linear Algebra: Applications Version, Sizth 
Edition. Howard Anton, Chris Rorres. Wiley, 1991, 
xv + 795 pp, $52.95. [ISBN: 0-471-52739-4] Re- 
visions include earlier coverage of eigenvalues and 
eigenvectors, rewritten material on linear transfor- 
mations, bases, row space, rank, etc., a separate 
chapter on inner product spaces, coverage of least 
squares fitting, new exercises and drill problems. 
The Applications Version (which adds an applica- 
tions chapter) includes a section on fractals. (First 
Edition, TR, March 1973; Extended Review, 1974; 
Fifth Edition, TR, August-September 1987.) JNC 


Algebra, P. Topics in Computational Algebra. Eds: 
G.M. Piacentini Cattaneo, E. Strickland. Kluwer 
Academic, 1990, 261 pp, $99. [ISBN: 0-7923-1117- 
5] Papers of ten invited speakers at the Semester of 
Computational Algebra, University of Rome II, ‘Tor 
Vergata,’ January-May 1990. LC 


Calculus, T*(13: 2). Introduction to Mathemati- 
cal Analysis for Business and Economics. André L. 
Yandl. Brooks/Cole, 1991, xxxii + 1000 pp, $52.75. 
[ISBN: 0-534-13494-7] Also available as two sepa- 
rate volumes: Finite Mathematics and Applied Cal- 
culus. Reviews algebra and functions, then pro- 
vides two separate paths: systems of linear equa- 
tions and inequalities, matrices, linear programming, 
sequences and mathematics of finance, probability; 
the derivative, applications of derivatives, integra- 
tion, applications of integration, functions of sev- 
eral variables. Replete with detailed explanations, 
examples and exercises (many illustrating applica- 
tions), optional topics. Readable, attractive, peda- 
gogically effective. Worth a look if you have this kind 
of course. DFA 


Algebraic Geometry, P. Linear Algebraic Groups, 
Second Enlarged Edition. Armand Borel. Grad. 
Texts in Math., V. 126. Springer-Verlag, 1991, xi 
+ 288 pp, $49. [ISBN: 0-387-97370-2] An enlarge- 
ment of the 1969 Benjamin edition. New sections 
include discussions of Cartan subgroups of solvable 
groups, isotropic reductive groups, and Tits sys- 
tems. SG 


Geometry, P. Integral Geometry and Tomogra- 
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phy. Eds: Eric Grinberg, Eric Todd Quinto. Con- 
temp. Math., V. 113. AMS, 1990, xii + 249 pp, $54 
(P). [ISBN: 0-8218-5120-9] Twenty papers from the 
Joint Summer Research Conference held at Hum- 
boldt State University, Arcata, California, June 24- 
30, 1989. Subjects represented include combina- 
torics, geometric inequalities, microlocal analysis, 
group theory, harmonic analysis, Lie group theory, 
Radon transforms, radiation dose planning, radar, 
partial differential equations. DFA 


Control Theory, T(18: 1), L. H®-Optimal Con- 
trol and Related Minimaz Design Problems: A Dy- 
namic Game Approach. Tamer Basar, Pierre Bern- 
hard. Systems & Control: Found. & Applic. Ser. 
Birkhauser Boston, 1991, xii + 224 pp, $49.50. 
[ISBN: 0-8176-3554-8] The task at hand in this 
book is to determine a controller design which op- 
timizes the worst case scenario. If we think of “op- 
timizing” a system as minimizing some cost func- 
tion, we can think of a random disturbance as a fac- 
tor seeking to maximize the same function. Thus, a 
minimax scenario arises, leading to the application of 
game theory. Written for second level graduate stu- 
dents, this book considers both continuous and dis- 
crete time systems. A course in linear control theory 
is an adequate preparation for this book. SM 


Control Theory, T(17: 1), P, L. Lecture Notes in 
Control and Information Sciences-152: Implicit Lin- 
ear Systems. J.D. Aplevich. Springer-Verlag, 1991, 
xi + 176 pp, $31 (P). [ISBN: 0-387-53537-3] An in- 
troduction to a class of models of linear dynamical 
systems. These systems are characterized by a sin- 
gle vector of external variables rather than the usual 
vectors for inputs and outputs. Meant for graduate 
students in engineering or applied mathematics with 
some prior experience with control theory. SM 


Statistics, P. Applied Computational Statistics in 
Longitudinal Research. Michael J. Rovine, Alexan- 
der von Eye. Academic Pr, 1991, xiii + 237 
pp, $34.50 (P). (ISBN: 0-12-599450-8] Companion 
volume to the 1990 two-volume Statistical Meth- 
ods in Longitudinal Research, edited by von Eye 
(TR, March 1991). Presents fourteen of the newer 
techniques—summarizes the method, outlines the 
computational steps, and gives an annotated data 


example. RSK 


Statistics, P**. Directions in Robust Statistics and 
Diagnostics. Eds: Werner Stahel, Sanford Weisberg. 
Springer-Verlag, 1991. Part J, IMA, V. 33, xviil + 
252 pp, $35, [ISBN: 0-387-97530-6]; Part JJ, IMA, 
V. 34, xviii + 380 pp, $45. [ISBN: 0-387-97531- 
4] Proceedings of the first four weeks of a six-week 
program of the IMA (Institute for Mathematics and 
Its Applications) at the University of Minnesota in 
the summer of 1989. Contains 39 papers by leaders 
in these areas, giving an overview of research being 
done. (Parts are divided alphabetically by principal 
author.) RSK 

Statistics, T(17-18: 1, 2), S, P, L. A User’s 
Guide to Principal Components. J. Edward Jack- 


son. Wiley, 1991, xvii + 569 pp, $69.95. [ISBN: 


1991] 


0-471-62267-2] Designed for practitioners of prin- 
cipal component analysis (PCA), this text carefully 
presents the methods of PCA and illustates them 
with several data sets. Topics include multidimen- 
sional scaling, principal components regression and 
analysis of variance, and applications of PCA to dis- 
criminant and cluster analysis, and time series. PCA 
is contrasted with factor analysis and competitors to 


PCA are discussed. RWJ 


Statistics, P. Some Recent Results on Chi-Squared 
Tests. M.S. Nikulin. Papers in Pure & Appl. Math., 
No. 86. Queen’s Univ, 1991, 74 pp, (P). Divided 
into three parts: the chi-squared test of Pearson, a 
chi-squared goodness-of-fit test for exponential dis- 
tributions of rank one, and chi-squared test for mul- 
tivariate discrete distributions. RSK 


Programming, S(13), P. Handbook of QuickBA- 
SIC. David I. Schneider. Brady, 1991, 797 pp, 
$29.95 (P). [ISBN: 0-13-381880-2] A reference man- 
ual suitable for beginning and experienced program- 
mers. Covers QuickBASIC, QBASIC, and the BA- 
SIC Professional Development System (PDS). As- 
sumes reader knows rudiments of QuickBASIC. Ex- 
tension Comments and over 700 programming exam- 
ples accompany the descriptions of functions, state- 
ments, and metacomments, as do hundreds of pro- 
gramming tips and techniques. Many useful appen- 
dices and tables address special topics. DFA 


Programming, T(13: 1), C. Common-Sense BA- 
SIC: Structured Programming with Microsoft Quick- 
BASIC. Alice M. Dean, Gove W. Effinger. Har- 
court Brace Jovanovich, 1991, xvi + 415 pp, $28 
(P) net. (ISBN: 0-15-512297-5] A leisurely intro- 
duction to rudiments of structured programming in 
Basic (specific references are to Microsoft QuickBA- 
SIC). Assumes no computer background whatever— 
initial sections include photos of computers, defini- 
tions of computer jargon, etc. Emphasizes program- 
ming structure: contro] structures, data types, data 
structures, etc. Final chapters introduce graphics, 
color, and sound. Chapters end with programming 
exercises. PZ 


Theory of Computation, T(17-18), P, L. Alge- 
braic and Structural Automata Theory. Ed: Boleslaw 
Mikolajczak. Annals of Disc. Math., V. 44. North- 
Holland (US Distr: Elsevier Science), 1991, xxi + 
402 pp, $131.50 (P). [ISBN: 0-444-87458-5] En- 
glish translation of the 1985 Polish edition. Prin- 
cipal results of the Group in. Automata Theory and 
Computability at the Technical University of Poz- 
nan. Addresses problems in automata morphisms, 
time-varying automata and their transformations, 
automata realizations, linear automata, input subau- 
tomata, cellular networks. Assumes only basic con- 
cepts of algebra and graph theory. Exercises, exam- 
ples, writing style make it quite suitable as a text. 
But note price. DFA 


Applications (Physical Science), P. Applications 
of the Mathematical Sciences to Materials Science. 
National Research Council. National Academy Pr, 
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1991, ix + 26 pp, (P). A non-technical review of 
mathematical challenges and opportunities in the 
rapidly growing field of materials science, where 
computation-intensive mathematical modelling is be- 
coming more effective than laboratory trial-and- 
error. Mathematical methods range broadly from 
partial differential equations and nonlinear optimiza- 
tion to tiling and cellular automata; applications are 
equally diverse, including ceramics, polymers, and 
protein folding. LAS 

Applications (Physical Science), T(17-18: 1), 
S, P, L. Radar and Sonar, Part I. Richard E. Blahut, 
Willard Miller, Jr., Calvin H. Wilcox. IMA, V. 32. 
Springer-Verlag, 1991, xi + 260 pp, $34. [ISBN: 0- 
387-97516-0] Thorough, carefully prepared lecture 
notes from three tutorials on remote surveillance al- 
gorithms, applications of harmonic analysis to radar 
and sonar, and echo structure, plus a previously un- 
published (1960) technical report on the synthesis 
problem for radar ambiguity. Can be used as a text- 
book. BC 


Applications (Physics), P. Soliton Phenomenol- 
ogy. Vladimir G. Makhankov. Math. & Its Ap- 
plic., V. 33. Kluwer Academic, 1990, xi + 452 pp, 
$149. [ISBN: 90-277-2830-5] Presents the essen- 
tials of soliton theory in six parts: Part I—Quantum 
systems and classical behavior; Part II—Some ex- 
act results in one-dimensional space; Part IJI— 
Noncompact symmetries and Bose gas; Part IV— 
Soliton-like solutions in one-dimension; Part V— 
Phenomenology of D = 1 solitons; and Part VI— 
Many-dimensional solitons. MPR 


Applications (Physics), P. The Geometry and 
Physics of Knots. Michael Atiyah. Cambridge Univ 
Pr, 1990, x + 78 pp, $39.50. [ISBN: 0-521-39521- 
6] Notes expanding on a series of lectures delivered 
at the University of Florence, November 1988, offer- 
ing a mathematical introduction to recent applica- 
tions of knot theory in theoretical physics. The Jones 
polynomial invariant of a knot finds clear conceptual 
interpretation in Witten’s topological quantum field 
theory. RB 


Applications (Physics), S(16-18), P. Wave 
Packets and Their Bifurcations in Geophysical Fluid 
Dynamics. Huijun Yang. Appl. Math. Sci., V. 85. 
Springer-Verlag, 1991, ix + 247 pp, $39. [ISBN: 
0-387-97257-9] This monograph is conspicuously 
well-written. Mathematical formulae are embedded 
in text that is both easy to read and highly illumi- 
nating. The subject matter is introduced carefully, 
making it accessible to the non-expert. MU 


Reviewers 


DFA: David F. Appleyard, Carleton; RB: Richard Brown, 
St. Olaf; JNC: Judith N. Cederberg, St. Olaf; LC: Laura 
Chihara, St. Olaf; BC: Barry Cipra, St. Olaf; SG: Steven 
Galovich, Carleton; RWJ: Roger W. Johnson, Carleton; RSK: 
Richard S. Kleber, St. Olaf; SM: Steve McKelvey, St. Olaf; 
MPR: Matthew P. Richey, St. Olaf; KES: Kay E. Smith, 
St. Olaf; LAS: Lynn Arthur Steen, St. Olaf; MU: Milton Ul- 
mer, Carleton; PZ: Paul Zorn, St. Olaf. 


Push the power of Mathematica to its limits 
Mathematica in Action 


Stan Wagon, Macalester College 


Mathematica in Action extends your imagination and the power of Mathematica by providing 
alternative methods to generate three-dimensional graphics, iterative graphics, and animations. 
Its many valuable shortcuts, complete programs with line-by-line explanations, and hundreds of 
advanced examples worked in detail help you realize those aims. 

Whether you are a mathematics teacher, researcher, or enthusiast, you’ll find this to be the 
indispensable sourcebook to own because of its ¢ detailed examples that use animations to study 
dynamical systems and Julia Sets © attention to high-precision number theory—including many aspects 
of prime numbers ® and its in-depth study of cycloids. 


Contents ¢ A Brief Introduction ¢ Prime Numbers ® Rolling Circles © Surfaces © Iterative 
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1991, 419 pages, 188 illustrations, 4 pp. color plates; paper (2202-X) $29.95; cloth (2229-1) $39.95 


A remarkable visual 
exploration of the 
world of fractals and 
chaos in a 63-minute, 
full-color video— 


FRACTALS 


An Animated Discussion 
with EDWARD LORENZ and 
BENOIT B. MANDELBROT 


Mail to: W. H. Freeman and Company 

4419 West 1980 South, Salt Lake City, UT 84104 

Send me copies of Mathematica in Action, 
paper (2202-X), at $29.95 

hardbound (2229-1), at $39.95 


Send me____ copies of FRACTALS, (2213-5), 
individual price at $59.95 


Educational price at $149.95 (includes licensing 
and duplication rights) 


Add $1.95 shipping / handling for the first item 
$1.25 for each additional item 


UT, CA, & NY residents, please add 
the appropriate sales tax 


H.-O. Peitgen © H. Jurgens ¢ 
D. Saupe ® C. Zahlten 


The video turns the 
Mandelbrot set and the Lorenz 
attractor into visible and easily 
comprehensible objects as 
their discoverers, Benoit B. 


L] I enclose a check or money order Total 


payable to W. H. Freeman and Company 
(JCharge to my (CIMasterCard L)Visa Exp. date 


Mandelbrot and Edward Account # 

Lorenz, discuss the background, Signature 

history, and details of their (credit card orders must be signed) 
work in extended interviews. Name 

1991, VHS, color video, Address 

63 minutes, ISBN 2213-5, City 

$59.95 individual price; 

$149.95 educational price State/Zip 


(includes licensing and 
duplication rights) 
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is W. H. Freeman and Company 
GH The book publishing arm of Scientific American 
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‘Professional Standards for Teaching Mathematics — 
200 pp.. ISBN 0-87353-307-0, #43956, $25* 


Curriculum and Evaluation 
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Quantity discounts for each book are also available. a 
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When was 
the last time 
a computer ) 
program helped 
you think about 
mathematics? 
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Software and video tapes for the student, professional, and 
anyone who loves mathematics. Ask for our latest catalog. 


Lascaux Graphics (602) 544-4229 (800) 338-0993 
7601 N. Calle Sin Envidia, Suite 31 - Tucson, AZ 85718 USA 


Introducing E.Z. Math, E.Z. Algebra and E.Z. Arithmetic for the HP 48SX 


E.Z. Math, E.Z. Algebra and E.Z. Arithmetic are programs for the Hewlett Packard 48SX calculator conceived, written 
and programmed by Raymond La Barbera and the E.Z. Software Company. Each program comes on a 128K plug-in ROM 
card accompanied by an easy-to-understand, well-written, detailed manual loaded with lots of specific examples and _is 
designed for use by students, teachers. parents and business people. Each program features an easy-to-use, logically 
organized, user-friendly interface which enables those who consider themselves to be calculator and computer illiterates, 
as well as those who don’t like to read manuals, to have full access to all program features quickly and easily. Since the 

8SX is essentially an impressive, sharp-looking, 8 ounce pocket computer, students are easily motivated to take it 
along with them to study, practice, drill and master math in a study hall, on a train or bus, in a car, on line, on vacation, 
on a break—in short, for self-study at any time and in any place. 


What Can Be Done With E.Z. Math 


E.Z. Math effectively solves problems involving graphs, numbers, loans and savings. With E.Z. Math, anyone can: 
e Master the entire high school and college graphing curriculum, from algebra to calculus, with 188 families of equations, inequal- 
ities, functions, and systems, all arranged in an easy-to-use, user-friendly system of menus to make graphic analysis a snap! 
¢ Get extensive help with calculations involving fractions, whole numbers, complex numbers and number sequences. 
¢ Easily do savings and loan calculations and generate complete amortization tables, 
e Learn many basic concepts including those involving sets, variables, graphing, solving, numbers, loans and savings. 


What Can Be Done With E.Z. Algebra 


E.Z. Algebra is a comprehensive ninth grade high school basic algebra course as well as a high school and college remedial 
algebra course that builds a solid algebra foundation. With E.Z. Algebra, anyone can: 

e Learn about sets, operations, variables, relations and other concepts essential to a real understanding of algebra. 

¢ Understand the sets of natural numbers, whole numbers, integers, rational numbers and real numbers. 

« Master the meaning and properties of the operations of addition, subtraction, multiplication, division, power and root. 

¢ Do all kinds of problems involving algebra expressions, numerical phrases, equations and inequalities. 


What Can Be Done With E.Z. Arithmetic 


E.Z. Arithmetic is a comprehensive elementary school basic arithmetic course _as well as a high school and college 
remedial arithmetic course that makes solving most arithmetic problems a snap! With E.Z. Arithmetic, anyone can: 
+ Learn how to add, subtract, multiply, divide and order whole numbers, fractions, decimals, percents and integers. 
¢ Master the meaning, terminology and conversion methods for whole numbers, fractions, decimals and percents. 
¢ Drill and be graded on endless H varied, randomly selected sets of problems involving whole numbers, fractions, decimals and 
integers, with the difficulty level, number of problems, operation and type of number user selectable. 


How To Order Copies or Get Further Information 
Each E.Z. Software program costs $130.00 ($125.00 retail, plus $5.00 shipping and handling). Take a 10% discount when 
ordering ten or more units. We accept payment by check, money order, COL, VISA, MC, AK and purchase order. If 
within 30 days you find that any E.Z. Software program fails to meet your expectations, we'll gladly take back your copy 
for a prompt, courteous refund. To order copies, either individually or bundled with HP 48SX calculators, please contact: 

SMI Corporation, 250 West New Street, Dept MM2, Kingsport, Tennessee 37660 
(800) 234-0123 or (615) 378-4821 or (615) 245-8982 (Fax). 


Our Term Life Insurance Plan is custom- 
designed for members of our profession. What's 
more, each policy can then be tailored to suit 
your individual needs. As these needs change, So 
can the policy —and it can stay with you no 
matter how often you change jobs. 

Our group purchasing power helped us to 
negotiate top quality insurance, at a very low 
price. To take advantage of this benefit of 


membership, call 1-800-424-9883 for 
further details (in Washington, DC. call 
457-6820). 


MAA INSURANCE 
Designed by Members. 


For Members. 


The MAA Life Plan is underwritten by Connecticut General Life Insurance 
Company, A CIGNA Company, Hartford, Connecticut 06152 
on form number GM 3000 


ZENITH DATA SYSTEMS PRESENTS 
MASTERS OF INNOVATION 


Solving The Problem Of How Students Solve Problems. 


easy-to-use “IMMEX” consists 
of multiple cases.of immune 
defects and a set of results from 
45 laboratory tests (see figure 
below). Through a series of 
exercises and exams, students 
are asked to diagnose these 
cases by selecting the appro- 


ge’ Z priate tests and examining 
their results. 


Upon completion, graphi- 
cal representations are gener- 
ated by computer to show 
which tests were chosen and 
importantly, demonstrate how 
students searched for the 
solution to each problem. 


AY 
\ \\ 
x 


Dr. Ronald H. Stevens 


\\ 


Any professor will tell you 
that good teaching requires an 


understanding of their students’ —_It is then possible to 
thinking. At the University of _ visualize the students’ thought 
California Los Angeles, how- _ process in a way that 


ever, Dr. Ronald H. Stevens standard, multiple choice 


has found an especially testing doesn’t allow. For 
innovative way to get inside the instance, Dr. Stevens  . 
cranium of his second-year can learn how organ- | 
medical students. ized and focused their 
With his award-winning = knowledge is, how ; 
“IMMEX” software program, well their organization 
Dr. Stevens discovers not only relates tocriticalcon- | 
if his students can solve Immu- cepts in Immunology, | 
nology problems, but also where major miscon- | 
how information was gathered ceptions exist and 
and processed during the whether proper 
solving. knowledge links a 
Here’s how it works. are evident. = 
Programmed in Microsoft® In turn, these in- 
Windows™ version 3.0, the sights gained through 


sponsored the MASTERS OF INNOVATION Competition for the past three years. 


P.O. Box 14513, Chicago, IL 60614-9998. 
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(NATH for 3 doys and the supernatent saved os 
a source of IL 2 Doubling dilutions of this 

medium ore added fo cells dependent on IL 2 for 
growth ond DNA synthesis meosured 3 days later 
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ABOUT THE MASTERS OF INNOVATION COMPETITION. 


As acorporation committed to education, Zenith Data Systems encourages students and educators—like Dr. Ronald H. 
Stevens —to creatively explore the potential of computers within their fields of study. Towards that end, Zenith Data Systems has 


To obtain an unabridged copy of his discussion paper on “IMMEX? or an application to enter the MASTERS OF 
INNOVATION IV Competition, please write to us at: Masters Of Innovation Program, Zenith Data Systems Corporation, 


“TMMEX” can have a sig- 
nificant impact on teaching 
methods. As Dr. Stevens 
explains, “This approach can 
lead to rapid detection and 
remediation of individual 
students’ problem solving diffi- 
culties, and can greatly per- 
sonalize the education process: 
What was Dr. Stevens’ 
approach in developing his 
award-winning software? He 
chose to program “IMMEX” on 
Zenith Data Systems laptop 
PCs. They provided him with 
all the Random Access Memory 
and portability required to work 
after hours and over weekends. 
And that made solving the 
problem of how students solve 
problems, less of a problem. 
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“IMMEX” Software Program 


Microsoft and Windows are trademarks of Microsoft Corporation. Copyright © 1991 Dr. Ronald H. Stevens. 
Copyright ©1991 Zenith Data Systems Corporation. 


Springer for Mathematics 


C.H. Clemens, University of Utah, Salt Lake 
City, Utah and M.A. Clemens 


Geometry for the Classroom 


Intended for use in college courses for pro- 
spective or in-service secondary school teach- 
ers of geometry. Designed to give teachers 
broad preparation in the content of elementary 
geometry as well as closely related topics of a 
slightly more advanced nature. The presen- 
tation and the modular format are designed to 
incorporate a flexible methodology for the 
teaching of geometry, one that can be adapted 
to different classroom settings. The basic 
Strategy is to develop the few fundamental 
concepts of elementary geometry, first in in- 
tuitive form, and then more rigorously. The 
rest of the material is then built up out of these 
concepts through a combination of exposition 
and ‘“‘guided discovery” in the problem sec- 
tions. A separate volume including the solu- 
tions to the exercises is also available. 
1991/335 pp., 512 illus./Softcover/$29.95 
ISBN 0-387-97564-0 

Geometry for the Classroom; 


Exercises and Solutions 
1991/167 pp./Softcover/$19.95 
ISBN 0-387-97565-9 


P. Ribenboim, Queen’s University, Kingston, 
Ontario, Canada 


The Little Book of Big 


Primes 


This abridged version of The Book of Prime 
Number Records, also by Ribenboim, pre- 
sents records concerning prime numbers. It 
also explores the interface between computa- 
tions and the theory of prime numbers. Con- 
tains an up-to-date historical presentation of 
the main problems pertaining to prime numbers, 
as well as many fascinating topics, including 
primality testing. Written in a light and hu- 
morous language, this book is thoroughly ac- 
cessible to everyone. 

1991/app. 304 pp./Softcover/$29.50 

ISBN 0-387-97508-X 


New York « Berlin « Heidelberg « Vienna « 


6 Springer-Verlag 


London « 


D. Bressoud, Pennsylvania State University, 
University Park, PA 


Second Year Calculus 
From Celestial Mechanics to 
Special Relativity 


Covers multi-variable and vector calculus, 
emphasizing the historical physical problems 
which gave rise to the concepts of calculus. 
The book carries us from the birth of the 
mechanized view of the world in Isaac 
Newton’s Mathematical Principles of Natural 
Philosophy in which mathematics becomes 
the ultimate tool for modelling physical real- 
ity, to the dawn of a radically new and often 
counter-intuitive age in Albert Einstein’s 
Special Theory of Relativity in which the 
mathematical model suggests new aspects of 
that reality. The development of this process 
is discussed from the modern viewpoint of 
differential forms. Using this concept, the 
student learns to compute orbits and rocket 
trajectories, model flows and force fields, and 
derive the laws of electricity and magnetism. 
These exercises and observations of math- 
ematical symmetry enable the student to better 
understand the interaction of physics and 
mathematics. 

1991/app. 416 pp., 98 illus./Softcover/$29.95 
ISBN 0-387-97606-X 

Undergraduate Texts in Mathematics 
Readings in Mathematics 


Order Today! 

¢ Call: Toll-Free 1-800-SPRINGER(R): 
1-800-777-4643. In NJ call 201-348-4033 
(8:30 AM - 4:30 PM EST). 
Your reference number is $902. 

¢ Write: Send payment plus $2.50 for postage 
and handling to: Springer-Verlag New York, 
Inc., Order fulfillment-S902, PO Box 2485, 
Secaucus, NJ 07096-2491. 

¢ Visit: Your local technical bookstore. 

Instructors: Call or Write for information on 

textbook examination copies! 
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Casio Educational Product Distributors 


ABC SCHOOL SUPPLY 
800-669-4ABC 

(IN GA 404-497-0001) 
ALLIED NATIONAL 
800-999-8099 

(IN MI 313-543-1232) 


ARROWHEAD BUSINESS MACHINES 


800-234-3396 
(IN TX 214-869-0721) 

(IN MO 816-861-1113) 

(IN CA 213-946-6680) 
THE BACH COMPANY 
800-248-2224 

(IN CA 415-424-0800) 
BARDEEN SCHOOL SUPPLY 
800-831-0113 

(IN NY 315-437-7566) 
BECKLEY-CARDY COMPANY 
800-446-1477 

BHARDS PUBLISHING 
800-473-7999 

(IN IL 312-642-8657) 
CALCULATORS INC. 
800-533-9921 

(IN MN 800-533-9921) 
CAROLINA WHOLESALE 
800-521-4600 

(IN NC 704-598-8101) 


COLBORN SCHOOL SUPPLY INC. 


800-548-7031 

(IN CO 303-778-1220) 
(IN MT 406-245-3158) 
Copco 

800-446-7021 

(IN OH 800-589-3006) 


CUISENAIRE 
800-237-3142 
(IN NY 914-235-0900) 


DALE SEYMOUR PUBLICATIONS 
800-872-1100 
(IN CA 800-872-1100) 


THE DOUGLAS STEWART CO. 
800-279-2795 
(IN WI 800-279-2795) 


E.A.I. 
800-272-0272 
(IN NJ 201-891-9466) 


EDUCATIONAL ELECTRONICS 
800-526-9060 
(IN MA 617-821-6458) 


GLOBAL PRODUCTS 
800-633-0633 
(IN IL 708-397-4944) 


GREYSTONE EDUCATIONAL MATERIALS 
800-733-0671 
(IN MN 612-430-9857) 


KURTZ BROTHERS 
800-252-3811 
(IN PA 814-765-6561) 


LONGINO DISTRIBUTORS 
800-633-6224 
(IN NC 704-873-3282) 


NASCO 
800-558-9595 
(IN WI 414-563-2446) 


NATIONAL AUDIO-VISUAL SUPPLY 
800-222-0109 
(IN NJ 800-222-0109) 


NEMESIS DISTRIBUTING 
800-940-7407 

(IN FL 305-477-8822) 
PENNS VALLEY PUBLISHING/ 
LEARNING SYSTEMS 
800-422-4412 

(IN PA 215-855-4948) 
SERVCO PACIFIC 

(IN HI 808-841-7566) 

TAM’S 

800-421-5188 

(IN CA 800-244-5624) 
TAYLOR ELECTRIC 
800-558-6970 

(IN WI 800-242-8940) 
TECHLINE 

800-777-3635 

(IN VA 703-389-0857) 
TROXELL COMMUNICATIONS INC. 
800-528-7912 

(IN AZ 800-352-7941) 
VALLEY BUSINESS MACHINES 
(IN UT 801-969-6303) 
VISTATECH 

800-847-9851 

(IN NY 212-254-9851) 

(IN CA 213-602-0277) 
WHOLESALE ELECTRONIC SUPPLY 
800-527-2156 

(IN TX 800-441-0145) 


CASIO. 


sity THE MATHEMATICAL SYMBOL FOR QUALITY 


NEW FOR 1992 


Calculus and Analytic Geometry, 5/e 
Sherman Stein & Anthony Barcellos 


Calculus for Business, Economics, and /. Susan Milton 


the Social and Life Sciences, 5/e 


Numerical Methods and Analysis 
James Buchanan & Peter Turner 


Statistical Methods in the Biological and 
Health Sciences, 2/e 


Laurence Hoffmann & Gerald Bradley 


Calculus Gems: Brief Lives and 
Memorable Mathematics 
George Simmons 


Discovering Calculus with the HP-28 
and the HP-48 
Robert Smith & Roland Minton 


Introduction to MathCAD for 


Scientists and Engineers 
Sol Wieder 


Encounters with Chaos 
Denny Gulick 


Foundations of Abstract Mathematics 
David Kurtz 


Fundamentals of Computing: Volume 1 
Logic, Programs, and Computing 
Allen Tucker, James Bradley, Robert Cupper, & David Garnick 


Also of Interest from McGraw-Hill 


Applications of Discrete Mathematics 
Michaels/Rosen 


Discrete Mathematics and Its Applications, 2/e 
Rosen 


Differential Equations with Applications and 
Historical Notes, 2/e 
Simmons 


Partial Differential Equations and Boundary Value 
Problems with Applications, 2/e 
Pinsky 


For more information, please contact your McGraw-Hill 
representative or write to McGraw-Hill College Division, Comp 
Processing & Control, P.O. Box 448, Hightstown, Nj 08520. 


State-of-the-art software 


] 


MathWriter™ 2.0: The Scientific Word Processor 
for the Macintosh® 
J. Robert Cooke and E. Ted Sobel 


MathWriter is a complete “what-you-see-is-what- 
you-get” scientific word processing program that allows 
users to enter mathematical expressions as text, not 
graphics, so both text and mathematics can be edited in 
the same document. In addition to a spelling checker, 
thesaurus, and automatic hyphenation, the program also 
automatically sizes and centers mathematical symbols, 
italicizes variables, and formats tables and matrices. 
MathWriter shows onscreen renumbering of both 
equations and references to equations and can show both 
old and revised versions of the same document. 


1991. Professional version: $395. Reduced-featured 
Educational Version $99.95. 


EXP: The Scientific Word Processor, Version 2.1 
Simon L. Smith 

This easy-to-use, “what-you-see-is-what-you-get” technical 
word processing program for IBM PCs and compatibles 
offers fast, professional results. Used by mathematicians 
and scientists worldwide, EXP has the built-in capabilities 
of a complete word processor, near typeset quality output, 
the ability to import graphics and see them on screen, and 
much, much more! The new EXP DVI Output Driver 
allows an EXP file to be saved as a “device independent” 
file and printed on any device with TeX fonts available. 
Now EXP files can be printed at typeset resolutions up to 
2,000 dots per inch. 


August 1991. Single User Package $295. Upgrade 
package $35 for registered users of Version 2.0 and $150 
for registered users of any Version 1.0. DVI Output Drive 
Version 2.1 $195. 


Advanced Books 


Fourier Analysis and Its Applications 
Gerald B. Folland 


This undergraduate text presents Fourier analysis as a 
powerful tool for solving various differential equations of 
interest in science and engineering. The comprehensive 
work covers the whole realm of Fourier analysis including 
Fourier series and transforms, eigenfunction expansions, 
Laplace transforms, and Green’s functions. 


December 1991. ISBN: 0-534-17094-3. $59.50. 


AY q 


Wadsworth & Brooks/Cole 
Advanced Books & Software 


The Geometry of Computer Graphics 
Walter Taylor 


This senior-level text presents the power and beauty of 
pure geometry in a way that it can be applied. The text 
discusses how one commands a computer to make 
pictures. It presents the fundamental mathematical 
concepts needed for two- and three-dimensional computer 
graphics, the various languages in which these ideas are 
communicated, and many ways to describe pictures 
mathematically. Included are a wealth of exercises, 
from elementary problems to major works in analytic 

and projective geometry that will both test and 

challenge students. 


October 1991. ISBN: 0-534-17100-1. $58.95. 


A Course in Ring Theory 
Donald S$. Passman 


This graduate textbook offers a module theoretic stroll 
through a mixture of commutative and non-commutative 
ring theory, with emphasis on the latter. Projective and 
injective modules supply the framework for a coherent 
choice of material. 


1991. ISBN: 0-534-13776-8. $54.95. 


Ethnomathematics: A Multicultural View of 
Mathematical Ideas 
Marcia Ascher 


This unique book discusses the mathematical ideas of 
people in traditional, or “small-scale,” cultures such as 
American Indian tribes, Incas, and various cultures in the 
south Pacific and Africa. The book takes some common 
concepts such as number, chance, and geometry, and 
describes the role of these ideas in the cultures. 


1991. ISBN: 0-534-14880-8. $38.95. 


To order a personal copy of software or books, use 
our toll-free number (800) 354-9706, or write to us at 
the address below. To request complimentary copies 
for review, or to receive a copy of our 1992 Advanced 
Books & Software Catalog, please write: 


Wadsworth & Brooks/Cole 
Advanced Books & Software 


MM002 
511 Forest Lodge Road 
Pacific Grove, CA 93950-5098 
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Renewed for ’91: a free 
classroom display device 
with purchase of 30 
calculators. 


Showing is much more powerful 
than telling. So we've developed 
special classroom displays for 
our most advanced calculators. 


The HP 48SX scientific expand- 
able calculator, the cost-effective 
If}? 28S and the new HP 48S are 
designed to put your students 
on the cutting edge of calculus 
and engineering. With more built- 
in functions and graphics solu- 
lions than any other calculators. 


If vour department or students 
purchase 30 HP 48SX, HP 2858 or 
HIP? 18S calculators (or a mix of 
all three), we'll give you free an 
HIP? -48SX and plug-in classroom 
display (a $900 retail value). Or 

a Classroom display version of 
the HTP 28S (a $600 retail value). 
And free teaching materials. 


Call (503) 757-2004 from 

Sam to 3pm PDT for details. Or 
write: Calculator Support, 
Ilewlett-Packard, 1000 NE Circle 
Blvad., Corvallis, OR 97330. Offer 
ends October 31, 1991, and 
applies only to college and high 
school instructors. 
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Recursive & iterative programming ¢ Over 70 new math functions « New manual, 500 examples & exercises 


The Joy of BERW*EZING. 


DERIVE..— A Mathematical 
Assistant program. 

PC Magazine says it’s 

“a joy to use” and proclaims it 
“Editors’ Choice.” PC Week calls 
it “fast and capable.” The 
DERIVE® program Is de- 
livered with built-in 
standard equipment 
that delights 


MAGAZINE 


EDITORS’ 
CHOICE 


May 29, 1990 
Derive Version 16 


both 
math lovers and 
math phobics. 


High performance. 


DERIVE does numeric and symbolic 
equation solving, exact and approximate 
arithmetic to thousands of digits, calculus, 
trigonometry and matrices. It displays or- 
mulas in comprehensible 2D format using 
raised exponents and built-up fractions. It 
plots beautiful 2D curves and 3D surfaces on 
monochrome or color monitors. 


Compact and sporty. 


PC Magazine calls ita “small wonder.” All it 
takes is 512K of memory and one floppy disk 
drive. DERIVE takes to the road on PC com- 
patibles, and really gets around on laptop 
and even handheld computers! 


DERIVE \s a Hanacrafted 
registered trademark of Software 
Soft Warehouse, Inc for the Mind 


Human engineering. 


You don't have to be fluent in computerese to 
use DERIVE. |n fact, it’s the friendliest and 
easiest to use of any symbolic math package 
on the road today. 
Its menu-driven 
interface 


and 
on-line help 
make it easy— 
you'll soon be up to 
speed doing math, instead of 

trying to learn how to drive the 
software. 


Freedom to maneuver. 


DERIVE s automated expertise releases you 
from the drudgery of hand calculations. 
You can do problems you'd never attempt 
otherwise and obtain exact symbolic solu- 
tions, In addition to approximate numerical 
solutions. 


No sticker shock. 


DERIVE’s suggested retail price is $250. And 
of course DERIVE doesn't require an expen- 
sive computer, a math co-processor, or even 
a hard disk drive. 


Order DERIVE through your favorite software 
dealer or mail-order house. For a list of 
dealers, write Soft Warehouse, Inc. at 3615 
Harding Avenue, Suite 505, Honolulu, HI 
96816. Or call (808) 734-5801 after 11 a.m. 
Pacific Standard Time. 


And happy DERIVEing! 
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Box-Spline Tilings 


CARL DE Boor!” and KLtaus H6xiiG”3 


CARL DE Boor has a Ph.D. from the University of Michigan and has been a 
Professor of Mathematics and Computer Sciences at the University of 
Wisconsin-Madison since 1972. Piecewise polynomial functions of one or 
more variables have been his long-term interest. 


Kiaus HO. tic received his Ph.D. from the University of Bonn in 1979 and 
is now Professor of Mathematics at the University of Stuttgart. His current 
research interests are in computer-aided geometric design and related areas 
of numerical analysis and graphics. 


Abstract. We describe a simple method for generating tilings of R*?. The basic tile is defined as 
Q= {x ERG PO) <If(e +s) Wie Z4\ 0}, 


with f a real analytic function for which | f(x + j)| — ~ as |j] ~ © for almost every x. We show that 
the translates of © over the lattice Z? form an essentially disjoint partition of R¢. As an illustration of 
this general result, we consider in detail the special case d = 2 and 


f(x) = (€"x)(n'*) 
with €, 7 column vectors in Z*. Already this simple choice, which arises in box-spline theory, yields 
rather interesting partitions of R7. 


Let f: R¢ — R be a real analytic function such that, for almost all x and for 
je 2%, 


If(x+j)| 72 asljl>o. (1) 
Then the translates of the set 
QO = O( f) = {x € R*: | f(x)| <|f(x+/)| Wie Z7\0} (2) 


provide a tiling for R%, in the following sense. 


THEOREM. The sets 1. + j, j © Z4, form an essentially disjoint partition of R4, 
1.e. 
MANQD+/)=SVj #0; 
(ii) meas(R? \ ( + Z%)) = 0; 
Gii) meas(Q.) = 1. 
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Such sets 2 arise in box-spline theory, in the characterization of functions of 
exponential type as limits of multivariate cardinal series (cf. the Appendix). In that 
setting, the functions f have the simple form 


fa(x) — ie’ 


in which x, € are taken to be d-vectors, is a multiset from Z% \ 0 which spans 
R?, and é? denotes the transpose of €. Already for d = 2 and for = consisting of 
just two vectors, even these very simple f give rise to surprisingly complex (and 
strangely beautiful) O = Og. 


Y \ AN 
\) 


G3) Og for =|! 3] 


Proof of the theorem. To prove (i), let x = lim x, with x, € QO and x —j €Q. 
Then, the definition of ] leads to the contradiction 

|f(x-J)| IA(x-/l [fl = J)! 
— oo or Sooo i= im —— 2. 
If((~ -7) +/)| | f(x)| 


| f(x,)| 
For the proof of (ii), we deduce from (1) that the function 
jr f(x tj) 
has a minimum for almost all x. If this minimum is unique, then there exists j/* so 
that 


1> 


[f(x +i*) | <[f(et+s)| Wi#s*, 


1991] BOX-SPLINE TILINGS 795 


and therefore x € © — j*. Consequently, up to a set of measure zero, the set 
R?\ (Q + Z%) lies in the union of the zero sets of the (countably many) functions 


a(x) =|f(x+AP-lf(e+k)[, 7 #k. 


Since each such g is analytic, its zero set is of measure zero unless g vanishes 
identically. But, this latter possibility is excluded since g = 0 implies that f is 
periodic in the direction j — k and this would contradict assumption (1). 

For the proof of (iii), we conclude from (i) and (ii) that, up to a set of measure 
zero, [0, 1] is the disjoint union of the sets [0, 1]¢ A (QO + j) with j € Z’, while 0 
is the disjoint union of the sets ((0, 1]? — j) N O with j € Z?, and 


meas([0, 1]° —j) A0= meas([0, We n(at j)). 


rearranged to fill the unit square. 


Special case. In this article, we limit ourselves to the very special case 


f(x) = (é7x)(n"x), x E R’, 


with €,7 © Z? linearly independent. 
In this situation, it is convenient to introduce the new variables 


(u,v) = Bx = (€7x,*x). 
In these new coordinates, the definition of (1 becomes 
Q(T) = {(u,v): jullo] <u + allv + Bl for (a, B) € T \ 0} 


with 


a sublattice of Z?. 
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(5) Sublattice T for 2 = [1 3]. 


The original can always be recovered via the linear transformation 
jam me! -1 
0(2) = (27) ar). 

Therefore, in the new coordinates, 

meas(() = [det S|. (6) 
Also, the tiling is now obtained by translating © over the sublattice T (rather than 
over Z*). On the other hand, we have gained much simplicity since now all 
possible © are intersections of some of the same sets 0, , with 

OQ, 2 = {(u,v): lullol < lu + allo + Bh 


(see (7) Ficure), different © being obtained from different choices of the 
sublattice I. 
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Symmetries. We now investigate how many essentially different tiles we can 
obtain in this way. We begin by noting the following obvious symmetries. 

(i) Since T = —T, we also have 0 = —Q. 

(ii) [ does not change if &’ is multiplied from the right by a unimodular matrix, 
le. an integer matrix with determinant +1. 

In particular, we may restrict attention to B’ of the form 


a _ 
K | with p := |det &l/e, « = gced(7,, 72), 


and a € [0, p]. For, with o the appropriate sign, y* = o(y,, —7,)/e € Z* is 
carried by E’ to (ao det B/e,0) = (p,0), while the fact that 7,/e and 7,/e are 
relatively prime implies the existence of an integer vector y for which n’y = «. 
Thus, for some choice of the integer c, 2’ carries y = cyn* + y € Z* to (a, €) with 


a & [0, p]. Consequently, i "| = E’[n*, y], with [n*, y] necessarily unimodular. 


(iii) The scaling 


s QO 
To | 0 Ir 
changes {2 correspondingly to 
s 0 
| ; 0. 


We consider such (2 obtainable one from the other by such scaling as essentially 
the same. This means that we may further restrict attention to &’ of the form 


ST _ |? 


=) i a with 0 < a < p and gcd(a, p) = 1. In fact, since 


ate [o S18 Hho 11 
0 1 0 -1j)10 13,0 —-1]7 
it is sufficient to consider =’ of the form 
i ih withO <a<p/2 and gcd(a,p) =1. (8) 


In particular, there is just one lattice of interest for each value of p < 5, and 
p = 71s the first value for which there are, offhand, three lattices of interest. 
The resulting lattices 


pa 
0 1 


are indeed different one from the other in that, e.g., (a, 1) is the only point in T’, , 
of the form (b, 1) with 0 < b < p. This follows from the fact that 


r=.,,= | |z, O<a<p/2:-: <p/2 and gcd(a,p) = 1, 


min{b > 0:(b6,0) ET, ,} =p. (9) 
The corresponding statement 
min{b > 0:(0,b) €T,, ,} =p (10) 


also holds since 
(BT) ‘= MP “| 
0 1 | 
hence (&7)~1(0, b) = (—ba/p, b), and, since gced(a, p) = 1, this is in Z? iff plo. 
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Bounds. We conclude from (9) and (10) that 
YC 6 AN 49 ANG g ANG —, 


with a =p. The sets appearing on the right hand side are halfspaces (cf. (7) 
FIGURE); e.g. 


O00 = {(u,v):u> -a/2}. 
Consequently, 


Q ¢ (p/2)[-1, 1). (11) 

Note that this bounding square has area p”, while 0 has area p. This implies 
that Q = [-1, 1]*/2 when p = 1. It indicates that, for large p, 0 is a rather small 
subset of this bounding square. 

Certain lines are excluded from Q. Since |u + a| = 0 for u = —a, 0 cannot 
contain any point (u,v) with u = —a, for which (a, 8) € IT for some B. This 
condition holds for every a € Z\0, hence (1 meets none of the lines u + a = 0 
(therefore also none of the lines v + a = 0) fora € Z\ 0. 
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(12) O must lie inside such a set. 


We conclude from (11) that, in constructing 2 = 1 ;-;;, we only need to 


consider 


y? 


jep[-1,1). (13) 
For, if (u,v) € (p/2)[—-1, 1]? and, e.g., (a, B) > 0, then 
lut+allu+ Bl<lu+a+tmp||v + B+ np 
for any positive integers m and n. Consequently 


xE(p/2[-1,1~n NT a > xe NN Q,. 


jernto, pF jeTaZ 


Figures. We conclude this note with pictures of the first few essentially differ- 
ent tilings obtained in this special case. 

For every p, there is a lattice I generated by (p, 0) and (1, 1), viz. T = IT, ;. For 
p = 1, the corresponding tile is the centered square of side length 1. For p = 2, it 
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is the centered diamond with side length V2, i.e., the diamond with vertices at the 
unit vectors. As p increases, the central portion of the confining set shown in (12) 
FIGURE is too small to contain all of (2, and (1 sprouts four arms. The lattice is 
invariant under the map (u,v) — (v,u) (in addition to the symmetry [T= —T 
observed earlier), hence so is 1. The resulting four-fold symmetry implies that, in 
constructing (1, only one of its four ‘arms’ need be calculated. The corresponding 
{) all look similar, and the following figure gives a typical example. 


(14) O for BT = [8 1]. 


The first tiling of a different kind occurs for p = 5. Since its lattice, I’; , is 
invariant under rotation of 90°, so is the tile. 


Oo mT _|5 2 
(15) for s |. 
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Here are the next few ‘unorthodox’ tiles. 


(16) & for BT = |? | k | § | 
o 1f’?{o if’?fo 1]° 


ra) =T_[9 2] [9 4 10 3 
(17) © for BT = ° 1.12 ‘l.| ° ‘|. 
Based on the above figures, one might conjecture that the set (1) is confined to 
the union 


[-1,1] x [-p/2, p/2] V [-p/2, p/2] x [-1,1] 
of the two central strips of (12) Figure. As (18) Figure shows, this is in general 
not true. In fact, rather complicated patterns develop as p increases. The smallest 


p for which we first encounter a disconnected tile is p = 15, and this is the tile 
shown in (18). 


(18) A disconnected tile: 2? = E ‘|. 
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The next figure shows a more elaborate tile. 


(19) Tiling for BT = % 5]. 


As we mentioned in the beginning, we have considered in this paper a very 
special choice of f, motivated by results from box-spline theory. Our final figures 
give a hint of things to come [BH,]. 


\ 


(20) The BUG: generating function 
flx, y= x2 +y? — 2x. (21) NOVA: generating function f(x, y) =x? + y?— x. 


Acknowledgement. The authors would like to thank Stefan Pumm and Oliver Schule from the 
University of Stuttgart for helping with the figures, and two referees for very helpful comments. 


Appendix. We discuss briefly the connection to box-spline theory. Let 
M: R@ & R be a bounded function with compact support and denote by 


M,:=M+*--+*M 
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the n-fold convolution of M. Further, denote by 


s,={ EMC se(ic eh(24)| 
jez? 

the linear span of the translates of M, with square summable coefficients. We 

showed in [BHR] that a function g € L,(R%) can be approximated by a sequence 

g, <=S,, n EN, if and only if the support of the Fourier transform of g is 

contained in the set 


D(M) = { x ER?: [M(x + 2aj)| <|M(x) 


,j €Z\ 0}. (22) 
With minor modifications, this agrees with the definition of the basic tile in (2), i.e. 
D(M) =27Q(f), with f = 1/M(27-). 

Thus the fundamental domain D generates a tiling of R%. 


In the main application of this result, M is chosen as the centered box-spline. 
Its Fourier transform has the simple form (cf. [BH,], [H]) 


M(x) = Mz(x) := i sinc(€"x/2) 


where sinc(t) := sint/t and & is a multiset of integer d-vectors. Because of 
periodicity, the factors sin(é’x/2) are irrelevant for the definition of the funda- 
mental domain, hence 


D(Mz) = 27Q(fz), with fg= [] é*~. 
EGR 


The simplest special case, when d = 2 and & consists of just two vectors, is 
considered in the present paper. 
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Hex Games and Twist Maps on the Annulus 
STEVE ALPERN*, London School of Economics 


ANATOLE BEck, University of Wisconsin 


STEVE ALPERN studied game theory at Princeton under Oskar Morgenstern 
and wrote his Ph.D. thesis on ergodic theory with Peter Lax (and unoffi- 
cially, John Oxtoby) at the Courant Institute. Since 1980 he has been at the 
London School of Economics. 


ANATOLE BEcK: I was born in the Bronx and educated at Stuyvesant High 
School, Brooklyn College, and Yale University, where I syudied with Shizuo 
Kakutani. After two postdoctoral years at Tulane and in Europe, I joined 
the faculty of the University of Wisconsin in 1958. I left Wisconsin in 1973 
with the intention of spending my remaining years as Professor of Mathe- 
matics at the London School of Economics, returning after two years for 
personal reasons. I have retained my professional and personal ties to the 
LSE, however, and work there whenever my schedule permits. 


1. Introduction. In an elegant paper [G], David Gale has given a combinatorial 
proof that the game of hex cannot end in a tie, and shown that this fact (and its 
higher dimensional analog) is equivalent to Brouwer’s Fixed Point Theorem. In 
this paper we define a new version of hex, called either cylindrical or annular hex, 
by wrapping the standard hex board around a cylinder. It may then, if desired, be 
flattened out onto an annulus. It turns out that this game, too, cannot end in a tie. 
This fact is then used to prove a fixed point theorem for twist maps of the annulus. 
We also give an explicit winning strategy for one of the players. (No such strategy 
is yet known for standard hex.) 

We begin by describing the standard version of the game of hex, which we will 
call m X n rectangular hex. Actually only the square game (m = n) is ever played, 
but our analysis requires this greater generality. Two players, called Black and 
White, alternate in coloring the hexagons of the m <n rectangular hex board 
which is pictured (in two possible drawings) in Figure 1 with m = 8 and n = 7. 
There are two versions corresponding to which player moves first. In any case 
White wins if he can connect the top to the bottom by white hexagons; Black wins 
if he can connect the left to the right by black hexagons. Observe that Black has a 
win in both of the colorings of FiGuRE 1. 

The m Xn cylindrical hex board is obtained by identifying the right and left 
boundaries of the rectangular board. This is the same as adding an additional 
column of hexagons on the right and identifying these hexagons with the ones at 
the same height on the left column. The cylindrical hex board can also be drawn in 
the plane, on the annulus (region between concentric circles). The cylindrical 
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Fic. 1. Two drawings and colorings of the 8 by 7 rectangular hex board. 


8 < 7 hex boards are shown in FiGuRE 2, drawn on the annulus. The left and right 
boundaries of a rectangular hex board have been identified along the thickened 
line on the annulus going from the inner circle to the outer circle. The colorings of 
the top and bottom rectangular boards of FiGuRE 1 have been wrapped around the 
cylinder and flattened onto the top and bottom annular boards of FiGureE 2. 

In cylindrical hex White wins, as before, if he obtains a path of white hexagons 
from the top of the cylinder to the bottom. On the annulus this is a white path 
from the inner circle to the outer circle. For Black to win, he must obtain a circuit 
of black hexagons which goes around the central axis of the cylinder. On the 
annulus, this is a circuit with the origin (center of the concentric circles) inside. 
These definitions implicitly involve the Jordan Curve Theorem but this can be 
replaced by a purely combinatorial condition. Observe that the coloring of the 
board on the top in FiGuRE 2 is a win for White, but the coloring on the bottom is 
a win for Black. It is clear that the identification process which creates annular hex 
from rectangular hex preserves winning paths for White but not necessarily for 
Black. 

In section 2 we prove (Theorem 1) that cylindrical hex cannot end in a tie, and 
in section 3 we prove (Theorem 2) that when m is even White can always win. 

We now consider the question of the existence of fixed points of a homeomor- 
phism f of the annulus. At this point it is easiest to describe the annulus in polar 
coordinates, as the set of all points in the plane with radial coordinate r between a 
and b. The obvious example of a fixed point free f is a non-trivial rotation. This is 
excluded by requiring that f is a “twist.” This means that it rotates one boundary 
circle clockwise and the other, anticlockwise. This condition avoids rotations but 
still allows fixed point free homeomorphisms of the class f(@, r) = (6 + c(r), g(7)) 
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Fic. 2. Two colorings of the 8 by 7 annular hex board. 


where c and g are continuous on [a, b], with c(a) > 0, c(b) < 0, and g strictly 
increasing on (a,b). Such an example is not area-preserving. Poincare’s “Last 
Geometric Theorem” [P], established in full generality by G. D. Birkhoff [B], states 
that an area-preserving twist homeomorphism of the annulus has two fixed points. 
However we will concentrate on another property of the above family of fixed 
point free twist maps. They all have what we will call a “rising curve’. This is a 
curve which goes around the annulus, for which every point on it is moved closer to 
the outer circle. More generally, we will consider “monotone curves” which are 
either rising curves or falling curves. In section three we use the “‘No-tie’” Theorem 
(2) for annular hex to prove (Theorem 3) that a twist map of the annulus with no 
monotone curve must have a fixed point. The conditions of area-preserving and no 
monotone curve are independent in that in general neither implies the other. Of 
course for the special case of ““monotone twists” the no monotone curve assump- 
tion is implied by area preserving. For a twist map with no monotone curve our 
proof of Theorem 2, which implicitly incorporates Gale’s algorithm, can be used to 
find approximate fixed points. 


2. No-tie theorem for cylindrical hex. In order to show that cylindrical hex 
cannot end in a tie, we give a graph theoretic formulation of both rectangular hex 
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and cylindrical hex. In this formulation the hexagons pictured in the previous 
section become the vertices of a graph, and hexagons sharing a common face are 
adjacent. Thus the hex graphs defined in this section are the dual graphs of the 
planar graphs of Ficures 1 and 2. 

We begin by defining an “infinite strip” graph S,. The vertices of S, are the 
lattice points V, = {(i, j): i,j € Z,1 <j < n}. Two vertices z = (z,, z,) and w = 
(w,,w,) in V, are adjacent if |z, —w,| <1, lz, -—w,| <1, and either z<w 
(meaning z, < w, and z, <w,) or w <z. A portion of the infinite planar graph 
S,, is shown in FiGure 3. Note that the edges are all either horizontal, vertical, or 


VAVAVAVAVAVAVA 
VAVAVAVAVAVAVA 
VAVAVAVAVAVAVA 
VAVAVAVAVAVAVA 
VAVAVAVAVAVAV A 
VAVAVAVAVAVAVA 


Fic. 3. H,(8, 7). 


The m X n rectangular hex graph H(m, n) is the subgraph of S,, determined by 
the vertex set V(m,n) = {z © V,:1 <z, < m}. To define the m Xn cylindrical 
hex graph H.(m,n) with the same vertex set V(m, n) first consider the map ¢: 
V, — Vim, n) defined by (z,, z,) = (z, mod m, z,), with z, mod m chosen in 
{1,..., m}. Two vertices z and w in V(m,n) are adjacent in the cylindrical hex 
graph H.(m,n) if and only if there are adjacent vertices z’ and w’ in S, with 
z = $(z’) and w = d(w’). This means that the cylindrical hex graph H,(m, n) has 
all the edges of H,(m, n) plus the n edges between (m, j) and (1, j) for j = 1,...,n 
and the n — 1 edges between (m, j) and (1,7 + 1) for j = 1,...,n — 1. Ficure 3 
depicts the 8 X 7 rectangular hex graph and Ficure 4 gives a planar (annular) 
drawing of the 8 X 5 cylindrical hex graph. 


Fic. 4. H,(8, 5). 
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Let L: V(m,n) — {black, white} be any coloring of the vertices (hexagons) 
Vim,n). An L-winning path for White in rectangular m Xn hex is any path in 
H (m,n) of white vertices from some z with z,=1 to some w with w, =n. 
Similarly an L-winning path for Black is any path of black vertices from some z 
with z, = 1 to some w with w, =m. Observe that if a player has an L-winning 
path, he also has a simple (non self-intersecting) L-winning path. 

In cylindrical hex H,(m,n) an L-winning path for White is similarly any path in 
Hm, n) of white vertices from some z with z, = 1 to some w with w, =n. An 
L-winning path for Black is a bit more difficult to define, but we can give a 
combinatorial version of a “‘circuit of black hexagons going around the axis of the 
cylinder”. Let L*: V, — {black, white} be the “lifting” of the coloring L of 
V(m, n) to the infinite strip V, by the formula L*(z) = L(¢(z)). Observe that the 
lifted coloring L* is m-periodic in the horizontal coordinate, since for all (i, j) in 
V, we have L*(i + m, j) = L(bGi + m, j)) = L(GG, J) = L*G, j). An L-winning 
path for Black in the cylindrical hex graph H,(m, m) can now be defined as the 
d-image of any path P in S, of L*-black vertices between distinct vertices z and w 
with o(z) = d(w) (in other words z, # w,, z; — w, = 0 mod m, and z, =w, ). 
The path #(P) is a circuit in the graph H,(m, n) which “goes around the axis of 
the cylinder’’. 

We are now in a position to give a proof of the no-tie theorem for cylindrical 
hex. This can be proved directly using the Jordan Curve Theorem, but we prefer to 
give a purely combinatorial proof based on the similar result for rectangular hex, 
which was established graph theoretically by D. Gale [G] (see also [B, B, C] for a 
topological proof). 


THEOREM Q. For any coloring L of the rectangular k X h hex graph at least one 
of the players has an L-winning path. 


THEOREM 1. Let L be any coloring of the cylindrical m X n hex graph H.(m, n). 
Then either White has an L-winning path or Black has an L-winning circuit “ going 
around the axis of the cylinder”’. 


Proof of Theorem 1 assuming Theorem 0. The coloring L of V(m,n) can be 
lifted to a coloring L* of V, by L*(z) = L(¢(z)). Let L’ be the restriction of L* 
to the subset V(nm + 1,n) of V,. Considering L’ as a coloring of the rectangular 
hex graph H.(nm + 1,n) we may apply Theorem 0 with k = nm + 1 and h =n to 
assert that at least one player has an L’ winning path. We claim that the same 
player also has an L-winning path or circuit in the cylindrical hex graph H,(m, n). 

To see this first observe that if P = z',...,z’ is an L’'-winning path for White 
in H(nm+1,n), then its image @(P) is an L-winning path for White in 
Hm, n), since ¢@ preserves adjacency. Next suppose that P = z!,...,z’ is an 
L'-winning path for Black in H,(mm + 1,n). Observe that for each of the n + 1 
numbers i = 0,...,7 there is at least one L’-black vertex w’ belonging to the path 
P and of the form (im + 1, y,), for some y, in the n-set {1,..., n}. Since there are 
more y,’s than numbers in {1,..., n}, it follows from the “pigeon-hole principle” 
that y, = y, for some distinct a and b. Hence the ¢-image of the portion of the 
path P between w? and w? is an L-winning circuit for Black in HC(m, n), since 
am + 1 =[d(w”)], # [d(wb)], = bm + 1,[60v%)], — [6(w?)], = 0 mod m, and 
[d(w”)], = y, = y, = [b(w”)],. This establishes the claim and the theorem. 
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3. A winning strategy for white in annular hex. Since all the variations of hex 
discussed in this paper are finite games of perfect information which cannot end in 
a tie, it follows from the standard recursive game tree argument that one of the 
players must always have a winning strategy. The strategic analysis of these games 
considers first which player has a win and second what is an explicit winning 
strategy. For rectangular hex it is known that the player with the closer sides to 
connect can win, regardless of who goes first, when the dimensions of the board 
are not equal. That is, White can win if m > n, and Black if m <n. In this case 
explicit strategies are known. For the most important case of square hex, m =n, 
the player with the first move has a win. This fact is based on an indirect argument, 
generally attributed to Nash. However no explicit general strategy is known for 
White to win at square hex. For a more extensive discussion of the strategic side of 
rectangular (actually square) hex, see [B, B, C]. 

In contrast to the situation for rectangular hex, we are able to give an explicit 
general winning strategy for White in annular hex. It is independent of the relative 
dimensions or the “advantage” of the first move, but has the curious hypothesis 
that the circular dimension is even. (We believe the result is true for odd m, but 
cannot prove it.) 


THEOREM 2. If m = 2k is even, then regardless of n or who has the first move, 
there is a winning strategy for White in m X n annular hex. It can be stated explicitly 
in terms of the 180 degree rotation map T(i,j) = (i+ k mod m,j) of V(m,n) as 
follows: 


Case 1 (Black moves first). If Black’s last move was to color vertex (hexagon) z, 
then White colors vertex T(z). 


Case 2 (White moves first). White plays as in Case 1 whenever possible. If this 
is impossible, either because it is the first move and z is undefined, or because 
T(z) is already colored, then color any uncolored vertex. 


Proof. We first consider Case 1, where White goes second. In this case the 
strategy suggested in the theorem is always feasible, since T(z) must be uncolored: 
If T(z) were already white, then z must have been black prior to Black’s move, 
contrary to assumption. If T(z) were black, then similarly z would have been 
colored white prior to Black’s last move, also a contradiction. 

We now show that Black can never attain a winning circuit if White adopts the 
Case 1 strategy. Hence by the ‘‘no-tie theorem for cylindrical hex” (Theorem 1), 
White will always obtain a winning path. 

Suppose that Black obtains a winning circuit y on the hex graph H,(m, n) with 
a winning move z*. Then, as shown above, the vertex T(z*) is uncolored at the 
time the game ends. Color this vertex T(z*) white (this is no longer to be 
interpreted as a move in the game, merely an argument in this proof). Since the 
map T preserves adjacency, y* = T(y) is a circuit of white vertices which would 
be a winning circuit for Black if it were colored black rather than white. We now 
use the fact that the annular hex graph H.(m, n) is planar and that in particular it 
can be drawn on the annulus A centered on the origin with inner radius 1 and 
outer radius 2 by sending the vertex (i, /) to the polar coordinates 6 = i27/m and 

=(n —2+j)/(n — 1). Hence we may view the circuits y and y* not only 
combinatorially, but as Jordan circuits in the plane which contain the origin in 
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their insides. They are disjoint, since otherwise they would have to meet at a 
vertex, which would then have to be both black and white. The Jordan Curve 
Theorem thus requires that one of the curves is inside the other. But since the map 
T preserves the second vertex coordinate j; which determines the radius, the two 
curves y and y* = Ty have the same maximum radius. Thus the requirement that 
one is inside the other is impossible (consider the point with maximum radius on 
the inside curve). It follows that the assumption at the head of this paragraph, that 
Black obtains a winning circuit when playing against the case 1 strategy of White, 
is false. 

The proof for case 2 is the same, as the additional elements of the strategy 
make it feasible. 


4. Fixed points of twist maps of the annulus. In this section we will show that a 
twist map of the annulus which has no fixed point must have a monotone (rising or 
falling) circuit. Before giving the more formal material needed to rigourously 
define the terms and prove the theorem (3), we now outline the main ideas of the 
proof. 

Consider that a very fine annular hex graph is drawn on the annulus. For any 
twist map f of the annulus, which has no fixed point, each vertex of the graph will 
be moved to another point (not necessarily a vertex) of the annulus. We label the 
vertex according to which direction it is moved. If it moves mainly out towards the 
outer boundary label it O; if it moves mainly in towards the inner boundary label it 
I. If it moves mainly clockwise, label it C; if anti-clockwise, label it A. Thus every 
vertex can be labeled with either O, J, C, or A. Since f is a twist map, all vertices 
on the outer boundary will be labeled A, and all vertices on the inner boundary 
will be labeled C. Now color all vertices labeled A or C white, and color all 
vertices labeled O or J black. Apply Theorem 1 to assert that there is either a path 
of white vertices going from the inner to the outer boundary of the annulus, or a 
circuit of black vertices going around the annulus. Since the map f is continuous, 
it follows that for a grid size (distance between adjacent vertices) sufficiently small, 
adjacent vertices cannot be moved in opposite directions. That is, adjacent vertices 
cannot be labeled A and C, or J and O. 

Now suppose that there is a winning path of white vertices, going from an inner 
boundary vertex labeled C, to an outer boundary vertex labeled A. Somewhere 
along this path a vertex labeled C would have to be adjacent to a vertex labeled A 
—but this has been shown to be impossible. So it follows that there is a circuit of 
black (that is, J or O) vertices going around the annulus. By the same argument 
about adjacent vertices not having opposite labels, it follows that this circuit is 
either labeled all J Gnwards) or all O (outwards). Suppose it is labeled all J, so 
that every vertex in the path moves outwards. If the grid size is sufficiently small 
with respect to the minimum positive distance a point must move under f, then 
any point on the edge between two of the outward-moving vertices of the black 
circuit must also be outward-moving. Hence if the combinatorial circuit of black 
vertices is viewed as a simple closed curve, it is the rising curve whose existence is 
asserted in Theorem 3. Our proof of Theorem 3 is merely an elaboration of the 
argument sketched here. 

In order to formalize the above arguments we will first of all use the cylindrical 
form of the annulus, that is, A = S' x [0,1] = {((6,r): 6 in R/Z, O<r< 1}. 
Furthermore we will “‘lift’’ all arguments to the universal covering space S$ of A, 
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the horizontal strip S = {(x, y): x in R, 0 <y < 1}, via the covering map 7: 
S > A given by 7(x, y) = (x mod1, y). A homeomorphism F: S — S is called a 
lift of f: A ~A if fo =F. All lifted concepts will be invariant with respect to 
the translation T(x, y) = (x + 1, y) of S. In particular, any homeomorphism f: 
A-A can be lifted to a homeomorphism F: § > § with FT = TF (which is 
defined up to a power of 7) and any such F is the lift of an annulus homeomor- 
phism f. For z in S write F(z) = (F(z), F,(z)), and ||z|| = max (|z,|, |z,)). 


DEFINITION. A homeomorphism f: A — A is called a twist homeomorphism if it 
preserves orientation and some lift F: S — S satisfies F,(x,0) > x and F(x, 1) <x 
for all x in R. In this case we will also call F a periodic twist homeomorphism of S. 


PROPOSITION 1. Let F: S — S be a periodic twist homeomorphism of the horizon- 
tal strip S. Then for any positive number M there is a simple polygonal curve y: 
R > S, with y(t + 1) = T(y(t)) for t in R, such that |F,(z) — z,| <M for every 
point z = y(t) on the curve y. 


Proof. Since F is continuous and commutes with 7, it is uniformly continuous. 
Hence, there is a 6, 0 < 6 < M/4, such that ||z — w|| < 6 implies that ||F(z) — 
F(w)|| < M/4. Choose n so that n > 1 + 1/6. Embed the vertex set V, = {(i, j): 
1 <j <n} into the horizontal strip S via the map a(i, j) = (i/n,(j — 1)/(n — 1)). 
This is the “lifted” analog of drawing the hex graph on the annulus. Observe that 
if two vertices v and v’ are adjacent in V, then |la(v) — a(v’)|| < 1/(n — 1) < 6. 
This is our formalization of a “sufficiently fine hex graph”. Define p: S — R by 
p(z) = F(z) — z, and observe that p is periodic in that pT = p. Let L* be the 
coloring of the vertex set V, according to the rule that v is black if and only if 
Ip(a(v))| < M/2. Observe that L*(i + n, j) = L*(i, j) is a periodic coloring of V, 
and thus lifts some coloring L of the annular hex graph H,(n, n). 

By Theorem 1, either White has an L-winning path in H,(n, n), or Black has an 
L-winning circuit. First suppose White has a winning path. Let v!,...,v* be a 
path of L-white vertices with (v'), = 1 and (v*), =n. Since F is a twist map it 
follows that p(a(v!)) > 0 and p(a(v*)) < 0, and hence for some consecutive 
white vertices v and v’ in the path we must have that p(a(v)) > Oand pla(v’)) < 0, 
lla(v) — a(v’)|| < &. Setting z = a(v) and w = a(v’), we have that ||z — wll <6 
and hence ||F(z) — F(w)||<M/4. Also p(z) — p(w) = (M/4) —- (-M/4) = 
M/2, since z and w are a-images of white vertices. However this last fact is 
contradicted upon observing that 


Ip(z) — p(w) = |F\(z) -— 2, — Fi(w) + wy 
< |F,(z) — F\(w)l + Iw, - 21! 
< |F(z) — F(w)Il + Ilw — zl 
<M/4+6<M/2. 

Since White cannot have an L-winning path, Black must have an L-winning 
circuit. By definition (see section 2), this is the ¢-image of an L* black path 
v',...,v* in V, with o(v') = d(v*) and v' #v*. Let y*(t), O<t <1, be a 
parameterization of the polygonal path on S which connects the a-image of these 
vertices, and let y: R — S be the extension of y* satisfying y(t + 1) = T(y(¢)). 


Now let z = y(t) be any point on the circuit y. For some i, z is on an edge of the 
polygonal path y with a vertex w = a(v’) at its end. Hence ||z — w|| < 6 for a 
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black vertex w satisfying |p(w)| < _M/2. The proposition now follows from the 
estimate 


Ip(z)l = IF \(z) — 2] 
= |F\(z) — F\(w) + Fi(w) — w, + w, - 2]! 
< l|F(z) — F(w)Il + Ip(w)| + lw — zl 
<M/4+M/24+ M/4=M. 


THEOREM 3. Let f:.A — A be any twist map of the annulus (cylinder) A = {(0, r): 
0 in R/Z,0 <r < 1} which has no fixed point. Denote (6', r') = f(0, r). Then there 
is a simple closed curve P which goes around the axis of the cylinder, with either 

(i) r' > r for all points (0,r) on P (P is a rising curve), or 

(ii) r’ <r for all points (0,r) on P (P is a falling curve). 

Furthermore, given any B > 0 we may choose P so that it rises or falls arbitrarily 
steeply, in that |0’ — 6| /|r' — r| < B. Hence any twist map of the annulus without 
arbitrarily steep monotone curves has a fixed point. 


Proof. Let F: S — S be a twist map which lifts f. If z’ were fixed by F then 
f(z’) = wF(z') = (z') so f would also have a fixed point, contrary to assump- 
tion. Hence F has no fixed points. Since F is continuous and FT = TF it follows 
that F is uniformly continuous. The last two sentences together imply the exis- 
tence of a positive number D such that ||F(w) — w|l > D for all w in S. Choose a 
number M such that M < BD and M < D. By Proposition 1 there is a curve y on 
S with y(t +1) = T(y()) such that |p(z)| <M for all z on y, where 
p(w) = Fw) — w,. Define q(w) = F,(w) — w,. Since D < ||F(w) — wll = 
max[|p(w)|, |g(w)|] for all w in S, it follows that |qg(z)| > D for all z on y. Since 
y is aconnected set it follows that either q is positive on all of y or q is negative 
on all of y. Without loss of generality we may assume gq is positive. In this case 
P = ry is a rising curve for f on A, since for any z on y we have F,(z) > z, and 
hence for any point (6, r) = 7(z) on P we have that r’ = F,(z) > z, = r. Further- 
more |0’ — @| = |p(z)| <M so that |6’ — 6|/|r’-—r| <M/D < B, by choice 
of M. 
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1. Introduction. Everyone is by now familiar with Godel’s famous incomplete- 
ness theorem for the natural numbers, N. If any computable list of true statements 
about N is taken as a set of axioms (for example, Peano’s Axioms), then the set of 
all statements that can be derived from these axioms by the usual rules of proof is 
incomplete; that is, there is a true statement about N that is not provable. A 
closely related result is that there is no algorithm that decides whether an arbitrary 
sentence (in the appropriate language) is true in N. 

Is the situation any better for the rationals, Q@? One way to approach this 
question is to explore the logical relationship between N and Q. If it is tight 
enough then perhaps incompleteness and undecidability for @ can be deduced 
from the corresponding results for N. That this program does, in fact, work out was 
discovered by Julia Robinson in her 1948 dissertation [13] written under the 
direction of Alfred Tarski. Her work—a subtle combination of number theory and 
logic—yields a result that confirms a strong logical connection between the 
integers and the rationals. This paper is devoted to an exposition of the details of 
her discovery. 

The essence of the problem is the definability of sets in terms of other sets and 
operations. An instructive example is whether positivity is definable in the arith- 
metic of the integers, Z. Precisely: Given the operations of addition and multiplica- 
tion on Z, and the usual notions of first-order logic (= , V, J, V, A, 4, >, @), 
is there a formula SCX) that is true in Z precisely when X is a nonnegative 
integer? The reader might like to ponder this before reading on. To appreciate the 
difficulty, note that it is a simple matter to define any individual integer. If $)(X) 
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is the formula X = X + X, and S,X) is (X¥ =X-X) A A(X =X +X), then, in 
Z, S,. is true only for X = 0 and S, is true only for X = 1. And if S,CX) is then 
defined to be the sentence JY(S(Y) AX =Y+Y+--:+Y)(n summands), it 
is clear that $,(X) defines n. Gathering these infinitely many formulas into the 
single formula S,(X) V S,X) V --: V S,CX) Vv +++ will work to define the natu- 
ral numbers in Z, but infinite conjunctions are not part of first-order logic. 

Yet the sought-after definition exists. Let S(X) be the formula: 
FAAABACAD( X =A-:A+B:B+C:C+D--D). Because of Lagrange’s theo- 
rem that every element of N is a sum of four squares, it is immediate that for 
t € Z, S(t) is true in Z if and only if ¢ © N. In short, N is a definable subset of Z. 
This shows the close connection between nontrivial number theory and questions 
of definability. Using deeper results of number theory one can give simpler 
definitions, that is, definitions using fewer variables (see §5). On the other hand, 
the four-squares definition is as simple as possible in the sense that only existential 
quantifiers are used. By the way, it is clear that not all subsets of Z are definable: 
there are uncountably many of them and only countably many possibilities for 
definitions. A more important reason is due to Tarski who proved that {n © N: the 
sentence with Gédel number! nv is true in N} is not a definable subset of N (or of 
Z). In short, truth is not definable. Every recursively enumerable set is definable 
(essentially due to Gédel), a result that was dramatically improved in 1970, when 
Matijasevic showed that every r.e. set is, in fact, Diophantine (i.e., definable using 
only existential quantifiers) (see [2], [9], and also §5 of this paper; the definable sets 
as defined here are usually called the arithmetical sets, for more on which see [4, 
§3.5] and [5)). 

Now, what happens when we move to the rationals (again, with addition and 
multiplication as the primitive operations)? It is an easy exercise (use the sum of 
four squares) to show that the positive rationals are definable in Q. But are the 
integers definable in ©? This is one of the problems studied by Robinson in her 
thesis, and her solution is extremely elegant (and important—an application is 
given at the end of §2; see also §5). Note that, again using a four-squares 
argument, N is definable in Q if and only if Z is definable in Q; we focus on the 
definability of Z. Robinson’s result Gif not its proof) is well known to logicians. Our 
purpose here is to give an exposition of her proof with more details than are given 
in [13], in the belief that a wider audience will enjoy the interplay between number 
theory and elementary logic. 


2. The definition of Z. It is not at all obvious how to proceed, assuming one has 
faith that a definition exists. There is no clear way to get at the denominators of a 
rational number, which can only be referred to by a variable-name. It might be 
impossible, or at least very difficult, to use first-order logic to uncover the hidden 
numerator and denominator of an arbitrary rational. Of course, if one could get at 
the exact denominator (i.e., the lowest-terms denominator), then checking whether 
it equals 1 would solve the problem. Some hope was provided by an argument Julia 
Robinson found that yields a definition of those rationals whose exact denomina- 
tor is not divisible by 2. But the argument is nontrivial and there is no clear way to 
extend it to the other primes. First of all, one cannot even say what a prime is, 


'Gédel’s revolutionary work in logic depended on the assignment of numbers to formulas of logic, 
thus enabling him to construct sentences that, when interpreted in N, asserted their own unprovability. 
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since that would assume knowing what an integer was. And even if a way around 
this difficulty could be found and formulas guaranteeing p-free denominators did 
exist, one could not simply gather all the formulas into a single large conjunction. 
Thus some sort of uniform solution to the problem of clearing the denominators of 
primes would be needed. 

The argument for destroying even denominators is not essential to the main 
theorem, but is worth examining. It is based on the Gauss—Legendre three-squares 
theorem for integers, or rather, its extension to the rationals, which we now state. 


THEOREM 1. A nonnegative integer n is a sum of three rational squares if and only 
if n does not have the form 4°(8k + 7). 


Theorem 1 is much more difficult than the four-squares theorem; the integer 
version (see, e.g., [6] or [10] for a proof) states that the only integers that are not a 
sum of three integer squares are those of the form 4°(8k + 7). The rational version 
is an easy consequence of the integer version. However, a more modern approach 
is to first derive Theorem 1 as a straightforward consequence of the 
Hasse—Minkowski Theorem (the argument is given at the end of §3). Then the 
integer version follows by a result of L. Aubry (see [24, Chap. 3, App. 2]), who, in 
1914, gave a beautiful geometric proof that an integer that is a sum of three 
rational squares is necessarily a sum of three integer squares. 

Julia Robinson used Theorem 1 to show that the formula! 


JAABAC(7X? + 2 = A? + B? + C2) 


defines those rationals whose exact denominator is odd. To see that this definition 
works, first note that for a rational x =n/d, 7x* + 2 is a sum of three rational 
squares iff 7n? + 2d? is a sum of three rational squares. Now, if d is odd then 
7n* + 2d’ is congruent to 1 (mod 8) if is odd, or 2 or 6 (mod 8) if n is even. In 
either case, Theorem 1 applies to 7n* + 2d’, which is therefore a sum of three 
rational squares. And if d is even then 7n” + 2d* = 7 (mod 8) whence 7n? + 2d? 
is not a sum of three squares. Therefore 7x? + 2 is a sum of three rational squares 
if and only if d is odd. 

To further motivate the positive solution to the problem of defining Z in Q, we 
point out another illegal way to define the integers. Consider the formula TCX): 


VA CO{[0EA AVR(REA>R+1€EA)| >X EA}. 


An easy induction shows that for each n € N, Q E I'(m) (meaning: [() is true in 
@). Conversely, if @ — I(t) then, because A = N satisfies the hypothesis of T (ie., 
induction is true in N), t € N. Thus [CX) defines N in Q. Of course, this definition 
does not belong to first-order logic, but to second-order logic; in first-order logic 
One cannot quantify over subsets. 

We can now state Julia Robinson’s definition of Z in Q. Let S(Q, P, N) be 


AAABAC(2 + OPN? = A* + QB* — PC”). 


Think of Q and P as being fixed, so that S(Q, P, N) defines a subset of Q via the 
variable N. Now let RCX), the formula that defines Z, be: 


VOVP([S(Q, P,0) A VT{S(Q, P,T) = S(Q, P,T + 1)}| = S(Q, P, X)). 


The use of “2” in a formula is allowed because 2 = 1 + 1 and 1 is easily defined. Precisely, we may 
recast the formula as: JUSAABAC(U2 = U) ACU + U # U) A (7X2 +U+U =A? + B? + C?)). 
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Exactly as for the second-order definition using (XY), induction yields that for any 
neN, QE R(n). Moreover, S(Q, P, N) is equivalent to S(Q, P, —N), which 
implies that for any n € Z, Q — R(n). 

It remains to show where the formula S(Q, P, N) comes from and why the truth 
of R(x) in Q means that x is an integer. As alluded to above, the idea is that the 
truth of R(x) forces x’s exact denominator to be free of all primes. 

The definition of S(Q, P, N) is motivated by two lemmas that Julia Robinson 
discovered, lemmas whose proofs are not elementary. 


Lemma 1. If x is rational and p is a prime with p = 3 (mod 4), then A? + B? — 
pC? = px* + 2 is solvable in rational A, B,C (i.e., Q — SC, p, x)) if and only if the 
exact denominator of x is odd and not divisible by p. 


LEMMA 2. If x is rational and p and q are odd primes with p = 1 (mod 4) and 
(q/p) = —1, then A? + qB* — pC? = qpx’ + 2 is solvable in rational A, B,C (i.e., 
Q & S(q, p, x)) if and only if the exact denominator of x is relatively prime to p 
and q. 


Now, assuming these two lemmas (which are proved in §3), we can complete the 
proof that RCX) defines Z in Q. Suppose R(x) is true for some rational x and p is 
a prime congruent to 3 (mod 4). Then S(1, p, 0) holds (by Lemma 1, since the exact 
denominator of 0 is 1; alternatively, just let_ A = B = 1, C = 0). Moreover, for any 
m & Q, if SC, p, m) is true then so is S(1, p,m + 1) (use Lemma 1 and the fact 
that m and m+ 1 have the same exact denominators). Therefore the truth of 
R(x) implies that SC, p, x) is true, which, again by Lemma 1, implies that x’s 
exact denominator is not divisible by p or by 2. It remains to show that this 
denominator is not divisible by p when p = 1 (mod 4). First choose q to be an odd 
prime that is a quadratic nonresidue mod p; such must exist, for if s is any 
quadratic nonresidue, then one of s, p + s is odd, and hence has an odd prime 
factor that is a quadratic nonresidue. Then use Lemma 2 to show that S(q, p, 0) 
holds and that S(qg, p,m) implies S(q, p,m + 1) for any rational m. A final 
application of Lemma 2 to x yields that x’s exact denominator is not divisible by 
p. Therefore the exact denominator of x is 1, completing the proof that RCX) 
defines Z. 

How did Julia Robinson discover this definition? Here is Raphael Robinson’s 
recollection: “I knew the three-square theorem and helped Julia find how to 
eliminate the factor 2 from the denominator of a rational number. But the 
three-square theorem exhausted my knowledge of ternary quadratic forms, and 
Julia knew nothing about them. She started searching the literature for relevant 
material. She had no idea that the theory was simpler over @ than over Z. She 
looked at a lot of things that were not helpful. It was several months before she 
found the Hasse paper. Then she had to find suitable forms to eliminate the 
various primes from the denominators. Note that it is only the fact that ternary 
forms represent most numbers with a few exceptions that makes the definition 
possible. Binary forms represent too few numbers, quaternary forms too many. I 
remember asking her what good it would do to find formulas for eliminating all of 
the primes, if these could not be combined into one formula anyway. But she also 
found how to do this. The proof would have been a lot easier for someone who 
already knew about Hasse’s work. But I guess that those who knew it had never 
heard of Tarski’s problem. It must often happen that the tools for solving a 
problem are known, but not to the people working on the problem.” 
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We now discuss briefly the consequence of the definability of Z in Q that 
motivated the study of the problem. A theory is a set of sentences in the language 
of first-order logic with some relation and function symbols. For example, the 
theory of the natural numbers is usually taken to mean the set of those first-order 
sentences involving + and - that are true in the structure (N, +, - ). Church (see 
[4, p. 231]) proved that the theory of the natural numbers is undecidable, that is, 
there is no algorithm that decides, on input an arbitrary sentence, whether that 
sentence is in the theory (i.e., is true in (N, +, - )). Because N is definable in Z, it is 
not hard to see that the theory of (Z, +,- ) is undecidable too. Similarly, Julia 
Robinson’s definition of Z in Q implies that the theory of (Q, +, - ) is undecidable. 
This is in contrast to Tarski’s work showing that the theories of R, C, A (the 
algebraic numbers), and AO R are decidable (all with the operations + and :). A 
consequence is that Z is not definable in any of these fields. In fact, Tarski’s work 
yields a complete description of the definable sets in (R, +,-: ): A set is definable 
in this structure iff it is a finite union of intervals with algebraic or infinite 
endpoints (see [15)). 

There are several noteworthy extensions of Julia Robinson’s theorem. In [14] 
she proved that Z is definable in any number field, that is, any field extension L of 
the rationals having finite degree. This was proved in two steps. First she gave a 
definition of Z in @,, the ring of algebraic integers in L (the definition is relatively 
simple); then a definition of @, in L is given, a definition that, as in the case of 
defining Z in Q, relies on a substantial amount of algebraic number theory. It is 
still open whether the field of constructible numbers is decidable. J. Denef (see [3]) 
provided a dramatic improvement in the case of quadratic fields (1: Q = 2); he 
showed that Z is Diophantine over @,, which means that there is a definition of Z 
using only existential quantifiers (the four-squares definition shows that N is 
Diophantine over Z; for more on this notion, see §5). Denef and Lipshitz have 
conjectured that Z is Diophantine over @, for any number field L; this has been 
proved for several classes of number fields (totally real number fields (Lipshitz), 
quadratic extensions of totally real number fields (Denef and Lipshitz), Abelian 
extensions of @ (A. Shlapentokh and H. N. Shapiro [20]). However, L. van den 
Dries [23], building upon work of R. Rumely, showed recently that the ring of all 
the algebraic integers is decidable, and so the integers are not definable in this 
ring. 

Theories also arise as the set of sentences that can be proved from certain 
axioms by rules of logic. For example, the elementary theory of groups is the set of 
those sentences in the language with one operation symbol (the group operation) 
and one constant symbol (the identity) that are provable from the group axioms. 
The undecidability of Z allows one to deduce the undecidability of the theory of 
groups and of the theory of rings (see [22, p. 136]); the main point is that the 
theories of groups and rings are contained in the theory of Z. It is a consequence 
of the definability of Z in Q that the theory of fields is undecidable. This result is 
striking because of results of Tarski that the theories of real closed fields and of 
algebraically closed fields of characteristic zero are complete, which means that 
these theories coincide with the theories of R and C, respectively; thus these 
theories are decidable (see [15], [12, $$1, 2], [22, §5.5]). 

One can also look at larger fields. For example, Raphael Robinson has shown 
[17] that the elliptic curve y* = x? — 4 can be used to define Q in any pure 
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transcendental extension of Q. This combines with Julia Robinson’s work to yield a 
definition of Z in such fields. 


3. The Hasse—Minkowski Theorem. We say that a quadratic form 


, 
f(X,...,X,) = » a,X; 
i=] 
with coefficients a; in a field K represents zero in K if and only if the equation 
f(X) = 0 has a solution with X € K” and X + 0. 

The nontrivial number theory called up by Julia Robinson can all be described 
as a series of statements that various rational quadratic forms in four variables do 
or do not represent zero in @. These statements are by now routine special cases 
of one of the great theorems of twentieth-century mathematics, the Hasse- 
Minkowski Theorem. Our goal in this section is to present an elementary state- 
ment of the Hasse—Minkowski Theorem and to show how Robinson’s Lemmas 1 
and 2 can be deduced from it. 

We focus on the case K = @. Clearing denominators, we henceforth consider 
only integral (a; € Z) quadratic forms f(X) = L/_,a,X7. If f represents 0 in Q 
then, by the homogeneity of the equation f(X) = 0, there exists x € Z’ such that 
f(x) = 0 and GCD(x,,...,x,) = 1. Then of course f(x) = 0 (mod n) for every 
positive integer n. Taking the contrapositive point of view, we note that for each n 
the existence of such a solution of the congruence f(X) = 0 (mod n) is a necessary 
condition for f to represent 0 in Q. Since these conditions are important, let us say 
that an integral quadratic form f(X) represents 0 mod n if and only if there exists 
x € Z’ such that f(x) = 0 (mod n) and GCD(x,,..., x,,n) = 1. 

There is another obvious necessary condition for a rational quadratic form f to 
represent 0 in Q. If all its coefficients a; are nonzero, then they must not have the 
same sign, for then (and only then) the form would not even represent 0 in R. 


THEOREM 2 (HAssE—MINkowsk1). An integral quadratic form f represents zero in 
© if and only if f represents zero mod n for every positive integer n and f represents 
zero in R. 


A variant of the three-variable case of Theorem 2 was first discovered and 
proved by Legendre in 1785. The full theorem was formulated and proved in a 
suitably modern p-adic way by Hasse in 1923 [7, pp. 3-22]. Minkowski had earlier 
(1890) considered the related problem of the classification of rational quadratic 
forms, finding conditions for a rational quadratic form to represent zero as a 
consequence. Fine expositions can be found in [1] and [21]. 

The Hasse—Minkowski Theorem must be supplemented before it can be used to 
prove the existence of nontrivial rational solutions of the equation f(X) = 0. As it 
stands, the theorem obliges us to check an infinite number of conditions, one for 
each n and one more for R. Is that any better than computing f(x) for every 
x € QQ’? What can we do? 

To begin with, we need only consider prime power moduli. Suppose that 
f(x,) = 0 (modn,), 1 <j <s, and that GCD(m,,n,) = 1 for i #j. The Chinese 
Remainder Theorem produces x € Z’ such that x =x, (modzn,) for all j. It 
follows that f(x) = 0 Gmod IIn,). In a typical application the n,; are prime powers. 
We state the alternative version of Hasse—Minkowski that results. 
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THEOREM 2'. An integral quadratic form f represents zero in Q if and only if f 
represents zero mod |” for every prime number | and every n > 1 and f represents 
zero in R. 


For each prime number 7 there is a field Q, of characteristic 0, called the field 
of /-adic numbers, which has the property that an integral quadratic form repre- 
sents 0 in @, if and only if it represents 0 mod /” for all n > 1. Theorem 2’ can 
therefore be reworded to state that an integral quadratic form represents 0 in Q if 
and only if it represents 0 in each of a number of extension fields of Q, which is 
how Hasse actually stated his theorem. The aesthetic advantage of doing this is 
that the hypotheses concerning R and the prime numbers / are put on a uniform 
footing. The mathematical advantage is that it is much easier to work with the 
fields Q, than with the rings Z//”Z. 

We must go further. For an odd prime / the condition that an integral quadratic 
form f represent zero mod /” for all nm can be expressed in terms of the Legendre 
symbols (a /l) where the a are divisors of the coefficients of f. Recall that (a//) is 
defined for odd primes 7 and integers a such that / + a by the rule: (a /l) = +1 if 
the congruence x”? = a (mod /) has a solution and (a//) = —1 if the congruence 
has no solution. We will express the condition that f represent 0 mod 2” for all n 
as a set of explicit congruences on the coefficients of f. 


LEMMA 3. Let | be an odd prime, and let a, b, c, d be integers not divisible by 1. 
Then 

(1) aX* + bY* represents 0 mod |” for every n if and only if (—ab/I) = 1. 

(2) aX* + bY? + cZ? represents 0 mod I” for every n. 

(3) aX’ + bY* + IcZ* + IdW? represents 0 mod |” for every n if and only if 
(—ab/1) = 1 or (—cd/1) = 1. 


Proof. We recall the fact (see, e.g., [6, p. 67]) that the congruence x* =f 
(mod /”) with ¢ an integer not divisible by the odd prime / has a solution for every 
n iff (t/1) = 1. It follows that if s and ¢ are prime to / and sx* = t (mod /) has a 
solution, then sx* = t (mod /”) has a solution for every n. 

(1). If (—ab/l) = -1 then aX’ + bY” does not represent 0 mod /. If, on the 
other hand, (—ab/l) = +1, then the congruence aX* = —b has a solution mod / 
(let X be €a~! where €* = —ab (mod /)), whence the congruence has a solution 
mod /”, which yields the desired representation of 0 mod /” (with Y = 1). 

(2). Let f =aX* + bY’ + cZ’, and let N = Kx € (Z/IZ)°: f(x) = 0 (mod J)}|. 
We first prove that f represents 0 mod / by showing that N > 1. 

Fermat’s Little Theorem asserts that for m € Z, m'~' = 1 (mod J) if 1 + m, and 
m'-!=0 (mod/) if I|m. Therefore N = L{1 — f(x)'"':x © (Z/1Z)*} (mod J). 
Since 1 — f(x)'~' has degree 2(/ — 1), it is a linear combination of monomials 
X’Y?%Z" where at least one of the exponents p,qg,r is less than / — 1. But 
Ly oW* = 0(mod J) for0 <s <1-—1.0f1<s <1—1, write Di ws = ri? ys 
= (1 — y—D)7(1 — y*) (mod /), where y is a primitive root mod /, and note that 
y-D) = 1, y’ # 1(mod /).) Therefore we can conclude that N = 0 (mod /). But 
N # 0 because f(0) = 0. It follows that N >/> 1. 

Thus there exists (x, y,z) € Z° such that f(x, y,z) = 0 (mod /) and at least 
one of x, y, z is not divisible by /. If, for example, / + x, then the solvability of the 
congruence aX* = —by* — cz? (mod 1) (with X = x) implies solvability mod /” for 
all n. Thus there exists € such that a€? = —by” — cz* (mod /”) and X =é, Y =y, 
Z =Z gives a representation of 0 mod /” by f. 
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(3). If (—ab/1) = 1 or (—cd/l) = 1, then by (1) f represents 0 mod /” with 
Z=W=O0orx=Y=Q0. 

Conversely, suppose that f represents 0 mod I? and that f(x, y,z,w) =0 
(mod /?) with x, y, z,w not all divisible by /. If (—ab/1) = —1, then the congru- 
ence ax” + by* = 0 (mod /) implies that /|x and /|y. Therefore cz? + dw? = 
(mod 1). Since / does not divide both z and yw, it follows that (—cd/l) = +1. O 


LEMMA 4. Let a,b,c, d be odd integers. Then 

(1) aX? + bY? + cZ? represents 0 mod 2” for every n except when a =b=c 
(mod 4). 

(2) aX? + bY? + cZ? + dW” represents 0 mod 2” for every n except when 
a=b=c=d (mod 4) anda+b+ct+d= 4 (mod 8). 

(3) aX? + bY? + cZ? + 2dW? represents 0 mod 2" for every n. 


Proof. We recall the fact [6, p. 67] that the congruence X? = t (mod 2”) with t¢ 
an odd integer has a solution for every n iff t = 1 (mod 8). It follows that if 
sX”? = t (mod8) has a solution with s and t odd, then sX¥* = t (mod 2”) has a 
solution for every n. 

(1). If a =b =c (mod 4), then aX’ + bY? + cZ? does not represent 0 mod 4. 
On the other hand, suppose, for instance, that a # b (mod 4). Then a + b =0 or 
4 (mod 8). Thus aX? + bY? + cZ? = 0 (mod 8) with X¥ = Y= 1 and Z = 0 or 2. 
Letting X satisfy aX? = —b — cZ? (mod 2”) gives a representation of 0 mod 2” by 
our form. 

(2). If a=b=c =d (mod4) anda +b+c+d = 4 (mod8) then aX’ + bY’ 
+ cZ* + dW” represents 0 mod4 only by taking X,Y, Z,W all odd and hence 
does not represent 0 mod 8. If the coefficients are congruent mod 4 and their sum 
is 0 mod 8, then X = Y = Z = W = 1 gives a representation of 0 mod 8 that can 
be refined as in the proof of (1) to a representation of 0 mod2” for every n. 
Finally, if, for instance, a 4 b (mod4) then it follows from (1) that there is a 
representation of 0 mod 2” by aX’ + bY? + cZ* + dW? with W = 0 for every n. 

(3). In view of (1) we may as well assume that a = b (mod 4). Hence a + b + 2d 
= 0 or 4 (mod 8). It follows that a + b + cy* + 2d = 0 (mod 8) with y = 0 or 2. 
Refinement as before leads to representations of 0 mod 2” for every n. O 


Starting with Lemmas 3 and 4, it is not difficult to show that every integral 
quadratic form in 5 variables represents 0 mod/” for every prime / and every 
n > 1. Thus the /-adic hypotheses of Theorem 2’ are automatically satisfied by all 
forms in 5 or more variables, which-dramatically reduces arithmetic interest in 
their coefficients. 

We can at last proceed to the number theory in Julia Robinson’s definability 
theorem. The key statements are Lemmas 5 and 6, which is where the 
Hasse—Minkowski Theorem makes its appearance. 


Lemma 5. Let p = 3 (mod 4) be prime, k € Z, p + k, and 4 + k. 

(1) f, =X*+ Y* —pZ? —kW* represents zero in Q if and only if k #p 
(mod 8). 

(2) f, =X? 4+ Y? — pZ* — pkW* represents zero in Q if and only if (k/p) = -1 
and pk # p (mod 8). 


Proof. By Lemma 3(2) applied to X* + Y* — pZ’, both f, and f, represent 0 
mod /” for all odd primes / # p. By Lemma 3(2) applied to X¥* + Y? — kW”, f, 
represents 0 mod p”. By Lemma 3(3) and the fact that (—1/p) = —1, f, repre- 
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sents 0 mod p” iff (-—k/p) = —(k/p) = 1. If k is even then both f, and f, 
represent 0 mod 2”, by Lemma 4(3). Finally, if k is odd, write f; = X? + Y? — pZ? 
— sW*. By Lemma 4(2), f,; represents 0 mod 2” except in the case where 1 = —p 
= —s (mod 4) and 2 — p — s = 4 (mod 8), conditions that are equivalent to the 
single congruence s =p (mod8). Thus Lemma 5 follows from Theorem 2’. 
OD 

Lemma 6. Let p and q be distinct odd primes with p = 1 (mod 4) and (q/p) = 
—]. Letk € Z be such thatp tk, q tk, and 4+ k. 

(1) f, =X* + qY? — pZ* — kW? represents 0 in Q. 

(2) f, =X? + qY* — pZ* — pkW? represents 0 in Q if and only if (k/p) = 1. 

(3) f; =X? + qY* — pZ* — qkW? represents 0 in Q if and only if (k/q) = 1. 

(4) fy =X? + qY? — pZ? — pqkW°* represents 0 in Q if and only if (qk /p) = 1 
and (pk/q) = 1. 


Proof. Similar in spirit to the proof of Lemma 5. Consider it an exercise. O 


LEMMA 7. Let n and d be integers with GCD(n, d) = 1. Let p = 3 (mod 4) be 
prime. Set pn* + 2d? = st?, where s,t € Z and s is squarefree. The following two 
conditions are equivalent. 

(1) d is odd and not divisible by p. 

(2) s # p (mod 8) and if p|s with s = pk, then (k/p) = —1. 


Proof. (1) = (2). If d is odd, then st? = 2 — n? = 1 or 2 (mod 4). But pt? = 
or 3 (mod 4). Hence s # p (mod 8). If p + d, then p + st’, from which it follows 
that pts. 

(2) = (1). If d is even, then 1 is odd so st? = p (mod 8), from which it follows 
that ¢ is odd and s = p (mod 8). If pld, then st? is divisible by p but not by p?. 
Therefore p|s and p + t. Writing s = pk, we have kt* = n* (mod p), and there- 
fore (k/p)=1. O 


Lemma 8. Let n and d be integers with GCD(n, d) = 1. Let p and q be odd 
primes such that p = 1 (mod 4) and (q/p) = —1. Set pqn? + 2d* = st*, where 
s,t € Z and s is squarefree. The following two conditions are equivalent. 

(1) d is relatively prime to p and q. 

(2) If p|s with s = pa, then (a/p) = 1; and if q\|s with s = qb, then (b/q) = 1. 


Proof. (1) = (2). If d is prime to p and gq, then neither p nor q divides s. 

(2) = (1). If pid, then st? is divisible by p but not by p*. Therefore pls and 
p +t. Writing s = pa, we have at* = qn’ (mod p), from which it follows that 
(a/p) = (q/p) = —1. The case qld is similar, since (q/p) = (p/q). O 


Proof of Lemma 1. Let x € Q, and let p = 3 (mod 4) be prime. Express x in 
lowest terms as n/d, and let pn* + 2d? = st*, where s,t € Z and s is squarefree. 
It follows immediately from Lemmas 5 and 7 that X¥* + Y* — pZ? — sW? repre- 
sents 0 in Q iff d is odd and not divisible by p; and in such a representation of 
zero we will have W # 0, since X* + Y? — pZ* does not represent 0 (by Lemma 
4(1)). Taking A = tX /dW, B = tY /dW, and C = tZ /dW, we find A* + B? — pC’ 
=px*+2. QO 


Proof of Lemma 2. The proof is similar to that of Lemma 1, using Lemmas 6 
and 8 in place of Lemmas 5 and 7. To show that W # 0, it is enough to prove that 
f =X*+qY? — pZ’ does not represent 0 mod p’. We need only show that if 
f(x,y, z) = 0 (mod p?), then x, y,z are all multiples of p. Since x* + qy’ = 
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(mod p) and (q/p) = —1, it follows that x and y are divisible by p, and hence 
that z is also divisible by p. O 


As a final application of the Hasse—Minkowski Theorem we prove the rational 
three-squares theorem cited earlier (Theorem 1). When is a positive integer m a 
sum of three rational squares? Write m = 4%s where a,s © N and4+ts. Then m 
is a sum of three rational squares if and only if f=X*2+Y?7+Z? -—sw? 
represents 0 in Q@ (since we cannot have W = 0 in such a representation). By 
Lemma 3(2), f represents 0 mod /” for all odd primes / and all n (with W = 0). By 
Lemmas 4(2) and 4(3), f represents 0 mod 2” for all nm except in the case s = 7 
(mod 8). Therefore, by Theorem 2’, m is a sum of three rational squares if and only 
if s # 7 (mod 8), which is what we set out to prove. 

We close §3 with a simple but important algebraic lemma crucial in many 
applications of the Hasse—Minkowski Theorem. Though it is avoided in the proof 
of Robinson’s theorem, it could have been used and should not be forgotten. 


Lemma 9. Let K be a field of characteristic different from 2. Let f(X) = 
i ,a;,X? and g(X) = Xi_,a,X? — sX?,,, where s and all the a, are nonzero 
elements of K. Then there exists x € K" such that f(x) = s if and only if g represents 
O in K. 


Proof. If there exists x € K" such that f(x) =s then g(x, 1) = 0, showing that 
g represents 0 in K. 
Now suppose that g represents 0 in K. There are two cases to consider. 


Case 1. f does not represent 0 in K. If g(x) = 0 with x # 0, then x,,, # 0 and 
F(X /% 413 ane. X,/Xp41) = 5. 


Case 2. f does represent 0 in K. Let x #0 € K’ satisfy f(x) = 0. Find y € K’ 
such that f(y) = 0 and Y'_,a;x,;y, = 1. Then f(x + $sy) = s. Produce y as follows. 
Choose u € K” such that )_,a,x,u; = 1. Take y = u + Ax, where A = —f(u)/2. 
O 


4. Cubic forms. From the point of view of number theory, the Hasse— 
Minkowski Theorem used in §3 is rather special. Straightforward generalizations 
of Theorem 2 to other classes of Diophantine equations are usually false. Such is 
the case even for homogeneous cubic equations, seemingly the simplest equations 
beyond those covered by Theorem 2. 

It is easy nowadays, using computers, to find likely candidates for counterexam- 
ples to a Generalized Hasse—Minkowski Theorem, but it is not so easy to prove 
that they are true counterexamples. One has got to show that a specific Diophan- 
tine equation has no rational solutions, and congruence considerations will not 
help. The method has usually been reliance on ad hoc computations. 

The theory of integral cubic forms! in 3 variables is inextricably bound up with 
the theory of elliptic curves. Selmer (1951) was the first to find a 3-variable cubic 
counterexample to Hasse—Minkowski: 3X? + 4Y? + 5Z°? represents 0 mod n for 
every n but fails to represent 0 in Q. 

It was thought for several years that Hasse—Minkowski might hold for homoge- 
neous cubic equations in 4 variables. Counterexamples proved very hard to 
construct, but they were found, the earliest by Swinnerton-Dyer (1962) and Cassels 


1” form is simply a homogeneous polynomial. 
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and Guy (1966). Recently Colliot-Théléne, Kanevsky, and Sansuc (1985) found the 
following nontrivial infinite family of counterexamples: X° + p?Y? + pqZ? + 
q°*W°, where p and gq are prime numbers, p = 2, g = 5 (mod 9), represents 0 
mod n for all n but fails to represent 0 in @. An idea of Manin (1970) provides a 
uniform explanation for all currently known 4-variable cubic counterexamples to 
Hasse—Minkowski. 

It is possible that the obvious analogue of the Hasse—Minkowski Theorem is 
true for cubic forms in 5 or more variables. No counterexamples have been found. 
There can’t be any counterexamples in 16 or more variables, since every integral 
cubic form in 16 or more variables represents 0 in Q, by a theorem of Davenport 
(1963). The conjecture that every integral cubic form in 10 or more variables 
represents zero in @ has been proved for nonsingular forms by Heath-Brown 
(1983). 

Though there has been recent progress, it remains an open problem to find a 
usable set of necessary and sufficient conditions that a cubic form represent 0 in Q. 
The Hasse principle, the working hypothesis that there should be some kind of 
strong relation between local (mod 7) solvability and global (Q) solvability, seems 
to be a reliable guide, even if not precise. 

See [11] and [19] for a discussion of these and related results. 


5. Hilbert’s Tenth Problem. The definition of Z in Q that Julia Robinson 
discovered is complicated, especially when compared to the definition of N in Z 
using the four-squares theorem, where only existential quantifiers are used. An 
interesting Open question is whether such a simple definition exists for defining Z 
in Q (i.e., Is Z Diophantine over @?). This problem is especially important because 
of its connection with Hilbert’s Tenth Problem, which asked for an algorithm to 
decide whether an arbitrary Diophantine equation has a solution in Z. Hilbert’s 
problem was solved, negatively, in 1970, by Y. Matijasevic who completed the final 
step in a program developed by Julia Robinson, Martin Davis, and Hilary Putnam 
(see [2]). In fact, Matijasevi¢é has shown (see [8]) that there is no algorithm for 
solving 9-variable Diophantine equations over N. Because N is Diophantine over Z, 
this implies that there is no algorithm for solving Diophantine equations over Z. 
For such an algorithm over Z would yield one over N, as we illustrate in the case of 
two variables: PCX, Y) = 0 has a solution in N? if and only if 

P(A* + B° + C* 4+ D’, E* + F* + G’?+H’) =0 
has a solution in Z®. Because 4 - 9 = 36, this technique shows that there is no 
algorithm for solving 36-variable equations over Z. We digress momentarily to 
show how 36 can be reduced to 27, a consequence of the following 3-variable 
definition of N in Z: An integer X is nonnegative iff X has the form A* + B? + 
C* + C. The validity of this definition (found by Raphael Robinson) can be seen 
by using the integer three-squares theorem (§2) to write 4X + 1 as a sum of three 
squares, of which two must be even and one odd. Alternatively, one could use the 
fact that every nonnegative integer is a sum of three triangular numbers, also a 
consequence of the three-squares theorem (see [6, p. 179]). Raphael Robinson [16] 
found a two-variable definition of N in Z by using properties of the Pell equation, 
but X is not expressed as a polynomial in the two new variables, and the definition 
also includes an inequation, so it does not serve to further reduce the 27 to 18. 
However, Matijasevic recently modified this idea, finding that an integer A is 
nonnegative iff there are integers X and Y with Y+#0 such that X?- 
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A Simple Proof of a Simple Version of the Riemann Mapping 
Theorem by Simple Functional Analysis 


P. R. GARABEDIAN 
Department of Mathematics, New York University, NY, NY 10012 


The purpose of this note is to sketch a proof of a restricted version of the 
Riemann mapping theorem that only requires an elementary knowledge of com- 
plex analysis and avoids concepts like completeness or compactness. The proof is 
motivated by the relationship of the map function to the Green’s function and the 
Szego kernel function [1]. 

Let D be a simply connected region of the complex plane bounded by a smooth 
Jordan curve 0D enclosing the origin, and let A be the class of functions f(z) 
analytic in the closure of D. Consider the problem of minimizing the norm 


2 2 
= dz 
fll fir \dz| 
subject to the constraint 
f(z) 
dz = 1, 
F(0 )= 271 | aD Zz 7 


which ensures that |||? has an infimum d > 0 by virtue of the inequality 


f(z) _ | 1 
I, z dz < | max s) flail |dz|. 


Because (f,,, + Jn )/2 is a normalized function in A whenever f,, and f, are, so 
that |If,, + f,lI? = 4d, the parallelogram law 


foe — fell? + Whe + fill? = 2ll fell? + IF, lI? 


implies that any minimal sequence f, for this extremal problem is a Cauchy 
sequence with ||f,, — f, ll ~ 0. Moreover, because ||f,|| remains bounded, we see 
from the estimate 


(fms Fin) — (8s fn) | S18 — SFn> fm) | + 18h n> fin — fn) | 
< (maxlgl)(IIfiall + lal lMfon — fal 


that the sequence of complex numbers (gf, f,,) converges and the formula 


Lg] = tim et) (Shiota) 


ne d 


= Jim sf glf, | ldz 


defines a bounded linear functional over the space of continuous functions g on 
the curve 0D. 
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If ¢ is any complex number and g is in A with g(0) = 0, then f, + eg isa 
normalized element of A and, therefore, 


d<\lf, + egll’ =Ilf,ll’ + 2Ref{e(gf,,, f,)} + lel"llafnll’. 


Since |g| is bounded on 0D there is a fixed number K such that lef, ll? < K, so 
that letting n — © we have 


d<d+2Re{eL[g]} + Klel’. 


That is only possible for arbitrary ¢ if L[g] = 0. Since L [1] = 1, substitution of 
the renormalized function g(z) — g(O) into this equation yields the variational 
condition 


g(0) =L[g]. 


Hence when w* is any point outside D we can put g(z) = log(z — w*) to arrive at 
the identity 


log|w*| — L[loglz — w*|] = 0. 
For w =u + iw inside D we proceed to establish that the formula 
log| F(w)| = log|lw| — L[log|z — wl] 


defines by analytic completion a function ¢ = F(w) mapping D conformally onto 
the interior of the unit circle || < 1. 
If g(z,t) depends on a parameter ¢ in such a way that 


g(z,t) — g(Z, to) - dg 


—|=0 


tt 


uniformly for all z on 0D, then the boundedness of the linear functional L implies 


that 
0 0g 
—L = L|—}. 
a rls] = 


Consequently L commutes with the Laplace operator 
a? a? 
—— + ——_. 
du- ~— dv? 
in the special case g(z) = log|z — w|, which shows that log|F'| is harmonic, 
A log|F| = A log|w| — L[A log|z — w|] = 0. 
The corresponding analytic function F(w) is seen by inspection to have a simple 
zero at the origin. 


If w and w%* are situated at equal distances from 0D along the same normal, 
then 


z2—W 
= 0 


lim log 
lIw—w*| >0 


Z—w* 


uniformly for all z on 0D. The reason for this is that for any 5 > 0 the region of 


826 ALFONSO VILLANI [November 


the z-plane satisfying 
1 
1+ 6 


is bounded by two disjoint circles of a coaxial system; and because 0D is smooth, it 
will lie entirely inside any such region when w and w% are sufficiently close. It 
follows that 

|-° 


as w and w* approach the same point on the boundary 0D, so |F| = 1 there. From 
the maximum principle we conclude that |F| < 1, and from the argument principle 
we conclude that F is univalent, since it has only one simple zero. Thus it is the 
desired map function, and we have proved the 


Z—wWw 
<1+6 


< 


zZ—w* 


lw | . z-—w 
lim log|F'| = lim log —— — lim L}to8 —_——_— 
|w* | z—-w* 


THEOREM. Any simply connected region of the plane bounded by a smooth Jordan 
curve can be mapped conformally onto the unit circle. 


This work was partially supported by NSF Grant DMS-8922805. 
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On the Equality L'(w) = Li, (w) 


loc 


ALFONSO VILLANI 
Department of Mathematics, University of Messina, Messina, Italy 


The starting point of this note is the consideration of two quite obvious facts. 
First, if we are given a Lebesgue measurable set FE C R” and a function u: E — R 
which is locally Lebesgue integrable on £, that is integrable on every compact 
subset of E, then u is not integrable over all E in general. Second, there are, 
however, Lebesgue measurable sets E for which local integrability of a function 
implies integrability over all EF; indeed, all sets E which are “essentially compact’’, 
that is which contain a compact set H such that & \ A has measure zero, have 
this property. 

Thus, a natural question is: for which E’s does local integrability imply integra- 
bility? In a more abstract setting, one can ask about a characterization of those 
measure spaces for which every locally integrable (in a sense to be specified) 
function is integrable on the entire space. Providing such a characterization is the 
aim of this note. It turns out, in particular, (see the Corollary below) that the 
answer to the starting question is: exactly those E’s which are essentially compact. 


Let (QO, Y, w) be a measure space. We will assume throughout that 

i) O is a metrizable space; and 

ii) if K CQ is a compact set, then K € Wand w(K) < ~. 

A function u: Q — R is said to be locally p-integrable provided that wu is 
measurable and f,|u|du < © for every compact set K CQ. The set of all 
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The starting point of this note is the consideration of two quite obvious facts. 
First, if we are given a Lebesgue measurable set E CR” and a function u: E —- R 
which is locally Lebesgue integrable on F, that is integrable on every compact 
subset of EF, then u is not integrable over all E in general. Second, there are, 
however, Lebesgue measurable sets E for which local integrability of a function 
implies integrability over all E; indeed, all sets E which are “essentially compact’, 
that is which contain a compact set H such that & \ A has measure zero, have 
this property. 

Thus, a natural question is: for which E’s does local integrability imply integra- 
bility? In a more abstract setting, one can ask about a characterization of those 
measure spaces for which every locally integrable (in a sense to be specified) 
function is integrable on the entire space. Providing such a characterization is the 
aim of this note. It turns out, in particular, (see the Corollary below) that the 
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Let (QO, Y, w) be a measure space. We will assume throughout that 

i) (1 is a metrizable space; and 

ii) if K CQ is a compact set, then K € Wand w(K) < ~. 

A function u: Q — R is said to be locally w-integrable provided that u is 
-measurable and f,lu|du < © for every compact set K CQ. The set of all 
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locally w-integrable functions is denoted by L},.(). Of course we have L'(u) € 


Linc H). 

It is worth recalling here before stating the Theorem below that a set A € LY, 
with (A) > 0, is said to be a -atom provided that if B € &, B CA, then either 
u(B) = 0 or pA \ B) = 0. 


THEOREM. The following condition (*) is necessary and sufficient in order that 
Li(w) = Ligc(u): 

(«) there exists a compact set H CQ, such that either w0Q \ H) = 0 or other- 
wise Q \ His a finite union of w-atoms of finite measure. 


We need the following easy topological lemma. 


Lemma. Let K be a compact subset of a Hausdorff space 0, and let G,,...,G, be 
open subsets of {1 such that K ©G,U-:: UG.,. Then there exist K,,...,K 
compact subsets of Q, with K,; © G;,, i= 1,...,r, such that K = K,U-::: UK,. 


l 


r? 


Proof. Consider first the case r = 2. 

Let H; =KO(G,), i=1,2. Then H,, H, are disjoint compact sets, with 
H, € G,, H, € G,. Since 2) is a Hausdorff space, there exist A,, A>, disjoint open 
sets, with H,; CA;, i = 1,2. Obviously we can assume A, CG,, A, CG,. Let 
B, =A, U(G, \ G,), B, = A, U(G, \ G,). Then B,, B, are disjoint open sets, 
with K \ G; CB,, 1 = 1,2. Consequently, if we put K; = K \ B,, i = 1,2, then 
K,, K, are compact sets, with K = K, UK, and K,; CG,, i = 1,2. 

An easy inductive argument completes the proof. 


Proof of the Theorem. It is well known (see [1], Lemma 6.5.8) that if u: Q > R 
is xmeasurable and A € YW isa p-atom, then uw is w~-almost everywhere constant 
on A; hence we have /,|u| du < © provided that w(A) < «. Thus, it is clear that 
if condition (*) is satisfied, then L'(w) = Lj, (wu). 

To prove the reverse implication, assume the equality L'(u) = Li,.(u) and 
consider the collection Y of all open sets G C1 containing no compact set of 
positive measure. Let V = U{G: G € ¥V}. In other words, V is the set of all points 
x € QO which have the following property: there exists an open neighbourhood of x 
which contains no compact set of positive measure. We will prove that: 

a)Ve YF, 

b) H = \ Vis a compact set, 

c) if uV) > 0, then V is a finite union of -atoms of finite measure. 

Of course, b) and c) imply condition (« ). 

To prove a), consider any compact set K C V. By the definition of V and the 
compactness of K there exist finitely many sets G,,...,G. © VY such that K CG, 
U-:: UG,. According to the Lemma, there are compact sets K,,..., K,, with 
K, ¢G, (and hence yu(K;) = 0), i= 1,...,r, such that K=K,U--- UK,,. It 
follows that wCK ) = 0 and this proves assertion a). 

Next, to prove b), assume by contradiction that there exists a sequence {x,} in 
H such that no subsequence is convergent in H, and hence in () since A 1s closed. 
Of course, it can be also assumed that x, #x,, if n #m. Then it is easy to 
construct a sequence {B,} of pairwise disjoint open subsets of 0, with x, < B, for 
every n = 1,2,... . Moreover, it can be assumed that the diameter of B,, with 
respect to any fixed compatible metric d on (), tends to zero as n ~ ©, As a 
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The Ranks of Tournament Matrices 


D. DE CAEN 
Department of Mathematics and Statistics, Queen’s University, Kingston, Canada K7L 3N6 


A tournament matrix of order n is an n Xn matrix M of 0’s and 1’s such that 
M;,= 0 and M,, + M,, = 1 for all i #j. In other words, M satisfies the matrix 
equation 


M+M'=J-I, (1) 


where M’‘ is the transpose of M, J is the n Xn matrix of all 1’s, and J is the 
identity matrix. See [1] for background information on tournaments. Since M has 
only {0, 1}-entries, it may be viewed as a matrix over any field F. We shall make a 
few observations concerning the rank of M over F. 

First of all, we mention a simple definition of the rank of any m X n matrix A 
over F.. This interpretation of rank is perhaps not widely known, but is occasionally 
quite useful. A rank-one matrix is, by definition, a matrix of the form uv’, where u 
is a non-zero m X 1 vector and v is a non-zero n X 1 vector. The rank of a 
non-zero matrix A is then defined as the minimum number of rank-one matrices 
that sum to A. (By convention, the zero matrix has rank 0.) We leave as an exercise 
the verification that this definition of rank 1s the same as the usual one, namely the 
dimension of the row space of A. Incidentally, our definition of rank makes it 
quite obvious that the row and column ranks of A are equal. This viewpoint 
originated in multilinear algebra [6, p. 38] when defining the rank of symmetric 
tensors. 


THEOREM 1. Let M be ann X n tournament matrix. Then the rank of M over any 
field is at least (n — 1)/2. 


Proof. The rank of J—J is always at least n — 1. Indeed, det(J — J) is 
+(n — 1), but it also contains the (J — /)-matrix of order (n — 1) as a principal 
submatrix, which has determinant +(n — 2). Thus, even if the characteristic of F 
divides (n — 1), J —J still has a non-vanishing minor of order (n — 1); so 
rank(J — I) >n — 1. The theorem now follows from equation (1). Indeed, if M 
were expressible as a sum of fewer than (m — 1)/2 rank-one matrices, then so 
would M‘, and so by (1) J — J would have rank less than (nm — 1), a contradiction. 

| 


We say that a matrix M is regular if it has constant row and column sums. Thus 
M is regular if and only if MJ = JM = kJ, for some k. If M is a regular n Xn 
tournament matrix, then clearly k = (mn — 1)/2, and n must be odd. 


THEOREM 2. Let M be ann Xn regular tournament matrix. Then the rank of M 
over any field of characteristic not dividing (n — 1)/2 is at least (n + 1)/2. 


Proof. Suppose that rank(M) = (n — 1)/2 = k, so that M = u,vj + +++ +u,v, 
for some rank-one matrices u,v;. Then u,,...,u, span the column space of M, 
and v,,...,V, span the column space of M’. Thus by (1), these 2k vectors 
altogether span the column space of J — J, which has dimension (n — 1) = 2k so 
these vectors are independent. However, since M is regular, the column spaces of 
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M and M‘ have a common 1-dimensional subspace, namely the one spanned by 
the vector (1, 1,..., 1)’, which is a contradiction. & 


Here is an application of Theorem 2. Let g be a prime-power congruent to 3 
modulo 4. The Paley tournament matrix M(q) is the g X gq matrix with rows and 
columns labelled with the elements of the finite field GF(q), with the (, /)-entry 
equal to 1 if i —j is a non-zero square in GF(q), and 0 otherwise. Lander [5], p. 
138, uses a bit of character theory to show that rank(M(q)) > (q + 1)/2. Since 
M(q) is a regular tournament matrix, we get a simpler proof using Theorem 2. 
Lander further notes that if p is a prime divisor of (q + 1)/4, then the rank of 
M(q) over GF(p) is at most (gq + 1)/2, so Theorem 2 is in general best possible. 
However, Theorems 1 and 2 can be improved upon if the underlying field has 
characteristic zero. 


THEOREM 3. Let M be ann X n tournament matrix. Then the rank of M over any 
field of characteristic zero is at least (n — 1). 


Proof. \t suffices to prove this for the field of real numbers (for then M will 
have a submatrix of order n — 1 of non-zero determinant, and this will be 
non-zero over any field of characteristic zero). We want to show that the kernel of 
M has dimension at most 1. If Mx = 0 for some vector x, then (1) implies that 
x'(J — I)x = 0. We claim that the set of vectors satisfying this last equation 
contains no 2-dimensional subspace, which implies that dim(ker(M )) < 1. 

Note that J —J has rank n and index 1, so we may work instead with the 
equivalent quadratic equation (here x‘ = (x,, X5,..., X,,)) 


Xp HxXZ Hc +Hx2. (2) 


n 


Let S be the set of solutions to (2); we will show that it does not contain a 
2-dimensional subspace W. (This is an elementary property of the isotropic cone of 
Minkowski n-space; see Efimov [4], section 7.2, for an introduction to Minkowski 
space.) Let x and y be basis vectors for W. Then ax + by € W C'S for all scalars 
a and b, so ax + by satisfies (2). We easily derive from this the relation 


X1Vy =X2V_ TF  TXyAn. (3) 
Now applying (2), (3) and the Schwarz inequality, we obtain 
XTY{ =|X2y2 + 0° +x 
< (xy +++ +xn)(y2 + °° Hye) 
= xiYT- 
Thus, equality holds in the Schwarz inequality, and so (y,,..., y,) = C(%5,..., X,)5 
for some scalar c. Using (2) we easily deduce that y? = c’x?, so y,; = +cx,. Note 
that we cannot have y, = —cx;,: otherwise, using (3) we obtain —cx? = cx{, and so 
either c = 0 (in which case y is the zero vector, which is impossible), or x, = 0 
(and so by (2) x is the zero vector, which is impossible). Thus y, = cx, and so 
y = cx, a final contradiction. & 


An entirely different proof of Theorem 3, due to D. A. Gregory, appears in [3]. 
N. J. Pullman (personal communication) has obtained the following generalization 
of Theorem 3. If an eigenvalue of a tournament matrix has real part not equal to 
—1/2, then its geometric multiplicity equals one. 
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THEOREM 4. Let M be ann Xn regular tournament matrix with n > 3. Then M is 
nonsingular, provided the underlying field F has characteristic zero. 


Proof. Suppose Mx = 0. Then by (1), 0 = x'(J — I)x = x"Jx — x'x. Moreover, 
kJIx = JMx = 0, so x‘Jx = 0 and we get 0 = x'x =x? +--+ +x?. Since F may be 
assumed to be the real field, we conclude that x is the zero vector. & 


Theorem 4 was first proved by Brauer and Gentry [2], by a different method. An 
interesting application of this theorem to a graph decomposition problem is given 
in [3]. In conclusion, we mention another interesting property of regular tourna- 
ment matrices: within the class of tournament matrices, they coincide with the 
normal matrices, i.e. those satisfying MM‘ = M'‘'M; cf. [2; thm. 2]. Indeed, since 
MM' = M(J —I-—M) and M'M =(J —I-—M)M, MM‘ =M'"M i¢f and only if 
MJ = JM, that is if and only if M is regular. 
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Convergence-Preserving Functions: An Alternative Discussion 


ARTHUR SMITH 
Department of Mathematics and Statistics, Carleton University, Ottawa, Ontario K1S 586, Canada 


In a recent article in this publication [1] a discussion was presented on 
convergence-preserving (real) functions f, ones for which Ya, convergent implies 
» f(a,,) convergent. It follows necessarily that f(x) must be a constant multiple of 
x in some neighbourhood of the origin. 

This problem has been around for some time and its discussion in the MONTHLY 
reminded me of the time I stumped some colleagues by asking for an example of 
sequence {a,} such that Ya, converges but Y sin a, does not. 

Below I give a proof that convergence-preserving (CP) functions must be of the 
form described above, which seems somewhat shorter than that given previously 
and does not require considering the differentiability of such functions. 

At the end I solve the specific problem posed above which the reader may like 
to try before she /he reads on. 

First, I show f is finitely additive in some neighbourhood of the origin. 

Suppose f is CP and let 


g(x,y) =f(xty) +f(—-x) +f(-y). 
Assume for the moment there is no neighbourhood of the origin in which 
g(x, y) = 0. Then there are null sequences {x,},{y,} with g(x,, y,) # 0. Let [ ] 
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denote “integer part of’ and let 


~1 
M,, = Hen. n) | + 1, 
and consider the series S below, whose sum to M terms is Sj: 
(x, FY) — XV t (4 FY) Hem Vy tr FOX +1) HH YY 


+(X2- V2) —X_— Vy $0 +( 4. 4+ ¥2) — Xp -— Vg +++ +( 4, + Y,) 
—X, —yY, + my 


where the group of terms (x, + y,) —x, —y, is repeated M, times. The series 
converges to zero since $3, = 0,83,,, =X, + Y,,53,42 =y, for some n = n(k) 
where n(k) ~ ~ as k > o, 

Rewrite S as Ya,,. Then © f(a,,) converges, and so by grouping the terms we 
see © M,g(x,, y,) converges. This is impossible since |M,,g(x,, y,)| > 1 by con- 
struction. 

Thus g(x, y) = 0 in some neighbourhood of the origin, i.e. 


f(xty)+f(-x) +f(-y) =0 (*) 
for |x| + |y| sufficiently small. 
Put x =y =0 in (*) to show f(0) = 0 (which can be proved directly by 
considering the convergent series all of whose terms are zero). Now put y = 0 in 
(«) to show f is odd so that 


f(x +y) =f(x) + fly) (* *) 
in some neighbourhood WN of zero. This condition together with continuity of f at 
zero is sufficient to show f(x) = kx for |x| sufficiently small and k a constant. 

Continuity of f at 0 is established as follows. 
Consider any null sequence {a,} and form the convergent series 
QAa,— a, +aA,—a,7°°' +ta,—-a, + °°°. 
Since f is odd it follows that 


f(a,) —f(a,) + f(a.) — f(a.) +++: +f(a,) —f(a,) + °°: 


converges, so f(a,,) - 0 as a, — 0, and hence f(x) — 0 as x — 0 and continuity 
at 0 is established since f(0) = 0. 

Continuity in N now follows by letting x — —y in (« *). Let xg be a nonzero 
point of N. It is easy to show f(x,/n) = (1/n)f(x,) for any nonzero integer n and 
hence prove that f(rx,) = rf(x,) for any rational r with |r| < 1. The continuity of 
f now implies that f(x) = kx, x € [—x9, xo], where k = f(xq)/Xo. 

To find a sequence {a,} such that L a, converges but » sin a, does not, we use a 
construction similar to that used in deducing the contradiction in the proof above. 

Put x, =y, =1/n in the above proof and note g(x,, y,) = sin(2/n) — 
2sin(1/n) # 0 since, in fact, g(x,, y,) < 0 for each n. Define M, as in the general 
proof (or, more simply, M,, = n° since n°g(1/n,1/n) > —1 # 0). Then 


1 1 
1+1-2+::-+-+2-1+-:: 
2 2 
1 1 2 
a a ce 
n n n 
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where each group 1/n + 1/n — 2/n is repeated M,, times, converges while the 
sum of the sines of the terms does not. 


I am obliged to the referee for suggesting an example like the one above which mirrors the general 
construction more closely than my original example. 
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Crossing the Square 


KEITH WHITTINGTON 
Department of Mathematics, University of the Pacific, Stockton, CA 95211 


This note gives a counterexample to the conjecture in [1] that there are no 
“level crossings” of the square, and then concludes by indicating that in two 
important ways, this example is as simple as possible. 

A crossing of the square S = [0,1] x [0,1] is a pair |X, Y] of disjoint, connected 
subsets of S, each containing two opposite corners. The crossing is level if X U Y 
does not contain disjoint, connected sets E and F such that each contains two 
adjacent corners of S. 

Our example, depicted in FiGure 1, consists of the sets 


X={p} U{p'} VA,UB,UA,UB,U°::: 
Y=CUB,VUA,UB,UA,U°::. 


The crossing [X, Y ] is indeed level, but in fact, there are no two nondegenerate, 
disjoint, connected sets E and F in X UY such that p € E and p’ € F. 
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A crossing of the square S = [0,1] x [0, 1] is a pair LX, Y] of disjoint, connected 
subsets of S, each containing two opposite corners. The crossing is level if X U Y 
does not contain disjoint, connected sets E and F such that each contains two 
adjacent corners of S. 

Our example, depicted in FiGure 1, consists of the sets 


X={p} U{p'} VA,UB,UA,UB,U°:: 
Y=CUB,VA,UB,UA,U°:::. 


The crossing [.X, Y] is indeed level, but in fact, there are no two nondegenerate, 
disjoint, connected sets E and F in X U Y such that p € E and p’ € F. 
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Suppose such sets EF and F exist. Then E must meet A’s of arbitrarily large 
odd subscript, and F must meet B’s of arbitrarily large even subscript. 

If x € A, and y € B,, with i > 1, then {x, y} separates all of the A’s and B’s of 
subscript less than 7 from those of subscript greater than 7. In fact, as depicted in 
FIGURE 2, an arc can be passed through the holes created by removing x and y, 
which separates the sets of small subscript from those of large. 


~~ -— = eo 


\ 

| 

\ 
J 
| 


Nz... 


Fic. 2 


Consequently, if & meets A, or B,, then for each i >/j, either A; CE or 
B, C E. The same holds true for F. 

Therefore, there is a positive integer N such that for i > N, one of A, or B; is 
in F& and the other is in F. 

It is also clear that for each i, the set A; UA,;,, separates B; from B,,, in 
X UY. Therefore, when i > N, neither set E nor F can contain both A, and 
A,,,, for the other set would then be a connected subset of XY U Y) — (A; UA;,,) 
which contains B; U B,,,. Of course, this also shows that neither set E nor F can 
contain both B, and B,,.,. 

Since F meets A’s of arbitrarily large odd subscript, we can now conclude that 
there is an integer n such that E contains 


Agns1 U Bons2 UAon43 U Bangg Ur’ 


This contradicts the fact that / meets B’s of arbitrarily large even subscript. 

The author has recently shown [2] that for a crossing [X,Y] to be level, one of 
the sets X or Y must have infinitely many constituents (maximal continuumwise 
connected subsets). It is also shown that if each of the sets X and Y contains a set 
which is irreducibly connected between its two corners, then the crossing is not 
level. 

In our example, the sets X and Y each have a countable infinitude of 
constituents. Also, Y is irreducibly connected between gq and q’ (there is no 
smaller connected set containing both of these points), but X does not contain any 
set which is irreducibly connected between p and p’. 
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Lines of Best Fit by Graphics and the Wald Line 


R. J. DUFFIN 
Department of Mathematics, Carnegie-Mellon University, Pittsburgh, PA 15213 


I. Introduction. Here is an old, important and somewhat controversial ques- 
tion: Given a set S of n points in the (x, y) plane and given a straight line then 
what is the line’s disparity? Of course this question is ambiguous on several counts. 
But as a first approach define ‘‘disparity” to be the greatest distance that a point 
of the set is from the line. Let us denote this distance as D. An example set S is 
shown in FiGuRE 1. 


Fic. 1. A set of 31 points to be fitted by a line. 


Then another question arises: If the line can vary what is the minimum value of 
the distance D? Let us denote this minimum as D%*. Let a line of best D fit serve 
as the midline of an infinite strip of width 2 D*. Then it is clear that this strip must 
contain every point of the set S. Next consider any other infinite strip which 
contains the set S. If its width is W the midline of this strip will have disparity 
D<W/2. Thus W>2D*. The problem of finding D* is thereby reduced to 
finding the thinnest strip containing the points S. 


II. An algorithm to minimize the distance disparity. A graphical algorithm 
begins by plotting the n points of S in the (x, y) plane as shown in FicureE 1. The 
points of S are contained in a convex polygon whose vertices are all points of S. 


Dedicated to a master educator, L. A. Karlovitz (1936-1990). 
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Fic. 2. The fence around the 31 points. 


FIGURE 2 shows such a polygonal fence constructed for the set of 31 points shown 
in FiGurE 1. This polygon has only 6 vertex points. 

It is geometrically obvious that no matter where the given line is drawn, there 
will be a vertex point on the fence which is at a greater distance from the line than 
any point in the interior of the polygon. For this reason the interior points are 
discarded. Like empty cans they have served their purpose and are put in the trash 
barrel. This graphical device greatly simplifies the calculations to be made. 

Next a straight line is passed through each edge of the fence polygon. For 
example, Figure 2 shows a line AA’ passing through edge (1,2). Then the vertex 
of the polygon is selected which is at the greatest distance from the line AA’. In 
FIGURE 2 this is seen to be the vertex 5. Then a line BB’ is passed through vertex 5 
and parallel to AA’. The strip formed by the parallel lines AA’ and BB’ contains 
all the points of the polygon. 

Such a strip is constructed for each edge of the polygon. Each strip has a width. 
Then a strip with the minimum width W is selected. By examining FIGURE 2, your 
eye should detect that the edge (1, 2) generates the strip of minimum width W. The 
other edges are seen to generate strips which are wider. 

Then a line CC’ is drawn parallel to the lines AA’ and BB’ but half way 
between them. By using Lemma 1 to follow it is seen that D* = W/2 and that the 
midline CC’ is an optimum line for best D fit. 


LEMMA 1. Suppose an infinite strip contains a prescribed convex polygon, and that 
its width is minimal among all such strips with this property. Then each edge of such a 
strip must contain at least one vertex of the polygon and one of the edges must 
contain a side of the polygon. 


Proof. If an edge does not contain a vertex the edge can be moved closer to the 
other edge and the width will be decreased. Thus it can be supposed that each 
edge contains at least one vertex. Suppose that each edge contains exactly one 
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vertex of the polygon. Then rotating the strip slightly about the midpoint of these 
two vertex points results in a strip of the same width which contains a polygon in 
its interior. This is a contradiction. Thus one edge must contain a side. 


III. An algorithm to minimize the vertical disparity. Now consider a second 
approach to the original question. Instead of taking the disparity of line and point 
as the distance between them we take it to be the vertical distance between them. 
Thus the disparity of a line to a set of points is the greatest vertical distance of the 
points to the line. Let this distance be denoted as V. Fortunately, the value of V is 
simply related to the value of D. Thus by geometry 


V = Dyj(1 +m’), (1) 


where m is the slope of the line. For example consider the V of the line CC’ in 
FiGurRE 2. The width of the strip formed from the lines AA’ and BB’ is W. This 
strip intercepts the y-axis in a line segment AB having length |AB| = W/(1 + m7) 
and consequently 


V =|AB|/2. (2) 


If the line is permitted to vary let the minimum V be denoted as V*. Then (2) 
gives 


V* = Minimum |AB|/2, (3) 


where the minimum is taken over all strips formed from the edges of the fence 
polygon. This is a consequence of Lemma 2 which follows. Formula (3) is readily 
evaluated graphically. From FiGure 2, your eye should detect that line CC’ is also 
the best V fit, and so V* = |AB|/2. 


LEMMA 2. Suppose an infinite strip contains a prescribed convex polygon, and that 
the length | AB| of its y-intercept is minimal among all strips with this property. Then 
each edge of such a strip must contain at least one vertex of the polygon. Moreover 
there is one such strip with an edge containing a side of the polygon. 


Proof. Otherwise each edge of a strip of minimum | AB| has exactly one vertex 
point. Without loss of generality it can be assumed that the lower edge contains 
the vertex (0, 0) and the upper edge contains the vertex (x’, y’). Any new line going 
through the point (x’, y’) and having a small difference in angle could serve as an 
upper edge for a strip containing the whole polygon. If x’ # 0 then it is obvious 
that the intercept on the y axis can be decreased. This contradiction shows that 
x' = 0. In that case the angle of the new edge does not change the y-intercept. 
Clearly it can be chosen so that one of the edges contains a side of the polygon. 


IV. Relation to linear programs. It is easy to see that the following linear 
program yields the minimum vertical disparity for a line, y = mx + b. 


Program 1. Seek V* = min V subject to 
—V<em,+b—-y,<V, L=1,...,n. 


Thus if S has 31 points there are 62 inequality constraints. 


838 R. J. DUFFIN [November 


Linus Schrage treats an example problem of fitting a line to five points using 
linear programming in his business school text book [1]. Our graphic solution to his 
problem is obvious without need to use the simplex algorithm on a linear program 
with ten inequality constraints. 

Now consider the minimum distance D*. Relation (1) above, gives 


Program 2. Seek D* = min D subject to 
Dy(1 +m?) <mx,+b-—y,;<DjA+m’*),  i=1,...,n. 


This is neither a linear program nor a quadratic program. So graphics may be the 
handiest answer to the original question. 


V. Is the method of least squares a virtue or a vice? Consider the problem 
fitting a line to experimental data points: (x,, y;), (>, y),...,(x,, y,,) in order to 
determine an underlying physical law, Y = mX + b. The trouble is that the data 
can have experimental errors. A common procedure is to minimize a mean of all 
vertical disparities. For example, the mean can be the L, norm of the disparities. 
For p = 2 Gauss found that explicit formulae are readily available. The p = 2 
method is called least squares. 

The genius of Gauss notwithstanding, least squares has had difficulty with 
“outliers”. Thus quoting from a book by expert statisticians, [2, page 129], “A wild 
data point can easily seize control of the fitted-line and cause it to give a totally 
misleading summary of the relationship between y and x.” In his book, Linus 
Schrage [1] compares the L,, the L, and the L,, line fitting methods. (L,, gives 
Program 1.) Schrage argues that the L, norm with linear programming is better 
suited to handle outliers than the traditional least squares method. 


VI. The Wald center of gravity method. By far the simplest method to fit a line 
to experimental data is the Wald formula advocated by the distinguished statisti- 
cian, Abraham Wald [2], [3]. His method requires no minimization. It involves only 
concepts which can be explained to elementary students and made: 

(1) understandable, (2) reasonable and (3) easy to calculate. Here is a scenario 
of such a lesson: 

Students please imagine that there are n points marked on a drawing board, say 
(X,, Y,),...,(X,, ¥,). It is known that these points all lie on a straight line, 
however the numerical values of the coordinates are not known. So you ask a 
skilled technician to measure the coordinates of these n points. He gives you the 
following results of his measurements: (x,, y,),..-,(%,, y,). Then how do you find 
the equation of the line? 

Of all curves, only the straight line has the property that the center of gravity of 
any points on the curve is also on the curve. Let 


X'=(X,+ °°: +X,)/n and Y'=(Y,+°:: +Y,)/n 


so the point CX’, Y’) is on the line. Likewise the center of gravity (x', y’) of the 
experimental data is defined by 


x'= (xX, + °°: +x,)/n and y’=(x,4+ °°: +x,,)/n. 


Your technician uses a measurement procedure with the virtue that a negative 
error is just as probable as a positive error. Then for large n it is to be expected 
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that the negative errors will tend to cancel the positive errors so as n becomes 
large 


X'—x' and Y’'-—y’ both become small. 


Thus the point (x’, y’) is a good estimator of the point (X’, Y’). You can use it as 
the ‘center point” of your approximate line. (A least squarer would have to agree to 
this point!) 

However, it takes two points to determine a line so two estimators are needed. 
Then two, seemingly conflicting, requirements arise. On the one hand all the 
points should be used because that is how the errors can cancel. On the other 
hand the two estimator points should be far apart because the true line and the 
approximate line will stay close between the estimator points. 

To meet these requirements start over. Suppose that actually there were 2n 
points on the line. Then split the (x, y) measurements into two political parties: 
“leftists” and ‘“‘rightists.”” This reordering makes 


X1,Xz,...,X, alllessthan 4X,15,...,Xo,- 


n 


Then let 
X" = (X,4, + ve +X,,)/n and Y” = (Y,41, + oe +Y,,)/n. 


Thus (X”, Y”) is a point on the line and its estimator (x”", y") is so defined. 

The point (x’, y’) is the center of gravity of the Leftists and (x”, y”) is the center 
of gravity of the Rightists. The line which goes through these two points is the 
Wald Line. Then, of course, 


(y—y) / (x -x’) = (y’-y) 7 (*" - x’) 


is the Wald Formula. That, in essence, is all there is to it. 
As homework problems for next time: Show that 


(1) The center of gravity of all the data points is on the Wald line. 

(2) The disparity between the Wald line and the true line in the interval 
(x', x") does not exceed the disparity at the estimator points. 

(3) If the x values of the data points are distributed rather uniformly in the 
whole interval then the disparity at any point does not exceed twice the 
disparity at the estimator points. 


The data enter linearly in the Wald formula and not quadratically as in the 
traditional least squares formula. Thus the Wald line is much less disturbed by 
wild data points than the least squares line. However least squarers are probably 
not familiar with the Wald Line. When they are told about it their first question is, 
“But what does it minimize?” 
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On the Mean Value Theorem, Inequality, and Inclusion 


M. Furi 
Dipartimento di Matematica Applicata, Universita di Firenze, Via S. Marta 3, 50139 Firenze, Italy 


M. MARTELLI 
Mathematics Department, California State University, Fullerton, CA 92634 


The Mean Value or Lagrange’s* Theorem and its consequences are one of the 
fundamental topics of a calculus course. The theorem is usually presented in the 
following form. 


THEOREM 1. Let f:[a,b]— R be continuous on [a,b] and differentiable on 
(a, b). Then there exists c € (a, b) such that 


f(b) — f(a) =f'(c)(b — a). (1) 
The result is frequently derived from the following theorem, which is credited to 
Rolle’. 


THEOREM 2. Let f:[a,b]— R be continuous on [a,b], differentiable on (a, b) 
and such that f(a) = f(b). Then there exists c € (a, b) such that 


fi(c) = 0. (2) 


The simplest way of obtaining (1) from (2) is to start from the desired result 


b) — f(a 
po) LOO 


The left side of this equality is the derivative at c of the function 


b) — fla 
x) <p(2) LOO, 


Since it can be easily seen that g(a) = g(b), the second part of the equality follows 
from Rolle’s Theorem. Moreover (2) is easily derived from (1) when f(a) = f(b). 
Hence the two results are equivalent. 

At the level of advanced calculus, students find that the Mean Value Theorem 
is not valid in the simple form expressed by (1), for vector-valued maps. One of the 
simplest examples to show this is provided by f(t) = (cost,sint). Since f(O) = 


*Joseph Louis Lagrange attempted a reorganization of calculus by representing functions with their 
Taylor’s series expansions. The Mean Value Theorem is frequently attributed to him, particularly in 
European textbooks. However the first statement of the theorem appears in a paper [2] of the 
renowned physicists André-Marie Ampére (1775-1836). In the paper Ampére uses the ideas of 
Lagrange to justify the result. The derivative is assumed to be continuous. About fifteen years later 
Augustin-Louis Cauchy (1789-1857) proved the theorem with the same hypothesis (see [3, p. 113)). 

It is worth mentioning that in the same paper Ampére claims to have proved a result that today we 
know to be false, namely that a continuous function is differentiable except possibly at finitely many 
exceptional points. The stature of the man who felt so strongly about this false result to the point of 
proving it, tells us that scientists have fought an uphill battle to bring calculus to today’s form. 

‘Michel Rolle (1652-1719) was a member of the French Academy and engaged in a heated debate 
on the foundations of “infinitesimal methods,” saying that these methods led to paralogisms. He was 
rebutted by John Bernoulli who maintained that Rolle did not understand the calculus. 
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f(27) and 


| f’(t) || = Vsin? t + cost = 1 


it is obvious that (1) is violated. 

At this point many authors prove that Lagrange’s Theorem can be extended to 
scalar functions of vector variables (see, e.g., [5, p. 174]). A few authors go ahead 
and present a result called the Mean Value Inequality (see, e.g., [4, p. 379]). To 
state the result as simply as possible we shall use the symbol [x, y] (Ix, y[) to denote 
the elements of the closed (open) line segments joining x with y. Moreover we shall 
use the Euclidean norm in R”. 


THEOREM 3. Let U C R” be open and f:U — R” be differentiable on U with 
continuous derivative. Let x,y € U be such that [x,y] c U. Then 


ll f(y) — f(x) || < max{]] f’(©) Il: ¢ © [x,y] }lly — xi, (3) 
where || f’(c)|l = max{ll f’(c) x|}: |xll = 1. 


A comparison between the Mean Value Inequality and Lagrange’s Theorem 
shows that the Inequality, as stated above, is based on assumptions a bit stricter 
than the ones used in the Theorem, although the conclusion is not as sharp. 
Theorem 3 is frequently proved by using, among other things, the Fundamental 
Theorem of Integral Calculus (see [1, p. 7; 4, p. 379]).* 

Our goal in this paper is to show that the Mean Value Inequality can be 
established in a more elementary manner, and under assumptions that are weaker 
than the ones used in the Mean Value Theorem. Moreover, we shall obtain a 
result, called the Mean Value Inclusion, which can be regarded as the form of 
Lagrange’s Theorem [the Mean Value Theorem] in dimension higher than 1. 

In our final remarks, we shall point out that the differentiability of f can be 
replaced by a less restrictive condition, called Gateaux differentiability. The two 
notions coincide in R, but differ in higher dimension. 


Consequences of the Mean Value Theorem. The following two corollaries of the 
Mean Value Theorem are usually included in textbooks. We mention them here 
because, as we shall see, the Mean Value Inequality provides their extension to 
vector functions. 


Coro.iary 1. Let f:[a,b] — R be continuous on [a, b\ and differentiable on 
(a,b). Assume that f'(x) = 0 for all x € (a, b). Then f is constant. 


CoROLLARY 2. Let IC R be an interval and f: I > R be continuous on I and 
differentiable on the interior of I. Assume that f' is bounded. Then f is uniformly 
continuous on I. 


*W. Rudin, in the first and second edition of [8] gives a different, more elaborate proof of a special 
case of (3). A shorter proof was presented by V. P. Havin, when he translated the book into Russian 
and it was added to the third edition of Rudin’s book. Our proof is along the lines of the one of Havin, 
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To prove Corollary 1 we choose any point x € (a, b] and apply the Mean Value 
Theorem to f on [a, x]. We obtain 


f(x) — f(a) =f'(c)(x — a) 
for some c € (a, x). Since f’(c) = 0, it follows that f(x) = f(a). 
To establish Corollary 2 we select any pair of points x, y € J, x < y. From the 
Mean Value Theorem applied to f on[x, y] and the boundedness of f’, we obtain 


If(y) —f(*)| < My -x| 
for some constant M > 0. This inequality implies the uniform continuity of f. 
We now present two more consequences of the Mean Value Theorem. They are 
not included in most textbooks. The reason for having them here is to simplify the 
proof of the Mean Value Inequality and of the Mean Value Inclusion. One could 
argue that the two results have also some interest on their own. 


Coro.iary 3. Let f:[a,b] > R be continuous on [a, b] and differentiable on 
(a,b) except possibly at finitely many points. Then there exists c € (a, b) such that 


f(b) — f(a)| <I f’(e)|(b — a). (4) 


Proof. Assume that there is only one point d € (a, b) where f is not differen- 
tiable. By applying the Mean Value Theorem to f on [a, d] and [d, b] respectively 
we obtain 


f(d) —flay=fFlay(d- 4), co € (4,4), 
f(b) —f(d) =fi(er2)(b-d), 2 € (d,d). 


From these equalities we derive 


|f(b) — fla)l <i f'(e)|(b — 2) 


where 


If'(¢)| = max{| f’(c;) f'(c2) |}. 
This proof can obviously be extended to the case when f is not differentiable at 
more than one point. O 


? 


CoroLiary 4. Let f:[a,b] - R be continuous on [a, b| and differentiable on 
(a,b) except possibly at a finite number, n, of points. Then there are n + 1 points 
a<c,<c,< +++ <c¢,,,<bandn + 1 positive numbers a,,...,@,4., Such that 


n+1 


La; = 1 


i=1 
and 


n+1 


f(b) — f(a) = ¥ a f'(c))(b = a). (5) 


(= 


Proof. Assume that there is only one point d € (a, b) where f is not differen- 
tiable. By applying the Mean Value Theorem to f on [a, d] and [d, b] respectively, 
we obtain 


f(a) — fla) = f'(ei)(4 — a) 
f(b) — fd) = fi(c2)(b — d) 
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for some c, € (a, d), c, € (d, b). Let a@,, a, be such that 
a(b-—a)=d-a 
a,(b—a)=b-—d. 
Then a, + a, = 1, a, > 0, a, > 0 and 


f(b) — f(a) = (a, f'(e1) + ao f'(c2))(b - a). 
This proof can obviously be extended to the case when f is not differentiable at 


n > 1 points. 


The Mean Value Inequality and Inclusion. We now present the Mean Value 
Inequality in a form that is more general than the one given in Theorem 3. We also 
include a particular form of the Mean Value Inclusion (Theorem 5). For an 
extensive analysis of this last result the reader should consult [6]. To simplify both 
statements we introduce the following: 


Definition. A subset V of R” is linearly finite if for every pair of points x,y € R” 
the set | 
V2 [x,y] 
is finite. 
THEOREM 4 (Mean Value Inequality). Let D be a disk in R” and f: D > R™ be 
continuous on D. Assume that f is differentiable on the interior, B, of D, except 


possibly on a linearly finite subset of B. Let x,y © D. Then there exists ¢ €]x, yl such 
that 


f(y) — FO) I <I f'(e) Illy — xl. (6) 
Recall that the convex closure of A, coA, where A is any subset of R”, is the 


closure of the set of all convex linear combinations of elements of A. 


THEOREM 5. (Mean Value Inclusion). Let D be a disk in R” and f: D > R”™ be 
continuous on D. Assume that f is differentiable on the interior, B, of D, except 
possibly on a linearly finite subset V of B. Let x,y = D. Then 


f(y) — f(x) € 004A, (7) 
where A = (f'(eXy — x): cE VO 1 Ix, yb. 


For completeness we shall state first, without proofs, all results needed to 
establish Theorems 4 and 5. 


Cauchy—Schwarz Inequality (see [4, p. 20]). Let u, v be two vectors of R” and 
denote by (u,v) their inner product. Then the well-known Cauchy—Schwarz 
inequality states that 


Iu, v)| < [lull il (8) 


Differentiability. Let U Cc R” be an open set, f:U — R” and c € U. We say 
that f is differentiable at c if there exists a linear operator L:R” — R” such that 


f(e + h) =f(c) + Lh + o(h), (9) 

where 
_o(h) — 
mio Th) 
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The linear operator L is usually denoted by f’(c) and, if we define 
| ZI] = max{||Zx\l: |x|] = 1} 
then 
Lx < I ZLIl [xl (10) 

for all x € R” (see [8, p. 208]). 

Chain rule. Let f:U C R” > R” and g:[a,b] — U be differentiable on the 
open set U and on the open interval (a,b) respectively. Then the map h: 
[a,b] > R” defined by h(t) = f(g(t)) is differentiable on (a, b) and its derivative 


at t € (a,b) can be identified with the vector obtained by applying the linear 
operator f’(g(t)) to the vector g’(t), i.e., 


h(t) = f'(8(t))(8'(t)). (11) 
Moreover, for every fixed vector z € R”, the derivative of the map k:[a,b] > R, 
defined by k(t) = <z, h(t)) is 


k'(t) = (a, h'(t)>. (12) 
Inequalities (8), (10) and equalities (11), (12) are used, in conjunction with 
Theorem 1 and with the Fundamental Theorem of Integral Calculus, to establish 


Theorem 3 (see [1, p. 7]; [4, p. 379]). Our proof of Theorem 4, which is more 
general than Theorem 3 uses only (8), (10), (11), (12), and Corollary 1. 


A separation theorem. We complete our list with a separation theorem needed 
in the proof of the Mean Value Inclusion. We shall use it in its simplest form. 


THEOREM 6. (Separation Theorem). Let C be a convex closed subset of R™” and 
let x € R”, x € C, be a fixed vector. Then there exists a vector w & R™, and a real 
number a such that 


(w,x) <a<iw,y) forally eC. (13) 


From a geometrical point of view, the above theorem says that a closed, convex 
subset C of R” and a vector x ¢ C, can be separated by a hyperplane, i.e., there 
exists a hyperplane H such that C cannot be reached from x without crossing H 
(see [7, p. 99). 


Proof of the Mean Value Inequality and of the Mean Value Inclusion. We are 
now ready to prove Theorems 4 and 5. 


Proof of Theorem 4. Let z be a fixed vector of R”, which will be specified later. 
Define 


k:[0,1] >R_ by k(t) = (z, f((1 — t)x + ty)). 
It is easily seen that k satisfies all assumptions of Corollary 3. Hence, by (4), 
|k(1) — k(0)| <|k'(to)]- 
for some ft, € (0, 1). Since 
k(1) — k(0) = «2, f(y) —f(x)> and k'(to) = ¢z, f’(e)(y — x)? 
with ec = (1 — t))x + toy, we obtain, using inequalities (8) and (10) 


Kz, f(y) — f(x)>1 < Ilzll I f’Ce) Hl tly — xtl. (14) 
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We can assume f(y) # f(x), for, otherwise inequality (6) is obvious. By selecting 


_ fly) — f(x) 
f(y) — FOOT 
and substituting it into (14), we obtain 
f(y) — FOI < I F'Cod Il lly — xi. Oo 


The Mean Value Inequality allows us to extend Corollaries 1 and 2 to vector- 
valued functions. The changes needed in the statements and in the proofs are so 
minor that we feel unnecessary to explicitly include them here. 

We now prove Theorem 5 following the pattern of the proof of Theorem 4. 


Proof of Theorem 5. Assume that 


f(¥) — f(x) € 0A. (15) 


Then, by Theorem 6, there exists a vector w € R” and a real number a such 
that 


(w, f(y) — f(x)) <4 < (w,2) (16) 


for all z € coA. 
Assume that there are n points of the open line segment joining x with y, where 
f is not differentiable. Then Corollary 4 can be applied to k:[0, 1] — R defined by 


k(t) = (w, f((1 — t)x + ty)>. 
We obtain, by (5), 


n+1 


k(1) — k(0) = )) a,k'(t;). 


1=1 


Hence 


iw. f(a) =f) = (Eafe) 


where c,; = (1 — ¢t;)x + t;y. This last equality contradicts (15). O 


We have stated and proved Theorems 4 and 5 for vector-valued functions which 
are differentiable in the sense of (9), a property usually called Fréchet differentia- 
bility. The two results can be extended with no changes to functions f: D C R” > 
R” which satisfy a weaker definition of differentiability, called Gateaux differen- 
tiability. 

We shall say that f is Gateaux differentiable (see [1, p. 5]; [9, p. 11]) at an 
interior point x € D if there is a linear operator L:R” — R” such that 


fn LOT EB) =F) 


Lh 
t—0 t (17) 


for all h € R”. 


846 M. FURI AND M. MARTELLI [November 


While the composition of two functions that are Gateaux differentiable need 
not be Gateaux differentiable, it can be seen that (11) still holds when the Fréchet 
differentiability is replaced by the Gateaux differentiability. 

In fact let u = a(t) = (1 — t)x + ty, t © (0,1) and let fG(u) be the Gateaux 
derivative of f at u. Let s be such that t+ 5 <1. Then f(a(t + s)) = f(a(t) + 
s(y — x)) and 


a(t+s))- — sft — 
tim fae #9) = fla) = fey =) _ 5 as) 
s—0 S 
which shows that foa is Gateaux differentiable at t and its Gateaux derivative is 
fo(wly — x). 
The Gateaux differentiability is strictly more general than the Fréchet differen- 
tiability as the following example shows. 


Example 1. Let 


[24 x; 0,0 
f(x,y) = 4 x2 4+ y4 (x,y) # (0,0) 


0 (x,y) = (0,0) 


Then f is continuous at (0,0) and its Gateaux derivative at (0,0) is the zero linear 
operator. However f is not Fréchet differentiable at (0,0) as can be checked by 
approaching the origin along the parabola x = y”. 

Theorems 4 and 5 can be extended to infinite-dimensional Hilbert spaces and to 
Banach spaces. In a Hilbert space H the proofs given before for the finite 
dimensional setting do not need any changes. In a Banach space E, however, there 
is no inner product compatible with the norm, and one has to use the Hahn—Banach 
theorem to prove the existence of a functional ¢: E — R such that ||é|| = 1 and 


bf) — fly) = IfG@) — FDI 


Acknowledgments. We would like to thank the referee for many helpful suggestions, which have 
improved the presentation of this paper. 
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The “big oh” notation means that there is a constant C such that 


|D, f(x) — f'(x)|/h <C. 


In fact, if we set 
|D,f - fll = sup | D, f(*) — f'(x)|, 
then 
|D,f — f'll= O(A). 
Suppose now that only an approximate version of f, f,» having the form 
fy(t) = f(t) + n(t) 


is known, where 7 is a bounded, but not necessarily differentiable, function 
representing the error in the approximation. The approximation to the derivative 
of f is then given by 


Di f,(*) = D, f(x) + Dyn(*). (2) 


Since 7 is not necessarily differentiable, lim, _,,. D,n(x) does not necessarily exist. 
Even in the case when 7 is differentiable, it can happen, as we saw earlier, that 
this limit can be very large even when ||7|| is very small. 

How bad can things get? Suppose that ||n|| <«, where e > 0 is a positive 
number giving a bound for the error. Then from (1) and (2) we have 


ID, f, — fll < O(A) + IlD,qll 
< O(h) + - (3) 


Note that for fixed « > 0, the error bound blows up as h — 0. Nevertheless, the 
beast can, in a sense, be tamed or at least regulated. We simply cannot let h —> 0 
in an arbitrary manner, but must choose a strategy of relating h to e« in an 
appropriate way. In fact, we see that any choice of h(e) > 0 which satisfies 
h(e) > 0 and e/h(e) > 0 as € > 0 results in 


Dicey fn — f'| > 0 ase dO. 


The forward difference approximation method with such a choice of grid size 
h = h(e) is then said to provide a regular algorithm in the sense of Tikhonov [2] for 
the problem of differentiation. 

From (3) we see that a choice of grid size of the form h(e) = CVe (obtained by 
“balancing” the terms O(h) and 2¢/h in (3)) leads to a Ve order of approxima- 
tion: 


Dicey fn — f'll = O(ve ) 


for all perturbations 7 satisfying ||7|| < e. 

Is it possible to do better? Obviously if f is a linear function, then D, f = f' 
and hence a stronger o(ve ) order of approximation is attainable for any choice of 
grid size of the form h(e) = 7,0 <a < 1/2 (recall that m(e) = o(n(e)) means 
|m(e)/n(e)| > 0 as ¢ > 0). Our aim is to show that only in this trivial case is a o(Vve ) 
order of approximation attainable for all perturbations. The theorem that we prove 
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below is akin to a result of BajSanski and Bojani¢é [1] on the impossibility of 


ce 99 


improving a “O” Bernstein polynomial approximation to a ‘“‘o”’ approximation, 
except in the linear case. 


THEOREM. Suppose that for some h = h(e) > 0, Dye f, — fll = oWe) for 
every perturbation 1 with ||n|| < «. Then f is a linear function. 


Proof. We show that f’(x) = 0 for every x € J. By Taylor’s theorem, 


f(x th) = f(x) + fiCah + [6 (x +h = 8) dt. 
Therefore, 


D, f(x) — f(x) = R(A), 


where 


VL px+ 
Rn) = fF x + = 8) at 


Suppose that for some x € I, f"(x) = 26 > 0. Then f(t) > 6 > Ofort €[x,x + 
h], if h is sufficiently small. It follows that R(h) > 6h/2. Define a perturbation 
y(t) for t € I by 


n= (5 toe 
Then 
Dif,(*) —f'(*%) = RCA) + e/h > 6h/2 + E/h. 
It follows, since ||D,..)f, — f'll = o(ve ), that 
Sh(e) ve 


— + 

2 ve h(é) 
which is clearly impossible. A similar argument shows that it is not possible for 
f"(x) to be negative. Therefore f’(x) = 0 for all x € J, and the proof is complete. 


—-0 ase J, 


Similar results can be proved for other approximations to the derivative. The 
reader is invited to show, for example, that the centered difference approximation 
(D, f(x) + D_,f(x))/2 and the extrapolated forward difference approximation 
2D, f(x) — D,, f(x) for bounded functions with three continuous bounded deriva- 
tives on an open interval both lead, for appropriate choice of grid size, to a 
O(e?/) approximation in the presence of perturbations of size ¢. Moreover, this 
order of approximation cannot be improved to o(e7/*), unless f is a quadratic 
polynomial. 

Of course it is not necessary to limit oneself to the first derivative. Finite 
difference approximations to higher order derivatives in the presence of errors can 
be analyzed in a similar way. 

Finally, we give a numerical illustration of the choice of grid size and resulting 
order of convergence for the forward difference approximation of the derivative. 
The numerical values given below are the results of a simple program in which the 
perturbation 7(x) € [—«, €] is produced by a (uniform) random number generator 
and the approximate derivative D, f,, is compared with the true derivative at grid 
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points in the interval [0, 1]. The grid size is taken as h = h(e) = Ve and therefore 
the error satisfies 


ID, f, — f'll= O(ve). 


This behavior can be numerically validated by assuming that the error has the form 
E ~ Ce”. Taking ratios, the order of convergence p can then be estimated in 
terms of two calculated errors E, and E,, corresponding to two perturbation levels 
€, and €,, respectively, by 


In( E£,/E,) /\n(é,/é2). 
For nonlinear functions, we should expect an order p = 1/2 at best, according to 
the theorem. For example, if f(x) = (2x — 1)e*, then the errors calculated in the 


program and displayed in the output below can be used to estimate p (to two 
decimals) as shown in the table: 


7109080 .2339649 07577038 02447224 
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More on Cross Products 


PAUL BINDING* 
Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada T2N 1N4 


A recent article [1] in the MONTHLY contains several proofs of the equivalent 
identities 
(a X b) - (c X d) = (a: c)(b- d) — (a: d)(b- cc) (1) 
and 
(aX b) X c = (a-c)b — (b- c)a. (2) 


Since one motivation for this work is the difficulty of completing a particular 
geometrical argument for (2), it seems worthwhile to give a simple, direct geomet- 
rical proof. 

We adopt the geometrical definition that a X b L (i.e., is orthogonal to) a and 
b, according to the right-hand rule, with 


lla X bll = |lall |lb|| sin 6, (3) 
6 being the angle between a and b. It follows that 
bxXa=-axXb (4) 


*Research supported by NSERC of Canada. 
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(Miquel’s Theorem) that the circles AB’C’, A’BC’, A’B’'C intersect in a point Y. 
Prove that X = Y if and only if X is the orthocenter of A ABC. 


E 3467. Proposed by Todd Feil, Denison University, Granville, OH, and Gary 
Kennedy, Oberlin College, Oberlin, OH. 


A permutation 7 on the set {1,2,..., m} is said to have j as a strong fixed point 
if m(k) <j for k <j and w(k)>j for k>j. Let h(n) be the number of 
permutations on {1,2,..., } having at least one strong fixed point. Prove that 


2(n —1)!-(n - 2)!< h(n) < 2(n—- 1)! 
for n > 1. 


E 3468. Proposed by Curtis Cooper, Central Missouri State University, Warrens- 
burg, MO, Robert E. Kennedy, Central Missouri State University, and Stanley 
Rabinowitz, Westford, MA. 


Suppose m and n are positive integers such that all prime factors of m are 
larger than m. 
(a) Prove that 
$*intn( AE) = 2) — 0 (am) 
kat n 4m m }’ 
where * denotes summation over integers relatively prime to n. (Here @ and pw 


denote the arithmetic functions of Euler and Mobius respectively.) 
(b) Find a similar formula for 


¥. cos?" kar/n). 
k=1 


SOLUTIONS OF ELEMENTARY PROBLEMS 
A Bounded Binomial Experiment 


E 3332 [1989, 524]. Proposed by John Mamer, University of California at Los 
Angeles, and Kenneth Schilling, University of Michigan-Flint. 


Let X be a binomial random variable with parameters n and p. 
(a) Prove that 


E(min{ X, t}) > {1 — (1 —p/t)"} 


for t= 1,2,...,n. 
(b)* Is there an upper bound for E(min{X, t}) better than min(np, t)? 


Solution I of (a) independently by Wolfgang J. Buhler, Johannes Gutenberg 
University, Mainz, Germany, Glenn A. Stoops, Dayton, OH, and the proposers. An 
archer, whose probability of hitting a chosen target is p, performs the following 
experiment. For each of n arrows he selects one of ¢ targets at random and fires 
the arrow at it. For any fixed target, his probability of not hitting it with a 
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particular arrow is 1 —p/t and so his probability of not hitting it at all is 
(1 — p/t)”. Hence the expected number of targets hit is t{1 — (1 — p/t)”}. 

On the other hand, if targets already hit are excluded from consideration before 
each arrow is fired, the expected number of targets hit can only increase. If this 
modus operandi is followed, the probability of a new hit with any arrow is p unless 
all targets have already been hit. Thus the expected number of targets hit under 
this revised procedure is E(min{X, t}). Comparing the two expectations, we obtain 
the inequality given in (a). 


Solution II of (a) by K. F. Andersen, University of Alberta, Edmonton, Canada. 
By definition 
t—1 n n n 
E(min{X,t}) = )o (' |pia —p)y’"j+) (' |pia —p)”"'t 


j=0 j=t 
tly | 
1 -) (‘Jr —p)" "(1 -1/0)| (*) 
j=0 
Using Bernoulli’s inequality 1 — j/t < (1 — 1/t)’ for j = 0,1,...,¢ — 1, we obtain 


t—1 


E(min{ X, t}) > (1 - ale. —p/ty'(1 oy 


> (1 -E(F)e- e/a ry" 


Editorial comment. Let 
Z=min{X,t} =(X +t)/2 —-lY|/2, 
where Y = X — t. Thus 
E(Z) = (pn +t)/2— E(|Y))/2. 


Hence lower and upper bounds for E(Z) can be obtained by finding upper and 
lower bounds, respectively, for E(\Y)). 
Since the square-root function is convex, Jensen’s inequality gives 


E(IY|) = E(V¥") < VE(Y¥?). 
Now 
E(Y?) = E(X —t)’ = E(X —np + np —t)y 
= E(X — np)’ + 2(np - t) E(X — np) + (np -t)° 
= npq + (np — t)’, ( * *) 
where g = 1 — p. Thus a lower bound for E(Z) different from that given in (a) is 
E(Z) > (pn + 1)/2~ Ynpq + (np ~ 0)? /2. 


If ¢ is much less or much greater than np, rather accurate upper bounds can be 
given for E(Z) by using (*). When ¢ is close to np, we can proceed as follows. 
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H6lder’s inequality gives 
E(¥*) = E(IYP? IVI?) < (E(IYP OEY} 


Or 
E(IY|) > {E(¥?)P{E(¥4)} 
Thus we have the upper bound 
E(Z) < (pn + t)/2— {E(¥?)}{E(¥*)} 
where E(Y*) is given by (* *) and 
E(Y*) = E(X —t)' = E(X — np + np -t) 
= E(X — np)’ + 4(np — t)E(X — np)’ + 6(np — t)’ E(X — np)’ 
+ 4(np — t)E(X — np) + (np —-t) 
= npq(1 — 6pq) + 3(npq) + 4(np — t)npq(q - p) 
+ 6(np — t)’npq + (np — t)*. 


~1/2 


[2, 


Part (a) was solved also by R. A. Agnew, F. T. Bruss, D. Callan, J. Guillerme (France), R. J. Hendel, 
F. Levron (France), S. Philipp, H. L. Stubbs, and the Chico Problem Group. Several of these solvers 
also included upper bounds. 


Zeros of Some Special Polynomials 


E 3348 [1989, 735]. Proposed by Alfred Witkowski, University of Wroclaw, 
Poland. 


(a) Suppose k is a given nonnegative integer. If n > k and 
n 
P(x) = [h(x — J); 
j= 


let x,,, be the zero of P, in the interval (k,k + 1). Find lim, ,,.x,,, and 
estimate the speed of convergence. 
(b) If n > 0 and 


Q,(x) = aC —j*), 


let ¢, be the-zero of Q’, in the interval (0,1). Find lim, _,,, t, and estimate the 
speed of convergence. 


Solution by Richard Stong, University of California at Los Angeles. The zero x, , 
converges to k as n — ©; more precisely x, ,, = k + (log n)~' + O(log n)~ 2), 
The zero t, converges to the unique solution ¢ in (0,1) of wvt cot(avt) = —1, 
roughly t = 0.4170236...; more precisely t, =t+cn '+ O(n’), where c = 
A4t?/(2t + 2) = 0.11374... 

Note that P’(x) = P(x) f,(x), where 


f(x) = Y(x-/7 


j=0 
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Finally 
ar 
h'(t) = — Gy ose*(ave ) — apF “_. cot(rVt) — 
1? 1 T 1 1 
- -F(1+ =| + EP ai? 3 
1? 1 
Ge Oe 


so that —1/h'(t) = 4t?/(ar*t + 2). Thus the result claimed is established. 


Solved also by W. E. Briggs, W. Janous (Austria), O. P. Lossers (The Netherlands), R. Martin, L. F. 
Martins, J.-M. Monier (France), K. Zacharias, The NSA Problems Group, the Western Maryland 
College Problems Group, and the proposer. 


A Sequence Defined by the Limits of Integration 


E 3383 [1990, 343]. Proposed by Harold G. Diamond, University of Illinois, 
Urbana, IL. 


Suppose a is a given number in (0, 1]. Show that there exists a unique sequence 


of positive numbers a) < a, <a, < ::: such that a,/n > 1 as n — © and 
Qn4, 1 + cost 1 
[ —— a = (n = 0,1,2,...). 
a t nt+a 


n 


Solution by David S. Ross, Kodak Research Labs, Rochester, NY. For any 
ay > 0, the integral /*(1 + cos t)t~' dt is a positive, strictly increasing function of 
x on [da, ©) that approaches co as xX — o; its integrand is positive except at isolated 
points, and the integral is the sum of In(x /a,) and a truncation of the convergent 
integral {, 7 (COS t)t~' dt. Thus, any choice of a, > 0 uniquely determines a strictly 
increasing ‘sequence {a,,}°_, for which the integrals have the desired values. 

To determine the growth rate of a,, note first that a, ~ © as n > &, since 
fan. + cos t)t~' dt = L7_o(i + a)~', which goes to © as n > », Now define 
y, = lim, ..(L7_9(i + a)~' — Inn}. The existence of this limit is proved in much 
the same way as the existence of Euler’s constant y. The condition a,/n — 1 is 
equivalent to lim, ,(In a, — Inn) = 0. Since 


nwo 


a, COS t n—1 
Ina, —Inn = Ina, — [ dt+ » — Inn, 
a i=o ! a 
we have 
2 COS f 
lim (Ina, — Inn) = In ay ~ [ dt + y,. 
no 


ag 


As a function of do, the right side of this equation is strictly increasing for a, > 0 
and ranges from — to ». Hence there is a unique value of a, > 0 for which the 
right side of this equation is 0. This value of a, determines the unique sequence 
with the desired properties. 
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Editorial comment. Needless to say, the assertion of the problem is valid for any 
positive a, since increasing a by 1 merely replaces a, by a,,, (n = 0,1,2,...). 
Also the condition a,/n — 1 can obviously be generalized to a,/n — B for any 
fixed positive number f. Further generalizations are possible. 


Solved also by M. Golomb, B. Haible (Germany), E. A. Herman, K. W. Lau (Hong Kong), 
E. Levine, J. H. Lindsey II, O. P. Lossers (The Netherlands), J. G. Merickel (student), J.-M. Monier 
(France), W. A. Newcomb, T. S. Norfolk, A. Riese, D. B. Tyler, P. J. Zwier, University of South 
Alabama Problem Group, Western Maryland College Problems Group, and the proposer. 


A Perfect Property of Handful of 2 Numbers 
E 3385 [1990, 427]. Proposed by Solomon W. Golomb, University of Southern 


California, Los Angeles. 


For what positive integers n is it true that 


~ pH=n. 


p<7(n) 


Here, as usual, w(x) denotes the number of primes not exceeding x and the 
Summation is Over primes p. 


Solution by Marcin E. Kuczma, University of Warsaw, Poland. The answer is: 
5,17, 41, 77, 100. 

Let p, be the k-th prime number, so that p, = 2, p, = 3,.... Define n, = 
Pi +p.+ ++: +p,. A positive integer n satisfying the condition of the problem 
must lie in the sequence {n,}?_, and moreover must satisfy 


» p= Ng. 


p<(n,;) 
Thus p, must be the largest prime not exceeding 7(1,) and so 


w((n,)) =k. (1) 


Thus the problem may be restated as that of finding all k satisfying (1); the 
corresponding values of n, provide the solution of the problem as stated. 
An easy induction using p, > 2k — 1 shows that 


n,>k? fork =1,2,.... (2) 
Less trivial is the inequality 
a(n) >nl/2 forn > 149. (3) 
Postponing the proof of (3) for the moment, we note that it implies 
w(m(n)) >n'/* for n > 859, (4) 
because 7(859) = 149. Since n,, = 874 > 859, relations (2) and (4) imply 
w(ar(n,)) >k fork > 23. 


Among the numbers between 1 and 22 inclusive, only k = 2,4,6,8,9 satisfy (1); 
accordingly n, = 5,17, 41, 77, 100 constitute the answer. 
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To justify (3) we use the estimate (({1], Theorem 2) 


a(n) > for n > 67, (5) 


n 
Inn — 1/2 
together with 

In(In.a — 1/2) 

Inn 

(The inequality (6) holds for nm = 241 and hence for all n > 241, since (in x)/ 
(x + 1/2) decreases for x > 4.) From (5) and (6) we obtain (3) for n > 241. For 
149 <n < 241 it is easy to verify (3) directly. 


We remark that (5) is a rather advanced tool. To make the argument fully 
elementary, we might apply instead the Chebyshev-type estimate 


a(n) > (2n)/(3lnn) for n > 200, (7) 


which is provable easily from the factorization of binomial coefficients (cf. [2]). The 
inequality (7) implies (3) for n > 6737, while the range from 149 to 6736 must be 
examined computationally. 


—1/V2 forn > 241. (6) 


REFERENCES 


1. J. Barkley Rosser and Lowell Schoenfeld, Approximate formulas for some functions of prime 
numbers, Illinois J. Math., 6 (1962) 64-94. 

2. Don Bernard Zagier, Die ersten 50 Millionen Primzahlen, Elemente der Mathematik, Supplement 
No. 15, Birkhauser, 1977. (English translation in Math. Intelligencer 0 (1977), 7-19.) 
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Smith (Germany), J. McHugh, A. Pedersen (Denmark), R. Stong, and the proposer. Five incorrect or 
incomplete solutions were received. 


Creating a Polynomial from the Distance to the Nearest Integer 


E 3389 [1990, 428]. Proposed by Joseph H. Silverman, Brown University, Provi- 
dence, RI. 


Let ||¢|| be the distance from the real number f¢ to the nearest integer. Prove 
that there is a unique real number r greater than 1 such that 


io.¢) 


y r 2 (p7—!¢]| 

n=1 
is a polynomial in ||¢||; give this polynomial explicitly for the special value of r 
referred to. 


Solution by Marcin E. Kuczma, University of Warsaw, Warsaw, Poland. The 
desired value is r = 2, which produces the polynomial (1 /2)(||¢|| — e{2). 

Let f(t) = repel 't||. For r > 1, f, is a continuous, nonnegative func- 
tion. If r is also an integer, then f. is an even function, periodic with period 1. 
Hence f,1 — ¢) =f (t) if r is any integer at least 2. For any real r > 1, if we 
isolate the first term in the series and restrict attention to the interval [0, 1/2], we 
obtain the functional equation 


rPf(t)=t+f(t), te ([0,1/2]. (1) 
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Assume that f. is a polynomial in ||¢||. In particular, f(t) depends only on ||¢|| 
and so f.(1) = f,(0) = 0. Hence all terms of the-series vanish for t = 1. The 
vanishing of the second term yields ||r|| = 0, so r is an integer at least 2. Since f. 
restricted to [0,1/2] is a polynomial, both sides of (1) are polynomials on the 
interval [0,1/(2r)]. Differentiating twice yields f(t) = f’(rt) on [0, 1/(2r)], show- 
ing that f” is constant on that interval. Hence the polynomial in question has 
degree at most 2. Since f, is nonnegative, the functional equation implies that 
r*f(t)—t>0 for t € [0,1/2], with equality for t= 0 and t=1/r (this uses 
r > 2). We now have a quadratic function vanishing at 0 and 1/r and nonnegative 
throughout [0,1/2]. This requires that 1/r is the endpoint of the interval, i.e, 
r= 2. 

To prove the formula claimed for f,(t), it suffices to show that the continuous 
function g(t) = f(t) — t( —t)/2 is 0 throughout [0,1]. The symmetry of f, 
about 1/2 implies g(t)=g(1 —¢t) for ¢ © [0,1], and the functional equa- 
tion implies 4g(t) = gt) for t © [0,1/2]. These two relationships imply 
MAX io, 1/2) § = MAX /2,1) § = MAXo4) & = 4MaX 1/2, &. Hence max g = 0, and 
similarly min g = 0. 

Solved also by O. P. Lossers (Netherlands), T. S. Norfolk, the late David R. Richman, R. Stong, 
A. Tissier (France), A. Zulauf (New Zealand), and the proposer. 


The Golden Ratio Revisited 


E 3391 [1990, 528]. Proposed by David Doster, Choate Rosemary Hall, Walling- 
ford, CT. 


Define a function on the positive integers by putting f(1) = 1 and setting 


f(n) +2 if f(f(n) -n+1)=n 


f(n) +1 if otherwise. 


f(n+1)= 


(i) Find and prove a simple explicit formula for f(n) involving the greatest 
integer function. 
(ii) Show that if f( f(x) —n + 1) #n, then f(f(n) —n + 1) =n +1. 


Solution by Nathan J. Fine, Deerfield Beach, FL. The formula is f(n) = |dén], 
where @ = (1 + V5)/2 is the golden ratio, satisfying 6? — ¢ = 1. We prove this 
inductively by showing that F(n) =|dn] satisfies the same recurrence as f(n) 
(note that F(1) = 1). This will include a proof that F satisfies (ii). 

First we observe that F(n + 1) — F(m) is 1 or 2. This follows from F(m + 1) — 
F(n) < dn + D - (dn - 1D =64+1<3 and Fu + 1)-F() > dm4+1)-1 
—~dn=6-1>0. 

Next, let H(n) = F(F(n) — n + 1). The fact that H(m) is n or n + 1 follows 
froom H(n) < d(@n —n +1) =(6?2-6)n+¢6<n+2 and H(n) > d(dn - 1 
—-n+1)-1=(¢*-¢)n-1=n-1. 

Now suppose F(n + 1) — F(n) = 1. This implies 


H(n) = F(F(n + 1) ~n) = [d([d(n + | —n)| > o(6(n + 1) — 1-7) -1 
= 6(¢-1)n+ ¢(¢-1)-1l=n, 


so we conclude that H(n) =n + 1. 
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On the other hand, F(n + 1) — F(n) = 2 implies H(n) = F(F( + 1) -—7n -— DD 
< b(d(n + 1) —n — 1) = d(¢ — 1)n + 6(¢ — 1) =n + 1, so this requires H(n) 
=n. This completes the inductive proof that F = f. 


Solved also by the proposer and 39 others. 


A Simple Sequence of Complex Numbers 


E 3396 [1990, 529]. Proposed by Bruno Haible (student), Karlsruhe, Germany. 
Suppose we define a sequence {x,,}”_,) by putting x) = 1, x, = 1/2, and 
(n? + 1)x, = (2n? —n)x,_,—(n?-—n)x,_. forn =2,3,.... 
Prove that the sequence {x,} is bounded and determine lim inf x, and lim sup x,,. 


Composite solution, based on the solutions of Jean Anglesio, Garches, 
France, and Richard Stong, University of California, Los Angeles. We show that 
liminf x, = — yw/sinh 7 and limsup x, = y7/sinh 7. 

Define a sequence {y,}°_,) by yp = 0 and 


y, =n x, —X,-;) forn =1,2,.... (1) 
Then the defining recurrence for x,, together with the value of x,, yields 
X,= —N(Y, —Yn-1) forn =1,2,.... (2) 
Now let z, =x, + iy,, where i = ¥V—1. Then Z) = 1, and (1) and (2) yield 
Zz, =in(Z,—Z,-;) forn =1,2,.... (3) 


Solving (3) for z, we obtain z, = (1 + i/n)7'z,_4, so 
n i\7! 
Z,= 1+—] . 
(t+ ¢] 
Now let (1 +i/k)"'=r,e-%, where r,>0 and 0<6,<7/2. Then r, 
= V¥k*/(k* + 1) and 6, = tan ‘(1/k). Thus 
k2 

k* +1 


1/2 
eat wre +n) 


SO 
1/2 
cos(6, + ++: +6,). 


n k2 
—- Rez = _—_ 
*n © 2n Ta 


Now from the infinite product [I1Z_,( + u?/k?) = (sinh wu)/(7u), we obtain 


° k? T 
Nos] -Vaer- 
kai k° +1 sinh 7 


Since 6, = tan” '(1/k) = k~' + O(k~*), we see that 6, + 0, + ++: +6, comes 
arbitrarily close to both even and odd multiples of 77, and the conclusion follows. 


1/2 


Solved also by S. Chen, C. Georghiou (Greece), I. Kastanas, O. P. Lossers (The Netherlands), F. W. 
Schmidt, J. S. Sumner, the University of South Alabama Problem Group, and the proposer. 
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On the other hand, F(m + 1) — F(n) = 2 implies H(n) = F(F(m + 1) — n — 1) 
< b(d(n + 1) —n — 1) = d(¢ — 1)n + b(¢ — 1) =n + 1, So this requires H(n) 
=n. This completes the inductive proof that F = f. 


Solved also by the proposer and 39 others. 


A Simple Sequence of Complex Numbers 


E 3396 [1990, 529]. Proposed by Bruno Haible (student), Karlsruhe, Germany. 
Suppose we define a sequence {x,}”_, by putting x) = 1, x, = 1/2, and 
(n? + 1)x, = (2n?—n)x,_,—(n*—-—n)x,_, forn =2,3,.... 
Prove that the sequence {x,} is bounded and determine lim inf x, and lim sup x,,. 


Composite solution, based on the solutions of Jean Anglesio, Garches, 
France, and Richard Stong, University of California, Los Angeles. We show that 
lim inf x, = — ya/sinh a and limsup x, = y7/sinh 7. 

Define a sequence {y,}°_,) by yp = 0 and 


y, =n(x, —*x,-,) forn =1,2,.... (1) 
Then the defining recurrence for x,, together with the value of x,, yields 
X,= NY, —Yy-1) forn =1,2,.... (2) 
Now let z, =x, + iy,, where i = V—1. Then Z) = 1, and (1) and (2) yield 
Zz, =in(Z, —Z,-1) forn =1,2,.... (3) 


Solving (3) for z, we obtain z, = (1 + i/n)~'z,_ , so 
n i\—! 
Z,= 1+—] . 
Ht] 
Now let (1 +i/k)"'=r,e', where r,>0 and 0<6,<77/2. Then r, 
= Vk*/(k* + 1) and 6, = tan’ ‘(1/k). Thus 
k2 

k* +1 


1/2 
eat wre +n) 


SO 
1/2 


— cos(@, + ::: +60,). 
p-1k74+1 (0; n) 


Now from the infinite product [1%_,(1 + u?/k?) = (sinh wu)/(cru), we obtain 


ora) k2 1/2 a 
[Ts ~ V sinh a’ 


Since 6, = tan” '(1/k) = k~' + O(k~*), we see that 6, + 0, +++: +6, comes 
arbitrarily close to both even and odd multiples of 77, and the conclusion follows. 


n k2 
n= Rez, {TT 


Solved also by S. Chen, C. Georghiou (Greece), I. Kastanas, O. P. Lossers (The Netherlands), F. W. 
Schmidt, J. S. Sumner, the University of South Alabama Problem Group, and the proposer. 
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ADVANCED PROBLEMS 


6670. Proposed by R. H. Jeurissen, Toernooiveld, Nijmegen, The Netherlands. 


Let {0, 1}” denote the set of n-bit strings of zeros and ones. If (a,, a,,...,a,) € 
{0, 1}”, let 77,(a,,a5,...,a,) be the string (b,,b,,...,b,) given by b, = a, and 
b, =a, + a,_, (mod 2) for 1 < k <n. Since (a,, ay,..., a,) can be retrieved from 
(b,,55,...,b,), it is clear that 7, is a permutation of {0, 1}”. Determine the cycle 
structure of the permutation 7,,, 1.e., the lengths of the cycles that occur and the 
number of cycles of each length. 


6671. Proposed by Carl Pomerance, University of Georgia, Athens, GA. 


Let V(x) denote the number of distinct values not exceeding x taken on by 
Euler’s arithmetical function ¢. Let V*(x) denote the number of these values with 
a unique pre-image. For example, V(15) = 7, V*(15) = 0. 

R. D. Carmichael conjectured that V*(x) = 0 for all x. Prove the weaker 
assertion that lim inf, _,.{V*(x)/V(x)} < 1. 


6672. Proposed by H. B. Kushner, Nathan S. Kline Institute for Psychiatric 
Research, Orangeburg, NY. 


If a and b are positive real numbers, prove that 


tr /2 _ 
i ; {(a cos? 6 + bsin* 6)(asin* 6 + bcos? 6)} 6 
0 


7/2 _ 
— i / {a2 cos? 6 + b? sin?6} '’° do 
0 


and use it to prove that the integral on the right is unchanged if a and b are 
replaced by (ab)'”” and (a + b)/2, respectively. 


SOLUTIONS OF ADVANCED PROBLEMS 
A Refinement of Descartes’ Rule of Signs 


6597 [1989, 265]. Proposed by Ichiro Murase, Science University of Tokyo, Japan. 
Let f(x) € R[x], say 
f(x) =a) t+ a,x + +++ +a,x", n> 0, a, #0. 

Let Z,(f) denote the number of zeros of f in (0, +), each counted according to 
its multiplicity. Let W(ap, a,,...,a,) denote the number of pairs (a,,a,) with 
k <I such that a,a, < 0 and a, = 0 for k < v <I. Descartes’ rule of signs says 
that Z,(f) < Wao, a,,...,a,) and Wao, a,,...,a,) — Z,(f) is even. 

If there is an index k, 1 < k <n — 1, such that 

Aya, <0, Ap Any, <9, i < Ay Ags 

prove that 


Zi(f) < Wao, 4;,...,4,) — 2. 
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Solution by Allan Pedersen, Séborg, Denmark. Introduce the notation W(f) = 
Wao, 4;,...,a,) and define f*(x) = x"fl/x) =a, +a,_,x + +++ +ayx” 


Lemma 1. W(f*) = Wf) and Z,(f*) = Z,(f). 


Proof. The first assertion is clear. For the second, note that the statement 
a > 0 is a root of f(x) of multiplicity k means f(x) = (x — a)*g(x), g(a) # 0. 
But then 


f*(x) = (a7! —x)"akx"*g(1/x), 


so a! > Oisa root of f* of multiplicity k. Since f**(x) = f(x), the result easily 
follows. 


Lemma 2. If Z.(f) = Wf), then Z,(f’) = W(f’). 
Proof. Since f’(x) = a, + 2a,x + +++ +na,x"~', either 
WF) =W(f) or Wf’) =W(f) = 1. 
By a well-known argument using Rolle’s Theorem 


ZiAf) 2 2Z.(f) — 1. 


Hence 
ZAP) SW Pf )-1 or Z,(f') > Wf’). 
Since W(f’) — Z,(f’) = 0 is even, the last alternative must hold. 


Lemma 3. Let f(x) = L?_ joj Je; x! where n > 2, the c; are real, and c,, # 0, and 
set 


g,(x) = Cy_y + 24% + Cy 1X", 1<k<n-1. 
If Z(f) = W(f) and g,(x) is not identically zero, then Z_,(g,,) = W(g;,). 


Proof. This is clear for n = 2; proceed by induction on n. For n > 2, 
n—1 n—-1 . 
f(x) =n » j Jena? 
j=0 
is a polynomial of the type considered. By Lemma 2, 


Z,(f'/n) = W(f'/n). 


Hence by induction Z,(g,) = W(g,) for 2 <k <n — 1. However, cy does not 
occur in f’(x), so the induction will not be completed until the result for k = 1 is 
established. Consider 


reo- E(jpoa'= E (5) 


where c* =c,_,;. By Lemma 1, Z,(f*) = W(f*). Hence by the case k =n - 1 
we have 


Z,(h) = Wh), 
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where 
h(x) = c*_, + 2c*_,x + c¥x? =c, + 2c,x + Cox” = g* (x). 
Hence Z,(g,) = W(g,) and Lemma 3 is proved. 
Now return to the original problem, and introduce numbers c; such that 
f(x) =cot+ (7 Jexx “hoe +(Z Jeux" +s +e,x", 
Since c, # 0, the hypotheses on the a, yield 


> kK+t1 n-kt+i1 ,y 
Cie SE OS 1k 41 


If Z,(f) = W(f) then Lemma 3 yields Z,(g,) = W(g,) where g,(x) =c,_, + 
2c,x + C,4,x*. The hypotheses on the a; now yield 


2=W(8,) = Z4( 8). 
Hence, 4c? — 4c,_,c,,, > 0, a contradiction. The result of the problem follows. 


Editorial comment. The solutions of John Steinig and the proposer did not use 
derivatives. They noted that if the sequence of coefficients of f(x) has w changes 
of sign, and if B > 0, then the sequence of coefficients of g(x) = (x + B)f(x) has 
at most w changes of sign (see Pélya-Szeg6, II.V.32). Thus if 8 can be chosen so 
the coefficients of g(x) have at most w — 1 changes of sign, Descartes’ rule 
applied to g(x) yields 


Z.A(f) =Z.,(g) <w-1 
and hence (by its parity assertion) Z,(f) < w — 2. It suffices to establish the 
existence of such a number £, and this they did. 
Steinig carried the analysis somewhat further. He showed that if f has N 


coefficients a, ,...,@,, with k;_,<k; —2 (i = 2,..., N) such that 
a,_,a,<0, a@,a,,,<0, and a*<a,_,4,4, 
for v = k,,..., ky, and if the intervals 
a a,-4 
- - > - F VD k,, Ky 
Aya) a, 


have a nonempty intersection, then 
Zi(f) < W(ao,...,a,) — 2N. 


Moreover, he exhibited polynomials of degree n for each n such that equality 
holds. He also observed that if k <1<m and a,,a,,a,, are consecutive coeffi- 
cients of f(x) but k, 1, m are not consecutive integers (i.e., m — k > 2 and a, = 0 
for k <v <m,v #1) the conditions 


a,a,<0, aa, <0, a?<a,a,, 
do not, in general, imply 
Zi(f) < W(ao,...,a,) — 2. 
For example, W(1, — 7,50) = 2, but for f(x) = x° — 7x* + 50 we have 
f(2)>0, f(¥14/3) <0, and f(3) >0. 
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Finally, Steinig remarks that this problem is related to results of Fekete and 
Pélya (Rend. Circolo Mat. Palermo 34 (1912), 1~32), and perhaps even more 
closely to a result of E. Balint mentioned towards the end of that paper. 


Solved also by J. Steinig (Switzerland), S. Gaignoux (France), and the proposer. 


A Congruence Related to an Elliptic Curve 


6604 [1989, 533]. Proposed by Michael Reid, University of California, Berkeley. 
For which primes p > 3 is it true that 


Po! (3k)! 
y (—6)°?-** = 0 (mod p)? 
kao (k!)° 
Combined Solution by Noam D. Elkies, Harvard University, Cambridge, MA and 
Ronald J. Evans, University of California, San Diego. The answer is: the primes 
congruent to 2 modulo 3. More specifically, we prove the following theorem. 


THEOREM. For each prime p > 3, define 


P~1 (3k)! _ 
S = » ki (-6)” an 
k=0 
Then 
O(mod p), if p = 2(mod 3), 


S= — |] 
216( = —]!-*(mod Pp), ifp = 1(mod3). 


Proof. All congruences below are (mod p). Let r =|p/3]. Then S = —216T, 
where 
rp — 1)!62-341 
pe ED 


 (p — 3k — 11k B? 


sO we must show that 


_ 0, if p = 3r + 2, 
| -1/rP, ifp =3r4+1. 
Since 
_ p — 1)!64 
(Ww +o3+wi+ 6uvw)” —_ y ( ) 3a td 3b +dy3e+d 


Ibictd! 
a.b.c.d,>0 a!b!cld! 
at+tb+c+d=p-1 


T is the coefficient of (uuw)?~! in the expansion of (u? + v? + w? + 6uvw)? 1 
Since Limod py = 0 for0 <k <p — 2, 


y (w+or+wet 6bww)’  =T YS (ww)? 


u,v,w(mod p) u,v,w(mod p) 


er( Eo w}e-r 


u(mod p) 
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Thus, 


| ~4 
T= y {1 — (u3 + v3 + w3 + 6uww)” \, 
u,v,w(mod p) 
so T = N, where N is the number of (u, v, w) € F? on the cubic wu? + v? + w? + 
6uvw = 0 (i.e., 1 more than (p — 1) times the number of F,-rational points on the 
projective curve defined by the same cubic equation). From the transformation 


u= —-2x+yt+zZ, v= —2x-yrz, w= —-2x -22z 


(of determinant —12), we see that N equals the number of (x, y,z) € F? on 
zy? =4x° +23. If p=3r4+2, then x >x° is a bijection of the field of p 
elements, so N=p* and T=N#=0O. Finally, suppose that p = 3r+ 1. Then 
there exists a primitive cube root of unity p (mod p). Since 


3(x3 +y3 +23 + xyz) =(x ty +z) +(x t+ py +p2z) +(x + py +z), 


we have N = M, where M is the number of (u,v, w) € F? on the Fermat cubic 
u> + vu? +w? =0. Reasoning as above with u? + v? + w? in place of u? + v? + 
w? + 6uvw, we find that M is congruent (mod p) to the coefficient of (uvw)?~! in 
the expansion of (uv? + v? + w3)?71, ie, M = Gr)!/(r!)?. Thus T=N=M= 
—1/rl?. 


Remark. It was already known to Jacobi that —1/r!? =A (mod p) for primes 
p = 3r+ 1, where A is uniquely determined by 4p = A? + 27B’*,3|((A +1) AE 
Z,B€EZ. 


Comments by N. D. Elkies. More generally we may consider the sum 


(3k 
Sip) = ¥ , A jo" mod p 
k=0 


for nonzero rational a, corresponding to the curve 
X?+Y? —a'Z3 4+ XYZ = 0 mod p. 


This curve becomes singular when a = 27, when it factors into the product of the 
line Z = 3(X + Y) and the conic 


Z? + 3Z(X+ VY) + 9(X* -XY+ Y*) =0; 


their two points of intersection are defined over F, if and only if (—3/p) = 1, so 
their total number of points mod p is 3p? — 3p + 1 or p? +p -—1 for p=1 or 
p = 2(mod 3), respectively, whence S,,(p) = +1 in the former case and —1 in the 
latter case. For a # 27 that cubic is an elliptic curve of j-invariant (a + 216)°a/ 
(a — 27)°. Thus for a = —216, 54 or 24, the j-invariant 0, 54000 or — 12288000 
indicates complex multiplication by the orders Z[p], Z[V— 3], or Z[3p] in QlV— 3], 
where p is a complex primitive cube root of unity, and so S,(p) = 0 iff p = —1 
(mod 3), as in the solution above; whereas for all other values of a the elliptic 
curve has no complex multiplication, and then all that can be said is that 
S(p) = 0 if and only if that curve is supersingular mod p, which happens for an 
infinite set of p of density zero. 


Solved also by the proposer. 
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Some Inequalities Involving Elementary Symmetric Functions 


6629 [1990, 350]. Proposed by W. E. Briggs, University of Colorado, Boulder, 
CO. 


Let ao, be the ith elementary symmetric function of 1 positive numbers. 
(i) Prove that 


3(6;_19;42 + 99, O:41) > 4(a;_ O74 + of 19, J + o7o 0; 42) 


forl <i<n-1. 
Giy* Is jit true that 


OG; 19, A + 0/6, 543 1 GO, 1 > G O74. 4(0;_ 19, O;43 + 26; J; +2) 


forl <i<n- 1? 
Gii* Is it true that 


forl <i<n- 1? 


Solution of (i) by the proposer. By a theorem of Malo and Schur (see §16 of 
M. Marden, Geometry of Polynomials, Amer. Math. Soc., Providence, 1966 or N. 
Obreschkoff, Verteilung und Berechnung der Nullstellen reeller Polynome, VEBDV, 
Berlin, 1963) if the zeros of the polynomials a) + a,x + ++: +a,x” and by) + b,x 
+ +++ +b, x” are all real, and if those of one are all of the same sign, then the 
zeros Of dob) + a,b,x + ++: +a,b,x" are all real. Let r,,r,,...,7, be the n 
positive numbers of the problem and apply the theorem to 


n n n a . 
(1+ x) =14+(, 7 jet +(7)s lay 
and 
IT (e +r) =¢, +0, 4x + wee +o,x"! +x": 
hence the zeros of 
P(x) = 0, + , 7 1) n-1* + cee +(Tox" + 7 


are all real. Now form Q(x, y) = y”P(x/y) which is homogeneous of degree n. A 
partial derivative of Q(x, y) of any order less than or equal to n — 1 will have 
zeros x/y that are all real (by a repeated application of Rolle’s theorem). The 
various partial derivatives of Q(x, y) of order n — 3 are, up toa constant factor, of 
the form o,_,x° + 30,x*y + 30,,,xy? + 6;,,y° for some i with 1 <i<n—-2. 
Since the case i = n — 1 (of the proposed problem) i is immediate, the result of the 
problem follows by calculating the discriminant of this cubic polynomial, which 

_-must be non-negative. (Note that by adding and subtracting a suitable multiple of 
a2az the discriminant of a,x* + 3a,x? + 3a,x + a, may be written as 


27| (3aja3 + 6aya,a,a3 + 3apa3) — 4(a9a3 + apaz + aja;)|. 


Editorial comment. The inequalities of this problem arose in connection with a 
paper of the proposer published in the Rocky Mountain J. of Math. 15 (1985) 
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75-89. This paper in turn seems related to “Critical points and sigmoidicity of 
positive rational functions,” by W. G. Bardsley and R. M. W. Wood, this MONTHLY 
92 (1985), 37-48. 


No other solution of (i) and no solutions at all of (ii) and (iii) were received. 


An Assertion of Almost Everywhere Convergence 


6630 [1990, 350]. Proposed by David Pagus, Portland, OR. 


Suppose that a,,a@,,a3,... are positive real numbers such that 


») a, log(1 + 1/a,) < &. 
n=1 
(i) If {b,}°_, is an arbitrary sequence of real numbers, prove that the series 
n= 
converges for almost all real x. 
(ii) If k is a fixed positive integer and {b,}°_, is an arbitrary sequence of vectors 
in R*, prove that the series 


io.¢) 


—k 
» a,llx — 5, 
n=1 
converges for almost all vectors x in R*. Here || || is the usual Euclidean norm on 
vectors in R*. 


Solution by Kenneth Schilling, University of Michigan—Flint. It is not hard to 
show that lim, _,., a, = 0, and to deduce from this that 


ya, < %. 


n=1 


We establish (ii) directly, since part (i) is simply the case k = 1 of (ii). Let R > 0 
be arbitrary and x a random variable that is uniformly distributed in the ball 
B = Bp of radius R centered at the origin. Let A denote the volume of the unit 
ball in R*, so that B, has volume AR*. We shall prove that 

y a,|lx — b ||“ < 0 

n=1 
with probability 1. Since R is arbitrary, this suffices to prove (ii). 

We may assume without loss of generality that |a,| <2R for all n, since 
la,| > 2R for at most finitely many n. Less obviously, we may assume ||b,,|| < 2R 
for all n. To see this, note that ||b,|| > 2R implies ||x — b,|| > R since ||x|l < R. 
Hence we have 


—k _ 
Ya, |x — BI" < Via,R* < 


for the sum over all n with ||b,|| > 2R. 
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Now define random variables Y, by 


—k ss: k 
la — BI" if lx - BI" > a, 


0 otherwise, 


Y = 


n 


and observe that 
P, = Prob(Y, # Ilx — Byll*) < A(al/*)"/(AR*) = a,R~. 
Since the sum of all P, is finite, the Borel-Cantelli lemma tells us that 
Prob( Y,, = ||x — b ||“ for all but finitely many n) = 1. 


Hence it suffices to show }a,Y, < © with probability 1. Now E(Y,), the expecta- 
tion of Y,, is 


E(Y,) = file - 8lI-* av, 


where the integral is extended over all ¢ in Bz with ||t — b, ||“ > a,,. Since all such 


t clearly satisfy ||t — b,||<3R, the expectation E(Y,) is bounded above by the 
above integral extended over all ¢ such that 


a, < lull < (3R)*, 
where u = t — D,. By the obvious change of variable, 
E(Y,) < fllull™* av, 


where the integral extends over the region between two concentric spheres given 
by the above u-inequality. Let us change to k-dimensional spherical coordinates; 


then dV has the form p*~'B(6,,...,6,_,)dp d0, -:: d0,_,, where 
B(O,,...,9,_,) is a bounded function of the angular variables 6,,...,0,_, and 
6,,...,0,_, run over a bounded region. Hence 


E(Y,) < K{log(3R) + k~' log(1/a,)}, 
where K is a constant. Thus 
1 
E(X4,Y,) < K log(3R) ia, + Kk dia, oe| < 00 
a, 
and the result follows. 


Editorial comment. Jeffrey Vaaler informs us that similar results are valid in a 
non-Archimedean setting. For example, in the field of p-adic numbers @, a 
similar argument yields the following 


THEOREM. Let {a,}7_, and {b,}"_, be sequences in Q, with each a, # 0. If 


1 
1+ < 0, 
=| 


Y la,lp log 
n=1 
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then for w,-almost all x in Q, the infinite series (of real numbers) 


io.¢) 


y 


n=1 


lanlp 
Ix — bly 
converges. Here yw, is a Haar measure on (),,. 


Solved also by Jean-Marie Monier (France), José Humberto Ferreira Rosa (Brazil), Jeffrey Vaaler, 
and the proposer. 


Some Unimodal Reciprocal Polynomials with Positive Coefficients 
6631 [1990, 432]. Proposed by Ronald Evans, University of California at San 
Diego, and Peter Montgomery, Unisys and University of California at Los Angeles. 


We say that f(z) = a) + a,z + +++ +a,z* is a unimodal reciprocal polynomial 
with positive coefficients if 


) < Qo = a, < a, = ar < a, = a) Stee < Qik /2| = Qik +1)/2)° 
(i) If z,,Z,..., Z, are the nth roots of 1 and if 27/n < t, prove that 


Il © — Z;) 


larg z,|>t 


is a unimodal reciprocal polynomial with positive coefficients. Here —7 < arg z < 
TT. 
(ii) If z,, Z>,...,Z, are the nth roots of —1 and if 7/n < t, prove that 


II] (2-2) 


larg z,|>¢ 


is a unimodal reciprocal polynomial with positive coefficients. 


Solution by the proposers. We introduce a real parameter @ that will later be set 
equal to m/n. For now, however, simply assume that k is a nonnegative integer 
and that none of sin 0,sin26,...,sin k@ are zero. Define 


sin ké sin k@ sin(k — 1)0 


P = + ——_______"_ y? + eee 4+7k 
(2) sin 6 sin 8 sin 20 
kK sin k@sin(k — 1)@ ---sin(k + 1—j)6 
= Ss ij_jgggg <<< 2, 
j=0 sin 6sin26@ --: sin j6é 


Lema. If k is odd, then 
P(z)=(z+1) JT] (2° + 2zcos jé + 1); 


2<j<k-1 
jeven 


if k is even, then 


P.(z)= [|] (2*+2zcos jo + 1). 
l<j<k-1 
j odd 
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Il @ — Z;) 


larg z,|>t 
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(ii) If z,, z>,..., Zz, are the nth roots of —1 and if 7/n < t, prove that 


I] © — Z;) 


larg z,|>¢ 


is a unimodal reciprocal polynomial with positive coefficients. 


Solution by the proposers. We introduce a real parameter @ that will later be set 
equal to 7/n. For now, however, simply assume that k is a nonnegative integer 
and that none of sin 8,sin26,...,sin k@ are zero. Define 


sin k6 sink@sin(k —1)6 , 


P = + 77 + +++ +2% 
eZ) sin 6 sin 6 sin 26 
kK sin k@sin(k — 1)@ ---sin(k + 1 —j)6 , 
= 2), 
j=0 sin @sin26 --- sin jé 
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jeven 


if k is even, then 
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Proof. Set q = e*’*. Define 
(U)m = (1 — u)(1 — qu)(1 — q?u) ++: (1 q™~"u) 
for integers m > 0 and complex u. Then 
P,(z) = (~zq0~”*) 
A famous theorem of Cauchy [1, p. 36] states that 


_ k (4) x 
(i= x (4) (4) eo 


(this reduces to the binomial theorem when g — 1). Thus 


vv 1)/2( —u); 


_ “ (9)k v(u—k)/2~v 
PA2= LO ae 


The coefficient of z’ in P,(z) is 


— g**} —j v q kK ti-)/2 _ qk ti -)/e 


qve- k)/2 2 — _— a 
j-i 1-q’ j=l qiv—@ql 


7 sin(k + 1—j)0 
7 j=l sin j0 
Thus the lemma is proved. 
We now set 6 = 7/n. To avoid trivial cases we assume n > 4. Let f(z) = a, + 


a,z+ccca ,z be the product considered in the problem. Since monic polynomi- 
als with the same roots are identical, f(z) = P,(z). Since k <n, the values 
sin @,...,sin k@ are all positive; in particular, they do not vanish. Hence the 


Lemma applies and 
v sin(k + 1 ~ 8 


a, = Il 
ja] 


sin j0 


The rest is straightforward. For example, 
a sin(k + 1 —v)6 1 , Sin((k + 1 — 2v)/2)6 cos((k + 1)/2)0 
eee + eS 


sin vO sin v8 


so a,_, <a, if 1 <v <[k/2]. Equality is possible only if cos((k + 1)/2)6 = 0, 


v 


1e., kK =n — 1. This is excluded by hypothesis. 


Editorial comment. The above proof of the Lemma is due to Evans. Peter 
Montgomery produced a more elementary. proof that uses no more than the 
addition formulas for sine and cosine, together with induction on k. 

If z,,...,2, are all the zeros of any monic polynomial of degree n with 
nonnegative coefficients, it is natural to ask whether the product of the problem 
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has again nonnegative coefficients. The proposers remark that an affirmative 
answer is given in [2]. 
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Pearls in Graph Theory, a Comprehensive Introduction. By Nora Hartsfield and 
Gerhard Ringel. Academic Press, San Diego, 1990, ix + 246 pp. 


JOAN P. HUTCHINSON 
Mathematics and Computer Science Department, Macalester College, St. Paul, MN 55105 


Who needs another book on graph theory? I do, if it’s a book as refreshingly 
different as Pearls in Graph Theory by Nora Hartsfield and Gerhard Ringel. For 
example, the opening illustrations of graphs include the San Francisco area road 
system connecting Half Moon Bay and the Bay Bridge, the chemical graph for 
aspirin, and the details of a printed circuit board. The book contains a variety of 
nicely presented topics in graph theory, some classical and some quite unusual. 
And best of all, it contains a clear and lively three-chapter introduction to the 
central topics of topological graph theory. 

Pearls in Graph Theory begins informally and at an elementary level, suitable for 
a substantial freshman-sophomore course. After intuitive introductions, concepts 
and theory are developed with increasing depth, leading into material for a good 
intermediate-level course. Included also are appropriate open conjectures on, for 
example, the Oberwohlfach problem, magic and antimagic graphs, and colorings of 
“earth-moon” maps (more details follow). 

In addition, for me the final three chapters are a splendid, enticingly elementary 
yet comprehensive introduction to topological graph theory, much of which has 
been pioneered by Ringel. Essentially no other introductory text presents the 
depth, breadth, and fun of topological graph theory as does this book. (I should 
confess now that my main love is topological graph theory. What a good way to get 
student converts to the field, or, more generally, to topology!) 

But why and when should one teach any kind of graph theory course and 
specifically one such as that proposed in this book? I contend that graph theory at 
the elementary (but also intermediate and advanced) level is an excellent training 
ground for mathematics: The field is visualizable and concrete; it calls for the 
development of geometric, algebraic, and logical skills; and even in a one-semester 
course it can illustrate and point toward both research mathematics and applica- 
tions modeling. 

Consider the case of antimagic graphs: A graph with e edges is called antimagic 
if its edges can be labelled each with a distinct element from {1,2,..., e} so that 
the sums of the edge labels at each vertex differ. (When the sums are all equal, the 
graph is called magic, a graph theorist’s version of a magic square.) This book 
includes a variety of examples and exercises on antimagic (and magic) labellings, 
giving ideas to play with, even on the simple structure of trees. The investigations 
within the text and those suggested as exercises lead to the authors’ conjectures 
that, except for the two-vertex graph with one edge, every tree is antimagic as is 
every connected graph. Now that looks like a problem at least pieces of which a 
student should be able to tackle. (Settled cases are given in the exercises.) Magic 
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and antimagic labellings lead into the important topics of Leech and graceful 
labelings of trees. 

Thus the field of graph theory, especially as presented in the Hartsfield—Ringel 
book, provides an ideal training ground for the mathematical process of investiga- 
tion, géneralization, and conjecture, leading to the discovery of proofs or coun- 
terexamples. The book contains a pleasant variety of simply stated and intriguing 
problems, such as the Oberwohlfach problem and the planar coin problem. The 
former, posed by Ringel after he organized sessions at the Oberwohlfach mathe- 
matics institute, asks for seating plans for m meals so that the 2n + 1 participants 
have n different dining neighbors. (Solved cases are included.) The coin problem 
asks: Which planar graphs can be realized with each vertex as a (nonoverlapping) 
coin in the plane so that adjacent vertices correspond to touching coins? It is 
conjectured that every planar graph is a coin graph. This work also leads into some 
recent and intricate work in geometric realizations of graphs, a tantalizing area, 
but with few proposed techniques for further work. 

In summary, I feel that an elementary or intermediate course, for which this 
book would be appropriate, can serve first to intrigue potential students and then 
to lead them into a significant body of mathematics, resolved and unresolved. The 
references include both original research articles and graph theory texts; I would 
have preferred more pointers toward the intermediate-level references that are 
included in the reference section. For instance, the Hungarian algorithm to solve 
the assignment problem is presented, but not proved correct, with references to 
the papers of Kuhn, Egervary, and Konig. The validation of this algorithm is a nice 
application of Hall’s Theorem, as shown in [3]. 

The placement of a graph theory course, as proposed by this book, may not be 
completely clear for those of us who now teach (with enthusiasm!) a discrete 
mathematics course at a similar level. Such a course typically has considerable 
overlap with the Hartsfield~Ringel graph theory course (e.g., trees, graphs, and 
related algorithms), but also includes topics such as the proof techniques of 
induction and contradiction, the theory of sets, subsets, and permutations, and 
analysis of algorithms. The Hartsfield—Ringel book assumes and uses induction 
without ado, and in my experience induction is first taught in discrete mathematics 
courses. 

Pearls in Graph Theory has much to offer a solid intermediate-level course, 
although there are some topics where the instructor may want to introduce and 
develop more rigor (for example, in the sketch of Priifer’s proof of Cayley’s 
Spanning Tree Formula). The authors do especially nice work with “double 
counting,” a combinatorialist’s favorite trick of counting pairs of objects twice, as, 
for example, when they count the number of paths of k vertices within a complete 
graph. It is a little disappointing that only two of the four graph algorithms 
presented come with a proof of correctness. The full proof of Kruskal’s algorithm 
may be intricate for a beginner, but the special case when all edge weights are 
distinct is easily verified (see [1]). 

One of the most significant graph theory achievements in the past 25 years is 
the proof of the Heawood Map Color Theorem. In 1890 P. J. Heawood, frustrated 
by inability to prove the Four Color Conjecture, proved that every map on the 
sphere plus g handles, g > 0, can be colored with at most 


7+ V48¢+1 


2 
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colors, and conjectured that this bound could always be achieved by a map of h 
mutually adjacent countries. Initially only specific instances were known to be true, 
and his entire conjecture was finally proved in 1968 by G. Ringel and J. W. T. 
Youngs in a masterful piece of work (without use of a computer) assisted in parts 
by Terry, Meyer, and Gustin. This result was a major breakthrough in topological 
graph theory for the interest it stimulated, for its applicability, and for the 
development of the Ringel-Youngs “current graph” technique. Until now, this 
work and its important techniques have been available primarily to eager graduate 
students and researchers in [4, 5, 6], for example, but the ideas involved, when well 
explained, are straightforward and involve the kind of hands-on experimentation 
and discovery that comprise much research. I have long wanted to teach the 
rudiments of topological graph theory and of the Heawood Map Color Theorem, 
in particular, and Pearls provides the means. The last chapter includes a careful 
and elegant development of the heart of the Ringel- Youngs proof that is a joy to 
read and would be fun to teach. In fact, the next time I teach topology I think that 
this work will be my goal. (Such an approach is consistent with the topology 
textbook by Armstrong [2].) 

Heawood also made other important topological conjectures on which Ringel 
and many of his students have worked and generalized. For example, Heawood 
proposed the following Empire Problem. How many colors are needed for a planar 
map if all countries in the same empire receive the same color (and adjacent 
countries still receive different colors)? He showed that if each empire contains m 
countries (and so is dubbed an m-pire by Ringel), then the map can be 6m-col- 
ored. B. Jackson and Ringel showed 94 years later that the bound of 6m is best 
possible for all m> 41. In a similar vein, Ringel proposed the Earth-Moon 
Problem, which asks for the minimum number of colors needed to color maps on 
the earth and on a colonized moon so that each country and its lunar colony 
receive the same color. Such maps can always be 12-colored, but it is unknown 
whether more than 9 colors are ever needed. On these topics illustrative examples 
are presented as is introductory theory. My only quibble would be with the proof 
of the (planar) Five Color Theorem: The authors prove that the countries of every 
normal (i.e., connected, bridgeless, cubic) planar map can be 5-colored, and it 
seems to me an easier exercise (building on their work in Chapter 2) to prove that 
every planar graph can be five-colored. But then again, it was in the context of 
maps, rather than graphs, that Ringel achieved his celebrated results. 

I suppose many of us have contemplated someday writing up our favorite results 
of graph theory, but how many of us will ever do so? We are most fortunate that as 
Professor Ringel approaches retirement, he and Nora Hartsfield have made the 
effort to write up many of their interests in an expository style so that we can share 
their insights with our present and future students. 
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review in the MONTHLY. 

General, P, L. Nine Papers from the Interna- 
tronal Congress of Mathematicians 1986. I.G. Bash- 
makova, et al. Transl. Ser. 2, V. 147. AMS, 1990, vii 
+ 100 pp, $54. [ISBN: 0-8218-3133-X] Translations 
of papers that appeared in Russian in the Proceedings 
of the 1986 ICM. LAS 


General, S. 1-2-3 For Scientists and Engineers, 
Second Edition. William J. Orvis. Sybex, 1991, xxiv 
+ 371 pp, $24.95 (P). [ISBN: 0-89588-733-9] Text 
promotes the idea that spreadsheet software provides 
an ideal platform for the day-to-day computations 
and analysis of data in an engineer’s life. This edi- 
tion exploits developments in the Lotus 1-2-3 soft- 
ware since 1987. Little to no theory, but lots of ex- 
amples and applications. JO 


General, P, L**. Mathematical Impressions. Ana- 
tolir T. Fomenko. AMS, 1990, 184 pp, $45. [ISBN: 
0-8218-0162-7] A collection of nearly eighty evoca- 
tive drawings and oil paintings conveying Fomenko’s 
romantic imaginings (“the freeing of fantasies”) 
in starkly beautiful visual representations. The 
multi-talented mathematician-artist (“an academic 
in whose spirit lives a poet”) uses topological and 
analytic surfaces to develop stunning landscapes and 
intricate images inspired both by mathematical theo- 
rems and human emotion. The perfect mathematical 
coffee-table book. LAS 


Mathematics Appreciation, S(12-14). The 
Magic of Number. K.R. Imeson. Mathematical 
Assoc (Leicester, England), 1989, 96 pp, £3.50 
(P). [ISBN: 0-9514716-0-0] A series of brief math- 
ematical “byways,” mostly about numbers, derived 
from talks given over many years to Mathematics 
Clubs in British schools. Examples: golden propor- 
tion, equiangular spiral, irrational numbers, contin- 
ued fractions, calculation short cuts, primes. Good 
source for high school projects. LAS 


Mathematics Appreciation, T(13-14: 1, 2), L. 
Personal Mathematics and Computing: Tools for the 
Liberal Arts. Frank Wattenberg. New Liberal Arts 
Ser. MIT Pr, 1990, xiv + 556 pp, $29.95. [ISBN: 0- 
262-23157-3] The goal of the course envisioned by 


the author is to teach students “how to use math- 
ematics to reason about a variety of real, impor- 
tant problems.” Using True BASIC as its computing 
tool, the text investigates problems in probability, 
economics, optics, distribution of money, and pop- 
ulation dynamics. The author leaves students with 
enough tools and enough open questions after each 
section to make it plausible that this book might help 
achieve the author’s goals. The problems considered 
are, for the most part, real and important enough to 
make this book worth considering for a liberal arts 
mathematical reasoning course. JO 


Precalculus, T(13: 1, 2). Functions and Graphs, 
Second Edition. R. David Gustafson, Peter D. Frisk. 
Brooks/Cole, 1991, xiii + 559 pp, $49.50. [ISBN: 
0-534-14286-9] A simple, straightforward, thorough 
introduction to standard topics of precalculus math- 
ematics: elementary functions, analytic geometry, 
polar coordinates, rudiments of complex numbers, 
matrices and linear equations, Gaussian elimina- 
tion. Sequences, series, and mathematical induction 
are briefly surveyed. Elementary functions (alge- 
braic, exponential, logarithmic, and trigonometric) 
and their graphs are strongly emphasized. Many 
“story” problems require simple applications of ele- 
mentary functions and their properties. Other ex- 
ercises and examples employ calculators or comput- 
ers. PZ 


Finite Mathematics, T(13: 1). Finite Mathe- 
matics. André L. Yandl. Brooks/Cole, 1991, xxx + 
592 pp, $50. [ISBN: 0-534-14190-0] Opens with re- 
view and discussion of basic algebra including equa- 
tions and inequalities in one variable, functions and 
graphs, then covers standard topics such as matrices, 
linear programming, probability, and game theory. 
Many examples worked out in full detail. LC 


Education, P. From the File Treasury. Selections 
from the Arithmetic Teacher: Mathematics Educa- 
tion through the Middle Grades, 1977-1991. Ed: 
Jean M. Shaw. NCTM, 1991. Several dozen K-8 
classroom activities reprinted from The Arithmetic 
Teacher on 4x6 cards, arranged in a file by topic: 
numeration, estimation, fractions, problem solving, 
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geometry, measurement, time and money, graphing 
and statistics. Examples: use crackers to explore 
tessellations of the plane; analyze data from frisbee 
throws. LAS 


History, L. Operations Analysis in the U.S. Army 
Evghth Air Force in World War II. Charles W. 
McArthur. History of Math., V. 4. AMS, 1990, xxiv 
+ 349 pp, $81. [ISBN: 0-8218-0158-9] A telling of 
an important untold story—how a group of univer- 
sity mathematicians (“most of whom had never been 
near a military airplane”) helped increase the accu- 
racy of allied bombing from 15% in 1942 to 60% 
two years later. More a chronology of events and 
people (many of whom subsequently became very 
well known) than of mathematical problems, but 
nonetheless interesting and valuable, especially for 
insights into the origins of operations research. LAS 


Logic, S*(13-18), L. The Elements of Logic: For 
Use in Computer Science, Mathematics, and Philos- 
ophy. William J. Edgar. Macmillan, 1989, ix + 150 
pp, (P). [ISBN: 0-02-331485-0] Intended for a three 
or four week component of a course which uses some 
basic mathematical logic. Written in a slightly infor- 
mal manner. All standard basic topics included and 
treated in an elementary manner. Many exercises, 
all with solutions. Glossary. Suggestions for further 
reading. Index. RJA 


Logic, P. Lecture Notes in Mathematics-1429: Logic 
and Computer Science. S. Homer, et al. Springer- 
Verlag, 1990, 162 pp, $18 (P). [ISBN: 0-387-52734-6] 
Collection of lecture notes for four short courses given 
at the C.I.M.E. meeting held June 1988 in Monteca- 
tini, Italy. RJA 

Logic, T*(17-18: 1, 2), S, P. Non-Monotonic 
Reasoning: Formalization of Commonsense Rea- 
soning. Witold Lukaszewicz. Ser. in AI. El- 
lis Horwood (US Distr: Prentice Hall), 1990, 328 
pp. (ISBN: 0-13-624446-7] Begins with a collec- 
tion of needed background material including clas- 
sical propositional, first-order, resolution method, 
second-order, and modal logic. General discussion 
of non-monotonic reasoning and its use in artificial 
intelligence ensues. Bulk of text devoted to modal 
non-monotonic logics, default logic, circumscription, 
and the closed world assumption. Bibliography; in- 
dex. RJA 


Logic, $(16-18), P. Proof Theory and Logical Com- 
plezity, Volume I. Jean-Yves Girard. Stud. in Proof 
Theory Mono. Bibliopolis (US Distr: Elsevier Sci- 
ence), 1987, 503 pp, $102.50. [ISBN: 0-444-98715- 
0] ‘Topics include Hilbert’s program, semantics of 
the sequent calculus, applications of the Hauptsatz 
of Gentzen, Ii formulas and well-foundedness, the 
w-rule, the ordinal €,, and arithmetic. Begins with 
background and notation. Note price. Bibliography 
and index. RJA 

Logic, S(14-16), L. Deducibility and Decidability. 
R.R. Rockingham Gill. Routledge, Chapman & Hall, 
1990, viii + 162 pp, $47.50. [ISBN: 0-415-00033-5] 
Intended for students of philosophy. Chapters on the 
natural numbers, addition, multiplication and expo- 
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nentiation, theory of U-forms, effectiveness, recur- 
sively defined predicates, arithmetization of syntax, 
diagonalization, Godel’s theorems, limits of repre- 
sentability, and arithmetical hierarchy. Appendix; 
references; index. RJA 


Foundations, T(13-14: 1). Bridge to Abstract 
Mathematics: Mathematical Proof and Structures, 
Second Edition. Ronald P. Morash. McGraw-Hill, 
1991, xviii + 389 pp, $42.95. [ISBN: 0-07-043043- 
8] Changes include reorganization and rewriting of 
chapters 3 and 4, new emphasis on writing, differ- 
ent design and typeface for “user friendly” appear- 
ance, and inclusion of some biographical sketches of 
famous mathematicians. (First Edition, TR, April 
1988.) LC 


Graph Theory, P. Graphs and Polyhedra: Binary 
Spaces and Cutting Planes. A.M.H. Gerards. CWI 
Tract, V. 73. Stichting Mathematisch Centrum, 
1990, 188 pp, Dfl. 49 (P). [ISBN: 90-6196-390-7] A 
research monograph applying regular matroid the- 
ory, linear programming, and cutting planes to com- 
binatorial optimization problems. Results on signed 
graphs (respectively, undirected graphs), with no 
odd-K4 and no odd-K? (respectively, and no odd- 
prism). JPH 


Discrete Mathematics, T(13-14: 1, 2). Dis- 
crete Mathematics and Its Applications, Second Edi- 
tion. Kenneth H. Rosen. McGraw-Hill, 1991, xxii 
+ 788 pp, $45.95. [ISBN: 0-07-053744-5] Covers 
logic, sets, methods of proof, algorithms, elementary 
and advanced counting techniques, relations, graph 
theory and trees, Boolean algebra, formal languages, 
and finite-state machines. Numerous exercises and 
computer projects. Algorithms in both English and 
pseudocode. More in-depth applications in Michaels 
and Rosen’s book Applications of Discrete Mathe- 
matics. AD 


Discrete Mathematics, P, L*. Discrete Mathe- 
matics across the Curriculum, K-12, 1991 Yearbook. 
Margaret J. Kenney, Christian R. Hirsch. NCTM, 
1991, viii + 248 pp, $18. [ISBN: 0-87353-305-4] 
Twenty-eight chapters giving a wide variety of teach- 
ing strategies and topical examples of how to in- 
troduce discrete mathematics—“the math for our 
time” —into the K-12 school curriculum as called for 
in the NCTM Standards. A rich source of ideas for 
teachers who are beginning to develop curricula in 
this area. LAS 


Linear Algebra, T(14: 1). Elementary Linear Al- 
gebra with Applications, Second Edition. Richard O. 
Hill, Jr. Harcourt Brace Jovanovich, 1991, xv + 492 
pp, $33 net. [ISBN: 0-15-521092-0} This edition 
offers an optional early treatment of determinants; 
adds about 200 exercises, including some cumulative 
review exercises; adds five new sections, including 
applications of determinants and iterative methods; 
and makes some editorial changes. (1986 Academic 
Press First Edition, TR, October 1986.) OJ 


Group Theory, S(18), P. Automorphic Repre- 
sentations of Unitary Groups in Three Variables. 
Jonathan D. Rogawski. Annals of Math. Stud., V. 
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123. Princeton Univ Pr, 1990, xii + 259 pp, $19.95 
(P); $60. [ISBN: 0-691-08587-0; 0-691-08586-2] Au- 
tomorphic representations of the group G = U(3) 
with basic field F are studied by means of compari- 
son of trace formulas of G and Res G for a quadratic 
extension E& of F. This involves a stabilization pro- 
cedure initially due to Langlands and developed here 
with applications. References, indexes. JS 


Algebra, P. First Siberian Winter School: Alge- 
bra and Analysis. Eds: A.D. Aleksandrov, et al. 
Transl. Ser. 2, V. 148. AMS, 1991, vii + 112 pp, $63. 
[ISBN: 0-8218-3700-1] Ten advanced expository pa- 
pers based on lectures delivered in March 1987 at 
Kemerovo State University intended to acquaint an- 
alysts with the current frontiers of algebra. LAS 


Algebra, P. Semigroup Algebras. Jan Okninski. 
Pure & Appl. Math., V. 138. Marcel Dekker, 1990, 
ix + 357 pp, $119.50. [ISBN: 0-8247-8356-5] <A uni- 
fied presentation of recent results in the theory of 
noncommutative semigroup rings over a field. Re- 
ceiving special attention are semigroup algebras of 
cancellative semigroups, semigroup algebras satisfy- 
ing polynomial identities, and finiteness conditions 
on semigroup algebras. SG 


Algebra, S(18), P. Ideals of Identities of Asso- 
ciative Algebras. Aleksandr Robertovich Kemer. 
Transl. of Math. Mono., V. 87. AMS, 1991, v + 81 
pp, $75. [ISBN: 0-8218-4548-9] After an introduc- 
tion summarizing developments in the theory of al- 
gebras defined by polynomial identities, the author 
leads into his own recent work, presenting several 
major results. Most notable is a solution of Specht’s 
problem with a proof that any associative algebra 
over a field of characteristic zero has a finite basis of 
identities. Bibliography, index. JS 


Algebra, T(15-16: 2), L. Abstract Algebra. Den- 
nis Kletzing. Harcourt Brace Jovanovich, 1991, xii 
+ 605 pp, $35 net. [ISBN: 0-15-500391-7] Groups, 
rings, fields including sections on p-groups and Sylow 
theory, the geometric groups, Galois theory and solv- 
ability by radicals. Copious examples, appendices on 
history of groups, rings and fields, glossary. LC 


Differential Equations, T(18: 1), P. Lecture 
Notes in Mathematics-1445: Regularity Theory for 
Quastlinear Elliptic Systems and Monge—Ampere 
Equations in Two Dimensions. Friedmar Schulz. 
Springer-Verlag, 1990, xv + 123 pp, $16 (P). [ISBN: 
0-387-53103-3] Self-contained introduction to char- 
acteristic theory for the named equations. Presents 
the Heinz-Lewy theory plus auxiliary material. Also 
includes basic results about Holder continuity and 
regularity theory for linear elliptic equations. BL 

Differential Equations, P. Ezponential Sums and 
Differential Equations. Nicholas M. Katz. Annals 
of Math. Stud., No. 124. Princeton Univ Pr, 1990, 
xi + 430 pp, $22.50 (P); $65. [ISBN: 0-691-08599-4; 
0-691-08598-6] Relates the theory of linear differ- 
ential equations in one complex variable with poly- 
nomial coefficients to the theory of one-parameter 
families of exponential sums over finite fields. JO 


Differential Equations, P. Ezterior Differential 
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Systems. R.L. Bryant, et al. Math. Sci. Res. Inst., 
V. 18. Springer-Verlag, 1991, vii + 475 pp, $49. 
[ISBN: 0-387-97411-3] Comprehensive reference on 
the subject, with applications to partial differen- 
tial equations and linear differential operators. This 
book is not an introduction. JO 


Partial Differential Equations. Surveys on Anal- 
ysis, Geometry and Mathematical Physics. Eds: 
Bert-Wolfgang Schulze, Hans Triebel. Teubner- 
Texte zur Math., Band 117. BG Teubner Leipzig, 
1990, 308 pp, (P). [ISBN: 3-322-00773-1] Five pa- 
pers on boundary value problems, non-linear prob- 
lems, and pseudo-differential operators. LAS 


Partial Differential Equations, P. Solitons in 
Molecular Systems, Second Edition. A.S. Davydov. 
Transl: Eugene S. Kryachko. Math & Its Applic., V. 
61. Kluwer Academic, 1991, xiii + 413 pp, $149. 
[ISBN: 0-7923-1029-2] Presents the theory of ex- 
cited states in quasi-one-dimensional periodic struc- 
tures using the theory of non-linear differential equa- 
tions. Background theory in first five chapters in- 
cludes discussion of role of solitons in biological sys- 
tems such as muscle contraction and general anes- 
thesia. JO 


Partial Differential Equations, P. Multidimen- 
stonal Hyperbolic Problems and Computations. Eds: 
James Glimm, Andrew J. Majda. IMA, V. 29. 
Springer-Verlag, 1991, xiv + 386 pp, $39.80. (ISBN: 
0-387-97485-7] Proceedings of an IMA workshop 
held April 3-14, 1989 as part of the program on 
nonlinear waves. Twenty-six papers covering a wide 
cross-section of research: kinetic theory of gases, sys- 
tems of hyperbolic conservation laws, nonlinear ge- 
ometric optics, shock waves, and numerical meth- 
ods. BL 


Partial Differential Equations, P. Bounded and 
Almost Periodic Solutions and Nonlinear Operator 
Differential Equations. A.A. Pankov. Math. & Its 
Applic., V. 55. Kluwer Academic, 1990, x + 220 
pp, $99. [ISBN: 0-7923-0585-X] Given a bounded 
function from a locally compact group to a Banach 
space and a positive number epsilon, an element of 
the group is an epsilon-almost period of the func- 
tion if the supremum taken over the group of the 
difference between the function and its translate by 
that element is less than epsilon. A function is al- 
most periodic if each subset of epsilon-almost periods 
is relatively dense in the group. Treats solutions of 
evolution equations and of spacially almost periodic 
problems. BL 


Partial Differential Equations, P. Hamiltonian 
Flows and Evolution Semigroups. Henryk Gzyl. Pit- 
man Res. Notes in Math. Ser., V. 239. Longman 
Sci & Tech (US Distr: Wiley), 1990, 124 pp, $32 
(P). [ISBN: 0-582-03190-7] Develops a non-unitary 
representation theory for canonical transformations 
of Hamiltonian systems and an operational calculus 
similar to that in quantum mechanics. The repre- 
sentation theory maps semigroups and special func- 
tions associated with different Hamiltonians onto one 
another. The operational calculus is used to asso- 
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ciate an evolution semigroup to each Hamiltonian. A 
variety of examples illustrate the results and meth- 
ods. BL 


Partial Differential Equations, P. Lagrangian 
Manifolds and the Maslov Operator. <A.S. Mish- 
chenko, V.E. Shatalov, B. Yu. Sternin. Transl: Dana 
Mackenzie. Springer-Verlag, 1990, x + 395 pp, 
$69.50. [ISBN: 0-387-13613-4] Presents a method 
for obtaining global asymptotic solutions to partial 
and pseudodifferential equations of physical interest, 
e.g., quantum mechanics, wave propagation. Using 
geometric optics, the amplitude of a wave may be- 
come infinite where rays focus or form caustics. The 
Maslov operator translates the existence and unique- 
ness of solutions to conditions on cocycles on a sub- 
manifold in phase space. Part I develops the neces- 
sary topological tools. Part II constructs the canon- 
ical operator and presents applications. BL 


Partial Differential Equations, P. Inverse Source 
Problems. Victor Isakov. Math. Surveys & Mono- 
graphs, No. 34. AMS, 1990, xiv + 193 pp, $72. 
[ISBN: 0-8218-1532-6] Treats inverse problems for 
all three major types of partial differential equations, 
focusing on elliptic equations. Contains a basic re- 
view of the direct problem and an overview of in- 
verse problems, uniqueness, existence, and regularity 
theorems. Also treats stability and numerical solu- 
tions. BL 


Numerical Analysis, P. Fourth International 
Symposium on Domain Decomposition Methods for 
Partial Differential Equations. Eds: Roland Glowin- 
ski, et al. SIAM, 1991, xi + 417 pp, $56.50 (P). 
[ISBN: 0-89871-278-5] Proceedings of a symposium 
held in Moscow, May 21-25, 1990. Thirty-three pa- 
pers on the method of “solving boundary value prob- 
lems which allow concurrent computations (parallel 
processing) to occur on different subdomains.” BL 


Numerical Analysis, P, L. Finite Element Meth- 
ods (Part 1). Eds: P.G. Ciarlet, J.L. Lions. Hand- 
book of Numer. Analysis, V. II. North-Holland (US 
Distr: Elsevier Science), 1991, ix + 928 pp, $110. 
[ISBN: 0-444-70365-9] Contains six articles entitled 
“Finite Elements: An Introduction,” “Basic Error 
Estimates for Elliptic Problems,” “Local Behavior in 
Finite Element Methods,” “Mixed and Hybrid Meth- 
ods,” “Eigenvalue Problems,” and “Evolution Prob- 
lems.” AO 


Numerical Analysis, P. Advances in Numert- 
cal Partial Differential Equations and Optimization. 
Eds: S. Gomez, J.P. Hennart, R.A. Tapia. SIAM, 
1991, xii + 365 pp, $47.50 (P). [ISBN: 0-89871-269- 
6] Proceedings of the Fifth Mexico-United States 
Workshop on Numerical Analysis in Merida, Yu- 
catan, Mexico, January 1989. Numerical aspects of 
optimization, linear algebra, and differential equa- 
tions. MLR 


Functional Analysis, P. Littlewood-Paley Theory 
and the Study of Function Spaces. Michael Frazier, 
Bjorn Jawerth, Guido Weiss. CBMS Reg. Conf. Ser. 
in Math., No. 79. AMS, 1991, vii + 132 pp, $40 
(P). [ISBN: 0-8218-0731-5] Fifteen lectures from a 
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1989 conference at Auburn presenting the basics of 
Littlewood-Paley theory, which provides a unifying 
framework for a variety of seemingly unrelated func- 
tion spaces. JO 


Analysis, P. Mathematical Problems of Tomogra- 
phy. Eds: I.M. Gelfand, S.G. Gindikin. Transl. of 
Math. Mono., V. 81. AMS, 1990, iii + 267 pp, $96. 
[ISBN: 0-8218-4534-9] Translations from Russian of 
eight papers related to computerized tomography, 
the mathematics of CT-scans, and of many applica- 
tions in radio astronomy and electron micrography. 
The papers are further unified by their attention to 
the mathematical restrictions arising from the prac- 
tical nature of the problems. JO 


Algebraic Geometry, P. Algebraic Geometry: 
Sundance 1988. Eds: Brian Harbourne, Robert 
Speiser. Contemp. Math., V. 116. AMS, 1991, xi 
+ 146 pp, $57 (P). [ISBN: 0-8218-5124-1] A collec- 
tion of ten papers on algebraic curves and related 
varieties. SG 


Algebraic Geometry, P. Fewnomials. A.G. Kho- 
vanskii. Transl. of Math. Mono., V. 88. AMS, 1991, 
viii + 139 pp, $96. [ISBN: 0-8218-4547-0] Con- 
cerned with the general problem of estimating the 
number of positive solutions of a system of k real 
polynomials in k variables in terms of & and the 
number of monomials that appear in the polynomi- 
als. Complex and transcendental versions are also 
presented. SG 

Differential Geometry, T?(17: 2). Basic Ele- 
ments of Differential Geometry and Topology. S.P. 
Novikov, A.T. Fomenko. Math. & Its Applic., V. 
60. Kluwer Academic, 1990, zx + 490 pp, $164. 
[ISBN: 0-7923-1009-8] Based on a series of courses 
at Moscow State University, this book is intended to 
present those aspects of geometry and topology that 
are essential for a modern mathematical education. 
Includes classical theory of curves and surfaces, ten- 
sor analysis, and basic homotopy. Seven appendices 
provide the preceding theory with some life in the 
form of examples. Note the price. JO 


Differential Geometry, T(17-18: 1), S. Manifold 
Theory: An Introduction for Mathematical Physi- 
cists. Daniel Martin. Math. & Its Applic. Ellis 
Horwood, 1991, 423 pp, $81. [ISBN: 0-13-543877- 
2] Differential geometry text for graduate students 
in physics. Standard topics. Very readable—could 
be used for independent study. Many examples in 
the first chapters, only a few from physics. A few 
proofs in the later chapters are omitted. Answers, 
hints, or partial solutions to about half the exercises. 
Bibliography. OJ 

Geometry. Mathographics. Robert Dixon. Dover, 
1991, ix + 214 pp, $8.95 (P). [ISBN: 0-486-26639-7] 
Over 100 mathematical drawings created by com- 
pass constructions or computer graphics. Intended 
to intrigue and inspire the reader to replicate or ex- 
tend the drawings. Some figures are explained; many 
are not. Minimal narrative provides mathematical 
background, but the heart of the book is in the art- 
work. MW 
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Algebraic Topology, T(18: 1), S, P. Topology of 
4-Manifolds. Michael H. Freedman, Frank Quinn. 
Math. Ser., V. 39. Princeton Univ Pr, 1990, viii + 
259 pp, $49.50. [ISBN: 0-691-08577-3] Masterful 
presentation of the basic tools and some major re- 
sults in the study of 4-manifolds. Offers simplified 
treatments of the embedding theorems for 2-disks, 
the classification theorem for closed 1-connected 4 
manifolds, and more general classification theorems. 
Almost a third of the material is new. Most chap- 
ters conclude with references for the major ideas and 
results they contain. A strong algebraic (geometric) 
topology course is a prerequisite. OJ 


Operations Research, P, L. Mathematical Devel- 
opments Arising from Linear Programming. Eds: 
Jeffrey C. Lagarias, Michael J. Todd. Contemp. 
Math., V. 114. AMS, 1990, xiv + 341 pp, $59 
(P). [ISBN: 0-8218-5121-7] Proceedings of the AMS 
Summer Research Conference held at Bowdoin Col- 
lege in 1988. Contains twenty-one papers dealing 
mostly with various aspects of interior-point lin- 
ear programming algorithms. Nonlinear optimiza- 
tion and integer programming problems are also dis- 
cussed. These papers represent current research and 
are neither introductions to nor surveys of linear pro- 
gramming. SM 


Operations Research, T(16-17), L. Game The- 
ory: Analysts of Conflict. Roger B. Myerson. Har- 
vard Univ Pr, 1991, xiii + 568 pp. [ISBN: 0-674- 
34115-5] A very well-written introduction to game 
theory presented in the style of a mathematics text 
with many theorems and a heavy use of mathemat- 
ical symbols. The ideas, techniques, and notation 
from undergraduate courses in calculus, linear al- 
gebra, and probability are assumed throughout the 
text. Exercises appear after each of the ten chapters; 
none of the exercises are solved. Extensive bibliog- 
raphy. SM 


Optimization, T(16-17: 1), C, P, L**. Knapsack 
Problems: Algorithms and Computer Implementa- 
tions. Silvano Martello, Paolo Toth. Wiley, 1990, 
xii + 296 pp, $110. [ISBN: 0-471-92420-2] Presents 
state-of-the-art exact and approximate algorithms 
for a number of important NP-hard integer program- 
ming problems: binary, bounded, unbounded, and 
binary multiple knapsack problems as well as subset- 
sum, change-making, generalized assignment, and 
bin-packing problems. Includes theoretical and em- 
pirical results on computational complexity. Fortran 
codes for the most effective algorithms are provided 
on a diskette included with the book. AO 


Systems Theory, P. Progress in Fuzzy Sets and 
Systems. Eds: Wolfgang H. Janko, Marc Roubens, 
H.-J. Zimmermann. Theory & Decision Lib., Ser. 
D, V. 5. Kluwer Academic, 1990, ix + 188 pp, $87. 
[ISBN: 0-7923-0730-5] Contains the proceedings of 
a workshop held April 6-8, 1989 in Vienna. Fifteen 
refereed papers, in alphabetical order. RJA 


Stochastic Processes, P. Lecture Notes in 
Mathematics-1464: Ecole d’Eté de Probabilités de 
Saint-Flour XIX-1989. D.L. Burkholder, E. Par- 
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doux, A. Sznitman. Springer-Verlag, 1991, vi + 259 
pp, $30 (P). (ISBN: 0-387-53841-0] Three sets of 
course lectures: D. Burkholder on martingale the- 
ory; E. Pardoux on filtrage stochastiques (in French); 
and A. Sznitman on propagation of chaos. (One sig- 
nature, pp. 58-90, bound up-side-down.) LAS 


Computational Statistics, P. The Frontiers of 
Ezpert Systems and Artificial Intelligence. Ed: Ed- 
ward J. Dudewicz. Amer. Ser. in Math. & Manage- 
ment Sci., V. 27. American Sciences Pr, 1989, 214 
pp, $98.75 (P). [ISBN: 0-935950-29-X] Contains nine 
papers presented at the First International Confer- 
ence on Statistical Computing held March 30-April 
2, 1987 in Cegme, Izmir, Turkey. RJA 

Statistics, P. COMPSTAT 1990: Proceedings in 
Computational Statistics. Eds: K. Momirovic, V. 
Mildner. Springer-Verlag, 1990, xiii + 336 pp, $69 
(P). [ISBN: 0-387-91390-4] A collection of forty- 
nine papers from the ninth COMPSTAT Symposium 
held at Dubrovnik, Yugoslavia, 1990. Topics include 
multivariate data analysis, statistical expert systems, 
and robust computing. RWJ 


Statistics, P. Statistical Multiple Integration. Eds: 
Nancy Flournoy, Robert K. Tsutakawa. Contemp. 
Math., V. 115. AMS, 1991, xii + 276 pp, $71 (P). 
[ISBN: 0-8218-5122-5] Papers from a 1989 confer- 
ence of mathematicians, statisticians, and computa- 
tional scientists on problems and techniques in sta- 
tistical multiple integration, primarily arising from 
multivariate and Bayesian statistics. RM 


Programming, T(15-17: 1), L*. An Intro- 
duction to Object-Oriented Programming. Timothy 
Budd. Addison-Wesley, 1991, xv + 399 pp, $36.75. 
[ISBN: 0-201-54709-0] A language-independent in- 
troduction to the concepts of object-oriented pro- 
gramming illustrated with examples from C+4, 
Smalltalk, Objective-C, and Object Pascal. Case 
studies present practical examples of the use of 
object-oriented programming techniques. Covers ad- 
vanced topics such as metaclasses, delegation, pa- 
rameterized classes, and multiple inheritance as well 
as implementation of object-oriented languages. AO 


Algorithms, P*, L**. Handbook of Algorithms 
and Data Structures In Pascal and C, Second Edi- 
tion. G.H. Gonnet, R. Baeza- Yates. Intern. Comput. 
Sci. Ser. Addison-Wesley, 1991, xiv + 424 pp, $34.50. 
[ISBN: 0-201-41607-7] This new edition (First Edi- 
tion, TR, February 1985) incorporates many recent 
results and a new chapter on text searching. The bib- 
liography contains references to thirty-six textbooks 
and 1350 published research papers! Gives Pascal 
and/or C implementations of many of the algorithms 
discussed in the text. AO 

Algorithms, T(17-18: 1, 2), P, L. Analysis of Al- 
gorithms and Data Structures. Lech Banachowski, 
Antoni Kreczmar, Wojciech Rytter. Intern. Com- 
put. Sci. Ser. Addison-Wesley, 1991, xii + 300 pp, 
$36.75. [ISBN: 0-201-41693-X] Presents a variety of 
approaches to the analysis of algorithms and data 
structures including probabilistic models, generating 
functions, embeddings between data structures, the 
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pointer machine model of computations, and peb- 
ble games for modeling space-efficient computations. 
Also discusses transformational techniques for speed- 
ing up algorithms (dynamic and algebraic simula- 
tion) and analysis of parallel algorithms. AO 
Computer Systems, S?7(13), L. Life with UNIX: 
A Guide For Everyone. Don Libes, Sandy Ressler. 
Prentice Hall, 1989, xx + 346 pp. [ISBN: 0-13- 
536657-7] Chatty guide (not a tutorial or manual!) 
to UNIX: how to use it, history, folk features that 
are rarely documented, all the neat stuff that “ev- 
erybody knows” but that no one who is new can find 
out except by asking a guru. RM 


Computer Systems, P. UNIX Curses Ezplained. 
Berny Goodheart. Prentice Hall, 1991, xiii + 287 
pp, (P). [ISBN: 0-13-931957-3] The UNIX curses 
library is a set of routines that facilitate the writing 
of interactive screen-based programs. This volume 
provides a tutorial on the use of the curses library, 
and can also function as a reference manual for the 
library as implemented in UNIX System V.3. AO 


Computer Systems, P. C Library Reference for 
UNIX System V.4 (with ANSI C). Specialized Sys- 
tems Consultants (POB 55549, Seattle, WA 98155), 
1991, 72 pp, $8 (P). [ISBN: 0-916151-47-6] A 
pocket reference guide to the UNIX System V, Re- 
lease 4 version of the C programmer’s library. ANSI 
prototypes are used to indicate the calling conven- 
tions of the functions, and functions that are part of 
the ANSI standard library are flagged. AO 


Computer Graphics, P. Scientific Visualization 
and Graphics Simulation. Ed: Daniel Thalmann. 
Wiley, 1990, ix + 264 pp, $49.95. [ISBN: 0-471- 
92742-2] Eighteen lectures originally given at the 
Swiss Federal Institute of Technology, each written 
by one or several experts in a particular aspect of sci- 
entific visualization. The lectures present techniques 
of computer graphics, image synthesis, and anima- 
tion as well as applications in engineering, science, 
and medicine. AO 


Theory of Computation, S(17-18), P. Logic 
and Computation: Interactive Proof with Cambridge 
LCF. Lawrence C. Paulson. Tracts in Theoretical 
Comput. Sci., V. 2. Cambridge Univ Pr, 1990, xiii 
+ 302 pp, $24.95 (P). [ISBN: 0-521-39560-7] LCF 
(Logic for Computable Functions) denotes computer 
programs for reasoning about computation. The de- 
sign of Cambridge LCF (based on Edinburgh LCF) 
is explained here. Part 1 contains background in el- 
ementary logic and domain theory. Part 2 explains 
the features and details of the program. Bibliogra- 
phy and index. (First Edition, TR, June-July 1989.) 
RJA 

Artificial Intelligence, T(15-17: 1, 2), S, L. 
Knowledge and Inference. Makoto Nagao. Transl: 
Richard Weyrauch, Yasuko Kitajima. Academic Pr, 
1990, xv + 304 pp, $42.50. (ISBN: 0-12-513662- 
5] An attempt to merge artificial intelligence and 
library science is presented. Discusses the current 
state of library science and the place of artificial in- 
telligence in it. Includes search and problem solving, 
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methods of making proofs, and the use of knowledge 
in looking for a proof. Finishes with a chapter on 
expert systems. Chapter summaries, key word lists, 
exercises. Answers to exercises. List of recommended 
books by topics. Index. RJA 


Computer Science, S(17-18), P. Implementing 
Persistent Prolog: Large, Dynamic, Shared Proce- 
dures in Prolog. Robert M. Colomb. Ser. in Com- 
put. & Applic. Ellis Horwood (US Distr: Prentice 
Hall), 1990, 199 pp. [ISBN: 0-13-453531-6] Begins 
with a simple interpreter and the principle issues in 
implementing Prolog. The next chapters include per- 
sistence research, handling large procedures, persis- 
tent dynamic procedures, shared procedures, types, 
the Warren Abstract Machine, NU-Prolog, and in- 
tegration of a persistent procedure management sys- 
tem onto existing implementations. Extensive lists 
of figures and of tables. References, index. RJA 


Applications, P. Proceedings of the Second Work- 
shop on Road-Vehicle-Systems and Related Mathe- 
matics, Ed: Helmut Neunzert. ECMI, V. 4. Kluwer 
Academic, 1989, 235 pp, $79. [ISBN: 0-7923-0243-5] 
Twelve papers dealing with various aspects of road- 
vehicle systems. Mathematical descriptions of roads, 
vehicles, vehicle dynamics, road reactions, and reli- 
ability are presented. These papers would be most 
beneficial to researchers in the area and their ad- 
vanced students. SM 

Applications (Fluid Dynamics), T(18: 2), L. 
Computational Techniques for Fluid Dynamics 1: 
Fundamental and General Techniques, Second Edi- 
tion. C.A.J. Fletcher. Ser. in Computat. Physics. 
Springer-Verlag, 1991, xiii + 401 pp, $49 (P). [ISBN: 
0-387-53058-4] Intended to introduce engineering 
graduate students to techniques of computational 
fluid dynamics. Includes brief differential equations 
review, discussion of discretization techniques, and 
presentation of general techniques for diffusion and 
convection equations. Volume 2 applies these gen- 
eral techniques to specific problems. IBM disk with 
Fortran programs available from the author. JO 


Applications (Physics), P. Quantum Mechanics, 
Algebras and Distributions. D.A. Dubin, M.A. Hen- 
nings. Pitman Res. Notes in Math. Ser., V. 238. 
Longman Sci & Tech (US Distr: Wiley), 1990, 238 
pp, $38 (P). [ISBN: 0-582-06776-6] Presents a rig- 
orous formulation of quantum mechanics based on 
the algebra of observables and states. Intended to 
provide rigorous background for students and re- 
searchers who might otherwise do calculational re- 
search without mastery of the deeper theory. JO 


Reviewers 


RJA: Richard J. Allen, St. Olaf; LC: Laura Chihara, St. Olaf; 
SG: Steven Galovich, Carleton; JPH: Joan P. Hutchinson, 
Macalester; OJ: Ockle Johnson, St. Olaf; RWJ: Roger W. 
Johnson, Carleton; BL: Brian Loe, Carleton; SM: Steve McK- 
elvey, St. Olaf; RM: Richard Molnar, Macalester; JO: Jeff 
Ondich, St. Olaf; AO: Arnold Ostebee, St. Olaf; MLR: Mar- 
garet L. Reese, St. Olaf; JS: John Schue, Macalester; LAS: 
Lynn Arthur Steen, St. Olaf; MW: Martha Wallace, St. Olaf; 
PZ: Paul Zorn, St. Olaf. 


Foundation Chair 
of Mathematics Education 


Applications are invited for appointment to the Foundation Chair of Mathematics Education within the Institute 
of Education. 


The University seeks a leader in the field of mathematics education, with an outstanding research record and 
wide recognition in the field, strong qualifications in mathematics, and an excellent record as a teacher. 


The appointee will build up a centre for those wishing to develop expertise or undertake further research in 
mathematics education. 


The chair will be located within the School of Science and Mathematics Education in the Institute of Education. 
The new professor will strengthen co-operative links already established with the Department of Mathematics 
in the Faculty of Science. 


This is a continuing chair, with an initial salary of AUD$72,000 rising to AUD$76,000 in July 1992. 


Further information about the position, including details of application procedure, conditions for outside work, 
superannuation, travel and removal expenses, housing assistance and conditions of appointment, is available from 
the Registrar. All correspondence (marked “PERSONAL AND CONFIDENTIAL’) should be addressed to the Registrar, 
The University of Melbourne, Parkville, Victoria, 3052, Australia. 


Telephone enquiries: (613) 344 7527, Facsimile: (613) 344 6897 

Applications close on 31 January, 1992. 

The Council reserves the right to make no appointment or to fill the position by invitation at any stage. 

The University of Melbourne is an equal opportunity employer and has implemented a smoke-free workplace 


policy. 
THE UNIVERSITY 
OF MELBOURNE 


arm51375 


POWERFUL DISCOVERY TOOL 


BestGrapher Software allows students to take control 
of their own learning through a hands-on approach. 
BestGrapher is easy to learn and easier to use. 

¢ Function plotting and finding function zeros 

¢ Symbolic differentiation and numerical integration 

1 © Draws secants, tangents and normals 

¢ Approximates volumes of solids of revolution 

¢ Graphs polar equations and parametric equations 
e Available for IBM or Macintosh (Specify 3.5 or 5.25 disk) 

¢ Quantity pricing, LAN licenses are available BestGrapher: $73.00 (Demo disk: $10.00) 


COMPUTER LABORATORY GUIDE FOR CALCULUS 


Using BestGrapher is a rich selection of exercises and laboratory investigations designed to 

supplement any calculus textbook, and any calculus software. The chapiers contain a variety of 

challenging, non-routine computer exercises. 

¢ Problems that encourage experimentation and discovery about functions, derivatives, areas 
and other topics in calculus and precalculus 

¢ Lab projects that contain open-ended questions, promote working in small groups, and 
culminate in a written report. 

¢ Work-text format with answers to selected problems 
All prices include shipping & handling Using BestGrapher: $18.00 (Spiral bound, 150 pages) 


Venture Publishing 9 Bartlet St., Suite 55 Andover, MA 01810 508-475-3733 


Winning Women into 


Mathematics 


Patricia Clark Kenschaft, Editor 


American media often ask why women “can’t” do 
mathematics. Any answer is misleading. Better 
questions are needed, along with indications of 
how to find potential answers. 


The Committee on the Participation of Women of 
the Mathematical Association of America was 
established in 1987 “to work for full involvement of 
women in MAA activities that will encourage women 
to pursue careers in the mathematical sciences.” 
With this book, the Committee seeks to expand 
the number and effectiveness of those winning 
women into mathematics. WINNING WOMEN is 
written to inform, to empower, and to inspire. 


The Committee identifies fifty-five cultural cus- 
toms that discourage aspiring women mathema- 
ticians. They tell us how these customs can be 
changed and what can be done to recruit, retain, 
and acknowledge women in mathematics. A bib- 
liography of over 100 sources on the issues of 
women’s participation in mathematics is included, 
as well as descriptions of programs that have 
been successful in encouraging young women to 
study mathematics. The book is filled with inter- 
esting anecdotes, and contains over 50 photo- 
graphs of prominent women in mathematics. 


88 pp., 1991 , Paperbound 
ISBN 0-88385-453-8 


List: $11.00 MAA Member: $9.00 


Catalog Number: WIW 


CONTENTS 
A bibliography of over 100 sources on 
the issues of women’s participation in 
mathematics 
Fifty-five cultural patterns causing Ameri- 
can women to be underrepresented in 
mathematics 
What you personally can do 


Programs that succeed 


A history of women in mathematics- 
especially in the MAA 


A chronicle of the programs, articles, 
and suggestions of the Committee on 
the Participation of Women 

A minority woman’s viewpoint 


An overview of the statistics 


Photographs, anecdotes, cartoons 
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Jy) ORDER FROM: 


Ne 


7 Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, DC. 20036 


(FAX) (202) 265-2384 

Prepaid orders sent postage & 
handling free. Visa and Mastercard 
orders accepted. (Please give the card 
number and expiration date on credit 
card orders) We wiill bill for orders 
over $10.00. 


When was 
the last time 
a computer 
program helped 
you think about 
mathematics? 
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Lascaux Graphics (602) 544-4229 


Software and video tapes for the student, professional, and 
anyone who loves mathematics. Ask for our latest catalog. 


7601 N. Calle Sin Envidia, Suite 31 - Tucson, AZ 85718 USA 


Fig 2. fz) = exp(z) 


(800) 338-0993 


cuit THE MATHEMATICAL SYMBOL FOR QUALITY 


New from McGraw-Hill for 1992 
| CALCULUS AND ANALYTIC CALCULUS FOR BUSINESS, 


GEOMETRY ECONOMICS, AND THE 

Fifth Edition SOCIAL AND LIFE SCIENCES 

Stein/Barcellos Fifth Edition 
Hoffmann/Bradley 

DISCOVERING CALCULUS WITH 

THE HP 28 AND HP 48 ENCOUNTERS WITH CHAOS 

Smith/Minton, Softcover Gulick 


INTRODUCTION TO MATHCAD™ NUMERICAL METHODS AND ANALYSIS 


FOR SCIENTISTS AND ENGINEERS = BUChanan/Turner 
(also available with the Student STATISTICAL METHODS 


| Edition of MathCAD 2.5) IN THE BIOLOGICAL AND 
Wieder, Softcover HEALTH SCIENCES 
CALCULUS GEMS Second Edition 
Simmons, Softcover Milton 
FOUNDATIONS OF ABSTRACT FUNDAMENTALS OF 

| MATHEMATICS COMPUTING: VOLUME I 
Kurtz LOGIC, PROGRAMS, AND 

COMPUTING 


*MathCAD™ is a trademark of MathSoft, Inc. Tucker/Bradley/Cupper/Garnick 


Also Available . — 


APPLICATIONS OF DISCRETE 
MATHEMATICS 
Michaels/Rosen 


DISCRETE MATHEMATICS AND 
Its APPLICATIONS, 2/E 
Rosen 

DIFFERENTIAL EQUATIONS 
WITH APPLICATIONS AND 
HisTORICAL Notes, 2/e 
Simmons 


PARTIAL DIFFERENTIAL EQUATIONS 
AND BOUNDARY VALUE 
PROBLEMS WITH APPLICATIONS 
Pinsky 


For more information please contact | 


your McGraw-Hill representative or 
write to McGraw-Hill College Division, | 
Comp Processing and Control, P.O. Box | 
448, Hightstown, NJ 08520-0448. 


Solving The Problem Of How Students Solve Problems. 


Any professor will tell you 
that good teaching requires an 
understanding of their students’ 
thinking. At the University of 
California Los Angeles, how- 
ever, Dr. Ronald H. Stevens 
has found an especially 
innovative way to get inside the 
cranium of his second-year 
medical students. 

With his award-winning 
“IMMEX” software program, 
Dr. Stevens discovers not only 
if his students can solve Immu- 
nology problems, but also 
how information was gathered 
and processed during the 
solving. 

Here’s how it works. 
Programmed in Microsoft® 
Windows™ version 3.0, the 


easy-to-use “IMMEX” consists 
of multiple cases of immune 
defects and a set of results from 
45 laboratory tests (see figure 
below). Through a series of 
exercises and exams, students 
are asked to diagnose these 
cases by selecting the appro- 
priate tests and examining 
their results. 

Upon completion, graphi- 
cal representations are gener- 
ated by computer to show 
which tests were chosen and 
importantly, demonstrate how 
students searched for the 
solution to each problem. 

It is then possible to 
visualize the students’ thought 
process in a way that 
standard, multiple choice 
testing doesn’t allow. For 
instance, Dr. Stevens 
can learn how organ- 
ized and focused their 
knowledge is, how 
well their organization 
relates to critical con- 
cepts in Immunology, 
where major miscon- 
ceptions exist and 
whether proper 
knowledge links 
are evident. 

In turn, these tn- 
sights gained through 
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medium are added fo cells dependent on IL-2 tor 
growth and DNA synthesis measured 3 days later 
THYMIDINE INCORPORATIO 


“IMMEX” can have a sig- 
nificant impact on teaching 
methods. As Dr. Stevens 
explains, “This approach can 
lead to rapid detection and 
remediation of individual 
students’ problem solving diffi- 
culties, and can greatly per- 
sonalize the education process: 
What was Dr. Stevens’ 
approach in developing his 
award-winning software? He 
chose to program “IMMEX” on 
Zenith Data Systems laptop 
PCs. They provided him with 
all the Random Access Memory 
and portability required to work 
after hours and over weekends. 
And that made solving the 
problem of how students solve 
problems, less of a problem. 
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“IMMEX” Software Program 


ABOUT THE MASTERS OF INNOVATION COMPETITION. 


As acorporation committed to education, Zenith Data Systems encourages students and educators —like Dr. Ronald H. 
Stevens —to creatively explore the potential of computers within their fields of study. Towards that end, Zenith Data Systems has 


sponsored the MASTERS OF INNOVATION Competition for the past three years. 
To obtain an unabridged copy of his discussion paper on “IMMEXY or an application to enter the MASTERS OF 

INNOVATION IV Competition, please write to us at: Masters Of Innovation Program, Zenith Data Systems Corporation, 

P.O. Box 14513, Chicago, IL 60614-9998. 


Microsoft and Windows are trademarks of Microsoft Corporation. Copyright © 1991 Dr. Ronald H. Stevens. 
Copyright ©1991 Zenith Data Systems Corporation. 


Introducing E.Z. Math, E.Z. Algebra and E.Z. Arithmetic for the HP 48SX 


E.Z. Math, E.Z. Algebra and E.Z. Arithmetic are programs for the Hewlett Packard 48SX calculator conceived, written 
and programmed by Raymond La Barbera and the E.%. Software Company. Each program comes on a 128K plug-in ROM 
card accompanied by an easy-to-understand, well-written, detailed manual loaded with lots of specific examples and _is 
designed for use by students, teachers. parents and business people. Each program features an easy-to-use, logically 
organized, user-friendly interface which enables those who consider themselves to be calculator and computer illiterates, 
as well as those who don’t like to read manuals, to have full access to all program features quickly and easily. Since the 
HP 48SX is essentially an impressive, s arp-looking, 8 ounce pocket computer, students are easily motivated to take it 
along with them to study, practice, drill and master math in a study hall, on a train or bus, in a car, on line, on vacation, 
on a break—in short, for self-study at any time and in any place. 


What Can Be Done With E.Z. Math 


E.Z. Math effectively solves problems involving graphs, numbers, loans and savings. With E.Z. Math, anyone can: 
¢ Master the entire high school and college graphing curriculum, from algebra to calculus, with 188 families of equations, inequal- 
ities, functions, and systems, all arranged in an easy-to-use, user-friendly system of menus to make graphic analysis a snap! 
e Get extensive help with calculations involving fractions, whole numbers, complex numbers and number sequences. 
¢ Easily do savings and loan calculations and generate complete amortization tables. 
¢ Learn many basic concepts including those involving sets, variables, graphing, solving, numbers, loans and savings. 


What Can Be Done With E.Z. Algebra 


B.Z. Algebra is a comprehensive ninth grade high school basic algebra course as well as a high school and college remedial 
algebra course that builds a solid algebra foundation. With E.Z. Algebra, anyone can: 

¢ Learn about sets, operations, variables, relations and other concepts essential to a real understanding of algebra. 

¢ Understand the sets of natural numbers, whole numbers, integers, rational numbers and real numbers. 

« Master the meaning and properties of the operations of addition, subtraction, multiplication, division, power and root. 

¢ Do all kinds of problems involving algebraic expressions, numerical phrases, equations and inequalities. 


What Can Be Done With E.Z. Arithmetic 


E.Z. Arithmetic is a comprehensive elementary school basic arithmetic course as well as a high school and college 
remedial arithmetic course that makes solving most arithmetic problems a snap! With E.Z. Arithmetic, anyone can: 
¢ {Learn how to add, subtract, multiply, divide and order whole numbers, fractions, decimals, percents and integers. 
* Master the meaning, terminology and conversion methods for whole numbers, fractions, decimals and percents. 
¢ Drill and be graded on endlessly varied, randomly selected sets of problems involving whole numbers, fractions, decimals and 
integers, with the difficulty level, number of problems, operation and type of number user selectable. 


How To Order Copies or Get Further Information 
Each E.Z. Software program costs $130.00 ($125.00 retail, plus $5.00 shipping and handling). Take a 10% discount when 
ordering ten or more units. We accept payment by check, money order, COD, VISA, MC, AE and purchase order. If 
within 30 days you find that any E.Z. Software program fails to meet your expectations, we'll gladly take back your copy 
for a prompt, courteous refund. To order copies, either individually or bundled with HP 48SX calculators, please contact: 
SMI Corporation, 250 West New Street, Dept MM2, Kingsport, Tennessee 37660 
(800) 234-0123 or (615) 378-4821 or (615) 245-8982 (Fax). 


Cambridge University Press 


Now in paperback... 


A Course in Mathematics 
for Students of Physics 
Volumes 1 and 2 

Paul Bamberg and Shlomo 


Sternberg 


Voll: 424pp. 40649-8 Paper $27.95 
Vol2: 461pp. 40650-1 Paper $27.95 


Logic and Information 


K. Devlin 
1991 336pp. 41030-4 Hardcover about $34.50 


Exact Constants in 
Approximation Theory 


N. Korneichuk 


K. Ivanov, Translator 
Encyclopedia of Mathematics and its Applications 38 
1991 464pp. 38234-3 Hardcover $89.50 


Statistical Inference for 


Spatial Processes 


B. D. Ripley 
160 pp. 42420-8 Paper $19.95 


Twistor Geometry and 
Field Theory 


R. S. Ward and R. D. Wells, Jr. 
Cambridge Monographs on Mathematical Physics 


532 pp. 42268-X Paper $34.50 


Representations of Finite 
Groups of Lie Type 


Francois Digne and Jean Michel 

London Mathematical Society Student Texts 21 

1991 166pp. 40117-8 Hardcover $44.50 
40648-X Paper $17.95 


Available in bookstores or write: 


CAMBRIDGE UNIVERSITY PRESS 


40 West 20th Street, New York, NY 10011-4211. Call toll-free 800-872-7423. 
MasterCard/VISA accepted. Prices subject to change. 


Recursive & iterative programming ¢ Over 70 new math functions «e New manual, 500 examples & exercises 


The Joy of BERTWEZAING. 


DERIVE,— A Mathematical 
Assistant program. 

PC Magazine says it’s 

“a joy to use’ and proclaims it 
‘Editors’ Choice.” PC Week calls 
it “fast and capable.” The 
DERIVE® program Is de- 
livered with built-in 
standard equipment 

that delights 


MAGAZINE 


EDITORS’ 
CHOICE 


May 29, 1990 
Denve, Version 16 


both 
math lovers and 
math phobics. 


High performance. 


DERIVE does numeric and symbolic 
equation solving, exact and approximate 
arithmetic to thousands of digits, calculus, 
trigonometry and matrices. It displays or- 
mulas in comprehensible 2D format using 
raised exponents and built-up fractions. It 
plots beautiful 2D curves and 3D surfaces on 
monochrome or color monitors. 


Compact and sporty. 


PC Magazine calls ita “small wonder.” All it 
takes is 512K of memory and one floppy disk 
drive. DERIVE takes to the road on PC com- 
patibles, and really gets around on laptop 
and even handheld computers! 


DERIVE is a Hanacrafted 
registered trademark of Software 
Soft Warehouse, Inc. for the Mind 


Human engineering. 


You don't have to be fluent in computerese to 
use DERIVE. |n fact, it’s the friendliest and 
easiest to use of any symbolic math package 
on the road today. 
lts menu-driven 
interface 


and 
on-line help 
make it easy-- 
you Il soon be up to 
speed doing math, instead of 
trying to learn how to drive the 
software. 


Freedom to maneuver. 


DERIVE’s automated expertise releases you 
from the drudgery of hand calculations. 
You can do problems you'd never attempt 
otherwise and obtain exact symbolic solu- 
tions, in addition to approximate numerical 
solutions. 


No sticker shock. 


DERIVE’s suggested retail price is $250. And 
of course DERIVE doesn’t require an expen- 
sive computer, a math Co-processor, or even 
a hard disk drive. 


Order DERIVE through your favorite software 
dealer or mail-order house. For a list of 
dealers, write Soft Warehouse, Inc. at 3615 
Harding Avenue, Suite 505, Honolulu, Hl 
96816. Or call (808) 734-5801 after 11 a.m. 
Pacific Standard Time. 


And happy DERIVEing! 


Soft Warehouse: 


mamnnee 11 O NO LULU: HAWAII 


Foundations of Computing Series, 
Research Reports and Notes 


Basic Category Theory for 
Computer Scientists 


Benjamin C. Pierce 
128 pp ~=—s $17 95 softcover 


Categories, Types, and 
Structures 
An Introduction to Category 


Recommended for text use 
Linear Network 
Optimization 
Algorithms and Codes 
Dimitri P. Bertsekas 


This book explains important 
recent algorithms such as 
auction and relaxation, pro- 
posed by the author and others 
for the solution of these prob- 


Theory for the Working Computer lems. 


Scientist 275 pp $39.95 


Andrea Asperti and Giuseppe Longo 
325 pp $32.50 


The Art of the Metaobject 


. . Protocol 
ic Programming and | _ 
Logic Progra S Gregor Kiczales, Jim des Rivieres, and 


Nonmonotonic Reasoning Daniel Bobrow 

Proceedings of the First 335 pp $45.00 cloth, $24 95 softcover 
International Workshop 

June 22-24, 1991 Washington, D.C. 
edited by Anil Nerode, Wiktor Marek, and 
V.S. Subrahmanian 

432 pp $32 50 softcover 


To order call toll-free 1-800-356-0343 Fax orders (617) 625-6660 
MasterCard and VISA accepted 


The MIT Press 
5D Hayward Street, Cambridge, MA 02142 
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TO HELP TEACH 
MATHEMATICS EFFECTIVE 


Request a free copy of the NCTM catalog today! 
It lists over 180 titles with annotations. 


NCTM educational materials have been written by 
mathematics teachers for mathematics teachers. Each 
publication focuses on how mathematics can be taught more 
effectively. Materials are available for all grade levels and 
are helpful to practicing teachers and future teachers. 


Refer these books to your students. They will find many 
classroom-tested ideas and helpful ways to improve the 
teaching and learning of mathematics. 


If you would like multiple copies of the catalog, let us know. 
National Council of Teachers of Mathematics 

1906 Association Drive, Reston, VA 22091, Attn: Dept. P-MAA 
Tel.(703) 620-9840/ext. 128; fax (703) 476-2970 
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VisSim is the powerful mathematical 
modeling software tool for the 1990s. 


VisSim introduces an unrivaled visual 
programming technique for high perfor- 
mance scientific analysis. 


VisSim numerically solves non-linear 
differential equations for a multitude of 
engineering and scientific disciplines. 


Visual Solutions, Inc. 

487 Groton Road 

Westford, MA 01886 

To order, call (508) 692-5499 
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VisSim sells for thousands less than its 
nearest competitor and yet has 65 block 
functions and a choice of 6 integration 
methods. 


VisSim/RT add-on does data acquisition 
and control in real time. 


VisSim plot options include: XY, Time 
History, Log, Log-Log, Autoscaling, Color Coding, 
Over Plotting, Grid Lines, Labeling, Geometric 
Markers. 


VisSim is the mathematical modeling 
software available under Windows 3.0 
for only $895. Demos $25. 


Visual 
Solutions 


VisSim is a trademark of Visual Solutions Inc. 
Windows is a registered trademark of Microsoft Corporation 


Ross Honsberger is the author of seven 
books in the Dolciani Mathematical Expo- 
sition series, each of which presents prob- 
lems from algebra, arithmetic, number the- 
ory, probability, and geometry, and provides 
ingenious solutions and/or intriguing results. 
His most recent addition to the series is 
MORE MATHEMATICAL MORSELS which 
is a continuation of the earlier Mathematical 
Morsels volume published in 1979. Now is 
your opportunity to own both of these ex- 
cellent books. The problems presented are 
meant to be enjoyed, rather than instruct, al- 
though instruction is almost always the au- 
tomatic by-product. 


Honsberger says in the Preface of his first 
volume of morsels, “Mathematics abounds 
in bright ideas. No matter how long and 
hard one pursues her, mathematics never 
seems to run out of exciting surprises. And 
by no means are these gems to be found 
only in difficult work at an advanced level. 
All kinds of simple notions are full of ingenu- 
ity. The present volume discusses scores of 
elementary problems and contains dozens 
of marvellous ideas." 


All of the problems are accessible to any- 
one with a knowledge of freshman mathe- 
matics. Buy both of these excellent books 
today, and share them with your students. 


This is an excellent text...lt exposes 
gifted undergraduate students to a set 
of elementary problems...whose proofs 
are ingenious, usually containing beau- 
tiful ideas...Its lucid expository style 
and stimulating mathematical content 
make its reading a rewarding experi- 
ence for mathematicians at all levels. 
(about Mathematical Morsels) 


M.S. Cheema in Mathematical Reviews 
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..@ven the more advanced theoretician 
will find many of the problems fascinat- 


ing... 
D.T. Swift-Hook in Physics Education 
(about Mathematical Morsels) 


MORE MATHEMATICAL MORSELS 


Ross Honsberger 


315 pp., Paperbound, 1990, 
ISBN 0-88385-313-2 


List: $16.00 MAA Member: $13.50 
Catalog Number DOL-10 


MATHEMATICAL MORSELS 


Ross Honsberger 


249 pp., Hardbound, 1979 
ISBN 0-88385-303-5 


List: $ 28.00 MAA Member: $ 21.00 
Catalog Number DOL-03 


Both volumes purchased now 
List: $40.00 MAA Member: $29.00 
Catalog Number DOL-310 


ORDER FROM 


iD Mathematical Association of America 


1529 Eighteenth Street, N.W. 
Washington, D. C. 20036 
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Algebraic Characterization of Compact Abelian Groups, 
M. V. Deshpande and S. R. Ghorpade, 235 


An Analytic Technique to prove Borel’s Strong Law of 
Large Numbers, Liu Wen, 146 


An Analytical Description of Some Simple Cases of 
Chaotic Behaviour, James V. Whittaker, 489 


Another Elementary Approach to the Jordan Form, J. I. 
Hall, 336 


Anthropomorphic Polygons, Godfried Toussaint, 31 


Arcs with Positive Measure and a Space-Filling Curve, 
Timothy Lance and E. Thomas, 124 


Borromean Circles are Impossible, Bernt Lindstr6ém and 
H.-O. Zetterstrom, 340 

Box-Spline Tilings, Carl de Boor and K. Hollig, 793 

Branching Processes That Grow Faster Than Binary 
Splitting, KF M. Dekking, 728 

Carlyle Circles and the Lemoine Simplicity of Polygon 
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The Cauchy Problem for the Wave Equation with Dis- 
tribution Data: an Elementary Approach, Calvin H. 
Wilcox, 401 
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Chalice, 255 
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Derangements, Permanents, and Christmas Presents, 
Stephen G. Penrice, 617 


The Derivative 4 la Carathéodory, Stephen Kuhn, 40 


Differentiation of Approximately Specified Functions, C. 
W. Groetsch, 847 


Disks and Shells Revisited, Walter Carlip, 154 


Distance Functions and ‘Topologies, Fred Galvin and S. 
D. Shore, 620 


A Dynamic Puzzle, Moshe Rosenfeld, 22 


An Early Introduction to Dynamics, Marysia T: Weiss, 
635 


A Very Elementary proof of the Malgrange—Ehrenpreis 
Theorem, Jean-Pierre Rosay, 518 


On the Equality L1() = Lj,.(), Alfonso Villani, 826 


The Exploding Exponential and Other Chaotic Bursts in 
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Durkin, 217 
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Hans Liebeck and A. Osborne, 131 
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Hypocycloids, Continued Fractions, and Distribution 
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From Wythoff’s Nim to Chebyshev’s Inequality 


KENNETH B. STOLARSKY,* University of Illinois, Urbana 


KENNETH B. STOLARSKY received his undergraduate education at Caltech, 
and obtained a Ph.D. from the University of Wisconsin (Madison) in 1968 
under the supervision of Marvin I. Knopp. He did one year of postdoctoral 
work at the Institute for Advanced Study (Princeton), and has been (aside 
from sabbaticals) at the University of Illinois (Champaign-Urbana) since 
then, where he is now professor of mathematics. In 1979, 1980 and 1981 he 
was a member of the national Putnam Contest Committee; for the past five 
years he has served as an Associate Editor of this Monthly. His research 
interests include number theory, combinatorics, extremal problems of ge- 
ometry, and complex analysis (with emphasis on polynomials). 


1. Introduction. This is the story of how the study of a combinatorial game, 
Wythoff’s Nim, led to certain inequalities for real integrals. Since J. H. Conway [C; 
see also B-C-G] has shown that the construction of the real numbers (and rather 
more) can be based on a collection of combinatorial games, this is not totally 
implausible. However, our point of view is of a different nature than Conway’s. We 
consider only Wythoff’s Nim, and from a purely algebraic point of view. This leads 
to questions about the associativity of certain multiplications. A probabilistic 
investigation then suggests inequalities involving certain multiple integrals. We 
first prove these in a spirit consonant with their original context, but then discover 
that Chebyshev’s (Tchebychef’s) inequality for similarly ordered sequences can be 
taken as the underlying principle. Our journey from associative multiplications for 
the integers to inequalities for integrals ends in a curious way. The inequalities 
themselves suggest an uncountable collection of associative multiplications for the 
positive real numbers (§9). 

Wythoff’s Nim (or simply Wythoff’s game) is described in many references, for 
example [Ba-C, B, B-C-G, Co, Cox, F, F-B, G, Wy]; see also many of the references 
cited in [S1]. The initial positions of the game are pairs of nonnegative integers 
(P,Q). Play alternates between two players. At each turn the player’s move 
consists of replacing (P, Q) by one of 


(P—M,Q),(P,Q-M), or (P—M,Q-M) 


where (i) M > 1 is an integer and (ii) the new pair is also a pair of nonnegative 
integers. The first player unable to move loses. Wythoff proved [Wy] that if neither 
player blunders, the player about to move will win if and only if he can move to a 
pair (P’, Q’) such that either it or its transpose (Q’, P’) has the form 


(a(n), b(n)) = ([nd], [n¢?]). 


Here [u] denotes the greatest integer in u and # = 1.618... is the “golden 
mean,” i.e. the largest root of x” — x — 1 = 0. Such pairs are called (Wythoff) 
winning pairs. The first six are (0, 0), (1,2), (3, 5), (4, 7), (6, 10), and (8, 13). 


*Supported by ONR Grant N00014-85-K-0368 
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2. Certain inequalities for integrals. Our principal inequality can be stated very 
simply. 


THEOREM. If 0 < g(u) < 1 and g(u) is nonincreasing on [0,1], then for positive 
n and m we have - 


figure”) du > [is(u”) du ['g(ul””) du. (2.1) 
0 0 0 


However it seems easier to work with (and “get a feeling for’) a rather less concise 
formulation of this Theorem. If g(u) is integrable on [0,1] and n > 0, we define 
the quotient O = Q(g;n) by 
'g(u)u"! du 
Q = Q(g;n) = ——_——_ (2.2) 
i u"~ "du 
0 


whenever the ratio on the right exists. This is a weighted average of g. 


THeEoremM. If 0 < g(u) < 1 and g(u) is nonincreasing on [0,1], then for positive 
n and m we have 


O(g;n+m) = Q(g;n)Q(g;m). (2.3) 


Equality holds for commensurable m and n if and only if there is a real c with 
0O<c <1 such that 


1 O<u<c 
(uw) = {4 c<us<l. 24) 

The original formulation is easily recovered from (2.3) by three simple changes 
of variable. It seems natural to conjecture that the above equality criterion holds 
for all positive real m and n. The author is embarrassed by his inability to prove 
this; see, however, the Proposition in §7. 

For g nondecreasing a similar Theorem is valid with u”~' replaced by (1 — u)”7! 
and the functional values 0 and 1 interchanged in the criterion for equality. 

Our main point is not some possible novelty in (2.1) but rather the manner in 
which it arose (see §3), was proved (see §4 and §5), and served to suggest other 
inequalities (see §6 and §8). We shall give a fairly straightforward proof of it based 
on Chebyshev’s inequality in §7. 


3. The semigroup of Wythoff’s game. It was recently discovered [P-S2] that the 
winning positions of Wythoff’s game form a multiplicative semigroup. Associate to 
each winning pair (a(n), b(n)) the real number 


W(n) = b(n) — da(n). (3.1) 
This is a one-to-one map of the pairs into [0,1]. These numbers are closed under 
multiplication! In fact, 


W(n) = {nd}/¢, and W(n)W(m) = W(n*m) (3.2) 
where {x} denotes the fractional part of x and 
nem=nm +a(n)a(m). (3.3) 


The first statement in @.2) is easily verified. For the proof of the second assertion 
we use the following Lemma (the “K-L-M formula’’). 
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LEMMA. For any integers K, L, M and n we have 


a( Ka(n) + Ln + M) = Kb(n) + La(n) + |M¢d+ (Ld ~ Ke . (3.4) 


Proof. Let m be a variable integer. Then (here and elsewhere the notation 
z: = means that the new symbol nearest the : is being defined) 


A:= (Ka(n) + Ln + M)o — (Kb(n) + La(n) + m) 
= L(n@ — a(n)) — K(b(n) — a(n) d) + M6 — m 
{nd} 


= L{nd} A +Mod—-—m 


{nd} 


- (Lo — K) +Mod| -—-m=:R-mMm. 


Now choose m = [R]. Then 0 < A < 1 and the Lemma follows. 
Now, from the K-L-M formula and the fact that b(m) = a(m) + m, we have 


{nd} 


a(a(m)a(n) + mn) = a(m)b(n) + ma(n) + |(md — a(m)) — 


= (b(m) — m)b(n) + m(b(n) — 1) 
= b(m)b(n) — mn, 


SO 
b(a(m)a(n) + mn) = b(m)b(n) — mn + mn + a(m)a(n) 
and 
—a(a(m)a(n) + mn) = mn — b(m)Dd(n) 
= —na(m) — ma(n) — a(n)a(m) 
= a(n)a(m) — a(n)b(m) — a(m)b(n). 
Thus 


W(n)W(m) = b(m)b(n) + a(m)a(n) + 6(a(m)a(n) 


—a(n)b(m) — a(m)b(n)) 
= b(n*m) — da(n*m) 


and multiplicative closure is established. 
Since ordinary multiplication is associative, we have 


(W(n)W(m))W(k) = W(n)(W(m)W(k)) (3.5) 


and hence the *-multiplication is associative. Do some similar multiplication rules 
also have the associativity property? Indeed, a considerable number of such rules 
have now been found [Ar, K, P-S2, P-S3]. One of the most general ways (so far) of 
finding such rules has been to study [P-S2] the set S of all integral affine 
transformations that map the set of all Wythoff pairs (a(n), b(n)) back into itself. 
This set S is a two-parameter noncommutative semigroup isomorphic to the set of 
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all pairs of integers (/j, /) with the associative (yes!) multiplication rule 
(j,1) @ (km) = (ty(k), ty(m)) (3.6) 
where 
a(n) = (i — a(n) + (c(i) - e(1))n +i (3.7) 


and c(j) = a(j) —/ for all j. Quite a few of the associative rules so far discovered 
correspond to subsemigroups of S. A. Fraenkel, H. Porta, and the author (unpub- 
lished work) have also found a rich variety of associative laws connected with 
quadratic irrationalities other than the golden mean @. 

One seemingly slight variation of n * m not investigated in these references is 


n#m = 4nm + a(2n)a(2m). (3.8) 
It is tempting to say this is associative since we are simply using 27 as a “code” for 
n. A closer look, however, reveals no strong reason for the associative law to hold. 


Several initial attempts to produce a counterexample by hand (with n and m 
chosen at random) failed, and my first counterexample, 


(8#42) #110 # 8#(42#110), (3.9) 


was obtained by choosing the numbers ‘“‘maliciously”. The difficulty of finding a 
counterexample calls for an explanation. It turns out that if (k, m,n) is a lattice 
point chosen “at random” from the positive octant, then the probability of 


(k#m)#n = k#(m#n) (3.10) 
exceeds 96%! (It is clear what “at random” means if (k, m,n) is chosen from 
{(k,,k,k3):0 <k; < N,1 <i < 3,and Kk, is integral}. (3.11) 


We define the ‘at random” probability to be the limit as N > ~.) 
The key to computing the probability is the following identity (a consequence of 
the K-L-M formula): 


k#(m#n) = 16kmn + 4(ka(2m)a(2n) + ma(2n)a(2k) + na(2k)a(2m)) 
+ 2a(2k)a(2m)a(2n) + a(2k)(267-'{2mg}{2nd}], (3.12) 


where the [ ] denotes “greatest integer in”. We also need the very well-known 
result [Cas, K-N] that if a is irrational then the sequence {na}, n = 1,2,3,..., is 
uniformly distributed in [0, 1]. 


4. A probability integral. By the last identity of §3, the associativity statement 
k#(m#n) = (k#m)#n (4.1) 


is equivalent to 
2 2 
a(2k)) = (2m6)(2n8)] - a(2m)| = (2k6)(2n6) | (4.2) 


If m = k (an event of probability zero) this is clearly true. Note that the greatest 
integer terms can only be 0 or 1. Set x = {2k}, y = {2m}, and z = {2nd}. It is 
easy to see that equality holds for m # k if and only if 


xz<o/2 and yz<@¢/2. (4.3) 
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The ‘individual’ probabilities are easy to calculate, but these events are not 
independent. In fact, the occurrence of one of them clearly makes the other more 
probable. This observation is enough to establish that the probability exceeds 
961333... since 


1, b _ fe/2 1 
fimin(1, =~) ax = f det foot 
6 6 
=—~—-—-In—= 
2 2 2 


= .9804759... = v.961333... 


To find the exact probability we shall need to calculate a multiple integral. 


LemMa. For 0 < b < 1 we have 


1 b 
[T= 1,—, > dx ax, 
=f fin eee ca 
nb — b” 
=\n-1’ "** (4.4) 


b—binb, n=1. 


Proof. This is easy for n = 1. The idea for n = 2 is to partition the unit cube of 
E” into the n! simplices given by the n! different possible orderings of the 
variables x,,...,x,- On each of these the integral of the integrand will have the 
same value, namely 


ine Le [- *min( 1, a dk, 


tm int, | a 
=| tm opi min x, n 
—1 n-2 n 
b x" 1 x 1 nb—b 
= | — —dx+b] —- = — —— , 4.5 
| Gopi ® | mo! ni n-1 (4.5) 


and the result follows. 
We can now determine the probability of equality in (4.1). Since 2¢ is irrational, 
the numbers x, y and z are random in [0, 1]. Thus 


Prob[0 <x <1,0<y<1,0<z<1,z<@¢/2x,andz<¢/2y| 
_ Lf [rr de decdy 
0/0 “0 
p p 
=], = —|2 — —] = .963525.... 4.6 
»- $[2- 5] (4.6) 
It’s rather close to the rough estimate! At this point the study of the integrals can 


be detached from the question that called them into being. If s is real the same 
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method of proof yields 


1+sb"(n—s)™* n#s,5 <0, 
nb* — sb” 
~ \ ——— n#S8,s > 0, (4.7) 
n—-Ss 
b* — b* In b* n=S. 
That (for s a positive integer) the integral is symmetrical in n and s is far from 
obvious (at least to the author). 
5. Inequalities for integrals. The italicized observation of §4 says that 
P(A MB) = P(A)P(BI|A) = P(A)P(B) (5.1) 


for certain special events A and B. We now drop the language of probability, but 
the above is the real motivation for what follows. 
We can now prove the Theorem. Assume that n and m are positive integers. 
Let N=n+™m and let ¢,,...,¢, be numbers in [0, 1]. Clearly 
Min(t,,-.-slmstmeis +rotman) = Min(t,,...,¢,,) ° MiN(t,415-+->tman)- (5-2) 


Now let g be a nonincreasing function that maps [0, 1] into itself. (For example, if 


s > 0 then 
. b ° 
g=g,(x) = in 1. | | 
is such a function). Then 


foo [i min(e(21),.---5 8(2w)) dey + dxy 


= NIP [Yo [eC ey) ae + dey 


N! 
wail, 


O(g;N) =Q(g3n+m). (5.3) 
The inequality now follows: simply minorize the integrand of (5.3) by the product 
on the right of (5.2). For the case of equality, first assume that g is neither 0 nor 1 
on some open interval J. Then (5.2) holds with strict inequality for (t,,...,t,) in 
Ix ++: XI (WN times), so there is strict inequality in (2.3). Thus, since g is 
nonincreasing, equality implies that there is some c in [0,1] such that (except 
perhaps at c itself) g is 1 to the left of c and 0 to the right. Thus we have (2.4) and 
equality since 


1 
g(xX\V)xn | dxy = N f'g(x)x¥} dx 
0 


cutm =c"c™. 


This proves the theorem for integral values of m and n. Now let D be positive and 
let ¢ be a real variable varying between 0 and 1. The change of variable u = t'/? 
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yields 
nf'g(uyut!du = — ['g(t'/?)e-1"/? dt. 
0 D “0 


As g(u) varies over all nonincreasing functions on [0,1], so does g(t'””). Hence 
the inequality is true for n and m commensurable, and by taking limits it is easy to 
see that it is true for all positive real numbers n and m. The inverse change of 
variable t = u? enables us to show that the criterion for equality in the case where 
n and m are commensurable is the same as that in the integral case. This proves 
the Theorem. 


6. Fundamental inequalities. Depending upon the ranges of x and gq there are 
various “fundamental” inequalities between x and its “q-analogue’’, namely 

q*—1 

a_i (6.1) 
The q-analogue is so called because its limit as g — 1 is x. It is stressed in [B-B, 
H-L-P] that these fundamental inequalities can be used to derive the most 
important inequalities of classical analysis, those of Hdlder and Minkowski, and 
the inequality between the arithmetic and geometric means. (These g-analogue 
inequalities can also be interpreted from a convection-diffusion point of view [S2].) 
Observe that if 0 < g < 1 (resp. 1 < q) the inequality between the arithmetic and 
geometric means of g and 1, with weights x and 1 — x, 0 <x < 1, namely 


x4 


q** <xq+(1-x)l1, (6.2) 

iS 
x <x, (resp. x, <x). (6.3) 
The obvious fact that the multiple integral of (4.7) is at most 1 yields for0 <n <s 


(resp. 0 < s <n) the inequality 


n b” — I 


—< 
K) bi —1 


(resp. >). (6.4) 


Upon making the substitutions b > q'/* and n/s > x we obtain, with s a positive 
integer, the inequality (6.3) for rational x. The inequality for x real is a limiting 
case. Note that among these results are inequalities valid for x > 1. We omit a 
precise analysis of the cases of equality. 
Now assume 0 < b < 1 and apply the Theorem (see (4.7)) to the function 
nb* — sb” 


f(b, 83) = ~——— (6.5) 


Ss 
We obtain the 


CorROLLARY. For 0 <b < 1, s => 0, and positive integers n and m, we have 
f(b,s;n+m) = f(b,s;n)f(b,5;m). (6.6) 


It seems strange that such a simple function could have a “previously undiscov- 
ered” property, but the above is new, at least to the author. A proof without 
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integrals is called for. Some manipulation shows it is equivalent to 
F(b) =(m—s)(n—s)(n +m) — nmsb"*"~5 — nm(n + m — s)b* 
+ ns(n +m—s)b™ + ms(n +m —s)b"=0. (6.7) 
For b = 0 this is true if also n + m > 0, while for b = 1 we have equality. Hence 


it suffices to show F(bD) is decreasing. This is true if also nm(n + m — s) > 0 since 
then 


F'(b) = -—nm(n+m-—s)bs"'(1— b")(1 — b™ 7S) < 0. (6.8) 


In particular, the Corollary remains true if n and m are both positive real numbers 
that exceed s. 

The Corollary suggests some second order improvements (similar in spirit to 
those given in [S2]) in the inequalities between x and x, for certain ranges of the 
parameters, but we shall not pursue this here. 


7. Chebyshev’s inequality. Consider the step function 
g(u) = g;, C;SU< C44 (7.1) 


where 0 <1 <N, Cy = 0, Cy41 = 1, 6; < C41, Qo = 1, By = 0, and g,,, <28;. A 
simple calculation shows that 


1 _ 
pf g(u)u * du (89 — 8)ef + °° +(8y_-1 — Bn) CN 


N 
= » X;cP (7.2) 
i=1 


where A, > 0 and LA; = go — 8y = 1. Also, for ¢ > 0 and any bounded decreasing 
function f defined on [0, 1] there is clearly such a step function g with 


[lf slau <¢, (7.3) 


so it suffices to prove our inequality for such g. Now for n,m > 0 we have by 
direct expansion 


oS 
IA 


Me 


AiAj(c7 — 7 (er — c7") 


N 
u 
j=li 


1 


N N N 
2 Apc? *™ —2Y Aj? Vi A,c™ (7.4) 
i=1 i=1 j=l 


and hence by (7.2) the validity of our inequality (2.3) for step functions. Now (7.4) 
is almost Chebyshev’s inequality; if we replace the finite sequences {c”} and {c?"} 
by sequences {a,} and {b,} such that 


(a; — a;)(b; — b;) = 0 (7.5) 
for all i and j, and expand the corresponding double sum in (7.4), we obtain 
dA; Ab < Li Aja;b;. (7.6) 


This is Chebyshev’s inequality [H-L-P, pp. 43-44]. Clearly there is strict inequality 
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unless either all the a; are equal, or all the b, are equal. In our own special case 
this can only happen when N = 1. Thus we have the 


Proposition. If g is a step function, we have equality in (2.3) if and only if (2.4) 
holds. 


Remarks. Hardy-Littlewood-Polya omit any discussion of equality in their for- 
mulation of the integral analogue of Chebyshev’s inequality [H-L-P, Theorem 236, 
p. 168]. 

Given the statement of the Theorem, it seems natural to study the product of 
the “n-integral’ and the “m-integral” when T =n + m is held constant. To do 
this it is natural to introduce the function 


F(t) = [os(uyuT?' du f'g(u)ul/?~' du. (7.7) 


Our theorem (together with the Cauchy-Schwarz inequality) becomes 


1 2 1 1 ] 
[/ g(u)u™? du < F(t) < | 7 + 7 [ e(wul* du; 
0 0 
—~+t+1 x~-t+1 
2 2 
(7.8) 
for example (¢ = 0) 
['g(wut? du < ° [os(wut* du (7.9) 
0 — T+2Jo 


Note that the upper bound in (7.8) is a convex function of t. In fact (we omit the 
proof) it can be shown that F(t) itself is convex for |t| < T/2. 


8. A Gamma function inequality. It seems appropriate to indicate what our 
inequality says about Beta and Gamma functions, especially since the Beta 
function integral appears in the next section. For an exceptional account of the 
role played by the Beta function in the general theory of special functions, see 
[Ca]. 

Let g be a nondecreasing function, so 

F(n +m) > F(n)F(m) (8.1) 


where 
F(n) =nfig(x)(1 x)" de (8.2) 


(recall the second paragraph after (2.4)). 
If we take g(x) = x*~' where a > 1, then F(n)/n becomes B(a, n), the Beta 

function of a and n. Since 

P(x)T(y) 


B(x, y) = [T(x +y) 


(8.3) 


where I(x) is the gamma function, we have from (8.1) that 


(m+n)B(a,m +n) =nB(a,n)-mB(a,m). (8.4) 
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Then from xI(x) = ['(x + 1) we have that 
T(m+nt+a)l(a) (m+n 1)T(1) 
T(m+a)l(n +a) ~ T(m+10(n + 1) 
for a= 


> 1. This suggests that in fact p(a) is decreasing in a. Now Gauss’ ,F, 
summation states that 


p(a):= (8.5) 


F(a.bierl ' abil a(a+1)b(b+1) 1 

,b3c;1) =14+ —— + ——_ — 

F(a, B55) c 1! c(c + 1) 2! 

[(c)T(c —a —b 

xO) ) (8.5 
[(c — a)T(c — Bb) 

for a,b,c complex, c not a negative integer, and (in case neither a nor D is a 

nonpositive integer) 


Re(c — a — b) > 0; 
see, e.g. [Ca, W-W, pp. 281-282]. Thus 
[(m+n+a)I(a) 
oo =F (n,m rn t+ im +51) 
[(n + a)T(m +a) 


and the infinite series makes it clear that this is decreasing in a. 


9. Associative multiplications for real numbers. If we could ignore the greatest 
integer brackets implicit in the definition of n * m in (3.3), this product would be 
(hd + 2)nm. It is, of course, obvious that any multiplication rule of the form 


n*xm = Knm, (9.1) 


with K a real number, is associative. Now turn to the function g of our Theorem, 
and assume for simplicity that it is a continuous nonincreasing function such that 
2(0) = 1 and g(1) = 0. 


Definition. For positive n and m define 
nxem=n*,m=N (9.2) 


where 
[is(u/) du = fis”) du ['g(ul/") du. (9.3) 
0 0 0 


The left side of (9.3) is strictly decreasing in N, and tends to 0 as N > ~. By 
our Theorem, we also know that the left side of (9.3) exceeds the right side when 
N=n+m. Hence the function N = N(n, m) is well defined, and 


N>n+m. (9.4) 
The multiplication * is associative, since any multiplication rule of the form 
nm = f~'( f(n)f(m)) (9.5) 


is associative. To get a feeling for (9.2), let’s consider some particular examples. 
If g(x) = 1 — x° then (9.3) becomes 


= . (9.6) 
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and 


nm 
N=n+m+—. (9.7) 
C 


As c > © we recover the usual (associative) law of addition while for c — 0 the 
n +m becomes negligible and we have essentially (9.1). In fact the multiplication 
rule (9.7) is quite familiar to algebraists who work with rings [Me, p. 110]. Now 
examine 


g(x) =(1- x), k an integer. (9.8) 
By (8.2), (8.3), and a simple change of variable we have 
Lan 7 k! 
J, (4 yd = Tay (ne) (9.9) 


The corresponding multiplication rule may be described so that it resembles (9.6), 
to which it reduces when c = k = 1: the geometric mean of N+ 1,N42,..., 
N +k shall equal the geometric mean of (n + 1)(m + 1),(m + 2)(m + 2),..., 
(n+k)\(m +k). 

It is known that in some sense (9.5) describes all possible sensible associative 
multiplications of the real numbers. See [Ac] and the references therein for the 
details. We conclude by posing a presumably new open question in the realm of 
nonlinear integral equations. If a given function of two variables 


N=N(n,m)=n*m (9.10) 


yields a multiplication that is commutative, associative, and “smooth”, and N > 
n + m, when and how can (9.3) be solved for g? 
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1. Introduction. Digital topology is a term that has arisen for the study of 
geometric and topological properties of digital images. In our increasingly digital 
technology, such questions arise in a wide range of diverse applications, including 
computer graphics, computer tomography, pattern analysis and robotic design, to 
mention just a few of the areas of current interest. Despite the name, the 
traditional approach to this study has been graph-theoretic rather than topological. 
Although our approach does not abandon graph-theoretical tools, we seek to 
convince the reader that it is both possible and natural to proceed topologically. 
The two theories are coextensive for finite spaces (see §2), but the topological 
approach leads naturally to constructions which, although possible in graph theory, 
had not previously been considered (see §6 and 7). 

To develop a theory for studying images, digital or otherwise, it is natural to 
start with the Jordan curve theorem, which states that a simple closed curve 
separates the real plane R X R into exactly two connected components. We want 
an analog for the digital plane Z x Z. The first approach to this, by Azriel 
Rosenfeld in the late 1960’s (see [22, 23] and references there), used the graph-the- 
oretic notion of adjacency to define connectedness. Before giving the definitions 
necessary for a precise description of Rosenfeld’s theorem (for which, by the way, 
we give a topological proof in §8), let us outline the topological approach. 

In the topological approach to this (§3), one defines a connected topology on Z 
regarded as an ordered set. (The usual interval topology on Z is, of course, discrete 
and not connected.) This induces a product topology on Z X Z (and on Z X Z X Z, 
when we come to consider the 3 dimensional theorem). In the usual (real) theory 
one defines a path (resp. arc) as a continuous function (homeomorphism) on a 
closed real interval; analogously, define digital path and digital arc by replacing 
the real interval by an interval subset of Z (i.e., a set of the form {n,n + 1,..., 
n + k} with this new topology). Our Jordan curve theorem in Z X Z (proved in §5, 
after the necessary background has been developed) is then an exact analog of the 
usual theorem for R X R, where “connected” means the usual definition of 
connected with respect to the product topology. It is due to Khalimsky [4, 5]; the 
proof given here is from [7]. 

This approach also yields a Jordan surface theorem in Z° (described in §6). Our 
proof of the Jordan curve theorem is purely digital (i.e., self-contained), but the 
proof of the 3 dimensional theorem invokes the classical Jordan-Brouwer theorem 
in R°. 

These topologies are not homogeneous,’ but that turns out to be an advantage, 
as we will see in §7, where the following questions about boundaries in digital 
pictures are considered: 


1. How do more complicated curves separate the digital plane into connected 
components? 

2. Conversely, if we are given only a partition of the plane into connected sets, 
when are the boundary sets curves? And, if they are curves, how can we 
recover them? 

3. How can we represent this on a computer screen? 


‘A topology on X is called homogeneous if for each x, y © X there is a homeomorphism that maps 
xX to y. 
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We will wait until §7 to answer these questions, after the relevant definitions have 
been introduced. Let us just mention here that the answer to the last question also 
settles the non-homogeneity problem: we display a homogeneous subspace which 
—like visual reality—does not include the bounding curves between adjacent 
regions. 

Returning now to Rosenfeld’s graph-theoretic approach, we need some defini- 
tions in order to state his theorem. The 4-neighbors of (x, y) € Z X Z are the four 
points (x + 1, y) and (x, y + 1); the 8-neighbors are the eight points consisting of 
the 4-neighbors and the points (x + 1, y + 1). If k = 4 or 8, a k-path is a finite 
sequence Pp,...,p, in ZX Z such that p,_, is a k-neighbor of p,, 1 <i<n.A 
set S is k-connected if any two of its points can be joined by a k-path in S. A 
Jordan k-curve is a finite k-connected set which contains exactly two k-neighbors 
for each of its points. As usual, a component is a maximal connected set. 

The examples in FiGuRE 1 show that neither 4-connectedness nor 8-connected- 
ness give a Jordan curve theorem. The first is a Jordan 8-curve whose complement 
is 8-connected; the second is a Jordan 4-curve which is not a Jordan 8-curve and 
whose complement has three 4-connected components. However, by combining 
these two notions of connectedness, Rosenfeld proved (see [22] and references 
there): 


THEOREM 1.1 (Rosenfeld). A Jordan k-curve with at least five points separates 
Z xX Z into exactly two k'-components where {k, k'} = {4, 8}. 


Fic. 1. A Jordan 8-curve and 4-curve. 


Note that the definition of connectedness changes when the curve changes. 

Since finite topological spaces behave very differently from the usual ones, we 
begin (§2) by developing techniques for such spaces in some detail. We are 
primarily interested in finite T,-spaces, but we give some indication of where the 
ideas are valid more generally. 

This paper is more of an introduction than a survey (for surveys see Rosenfeld 
[22] and Kong and Rosenfeld [13]). In [15] Kovalevsky proposed an approach 
similar to ours, but without proofs; see also [16]. Since we are interested in the 
foundations of the subject and the relationship between the topological and the 
graph-theoretical approaches, §2 and 3 contain more detail than is needed for the 
sequel. 


2. Finite topological spaces. Recall that a topology on a set X is a collection of 
subsets of xX including X and @ and closed under finite intersections and 
arbitrary unions. A set is closed if its complement is open, so that closed sets are 
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preserved by finite unions and arbitrary intersections. For x € X let N(x) denote 
the intersection of all open neighborhoods of x, which of course is not a 
neighborhood in general. 

If X is finite then open sets and closed sets behave similarly with respect to 
intersection and union and each point has a smallest open neighborhood, namely 
N(x). We are primarily concerned with finite spaces but much of this section is 
valid more generally for a class of spaces introduced by Alexandroff [1], who called 
them ‘discrete’: an Alexandroff topology is one in which every intersection of open 
sets is open, or, equivalently, in which each N(x) is open. (It should be noted that 
some recent authors, e.g., Johnstone [2], include 7, in this definition.*) We also 
consider an intermediate class of spaces between finite and Alexandroff: call a 
space locally finite if each point is in a finite open set and in a finite closed set, but 
note that this is stronger than the usual definition. 

Thus if X is an Alexandroff space we have for x € X, A CX: 


x € cl A implies x € cl{ y} for some y € A. (1) 


It follows from this that an Alexandroff topology is completely determined if one 
knows each N(x) or, alternatively, each cl{x}. What is more interesting is that, by 
(1), such a topology is determined by knowing the subspace topology on each two 
point subset. (Actually, (1) is equivalent to the Alexandroff property: a topology 
satisfies (1) iff it is Alexandroff.) Note that for x, y in any topological space we 
have: 


x €cl{y} iff ye N(x) (2) 
but this is not enough to determine the topology in spaces where N(x) is not 
necessarily open. . 

So let us look (FiGurE 2) at the possible topologies on a two point set {x, y} (by 
drawing N(x) and N(y) in each case). These are the discrete, indiscrete, and two 
copies of a topology called the Sierpinski topology. (The diagram is drawn this way 


to be consistent with the specialization order to be introduced shortly.) Thus if X 
is any T)-space with x, y © X we have: 


{x, y} is connected iff the subspace topology on {x, y} is Sierpinski. (3) 


Bo ()f 


Fic. 2. Topologies on a two point space. 


*Recall that a topological space X is called a To-space (resp., T,-space) if, whenever x and y are 
distinct points in X, x € cl{y} or (resp. and) y € cl{x}. 
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There is a correspondence between topological spaces and directed graphs, 
which can be obtained from the specialization order (introduced by Alexandroff [1] 
and discussed, for example, in [2]): For any topological space, x < y iff x € cl{y} 
defines a reflexive transitive relation. This relation is a partial order (.e., anti-sym- 
metric) iff the space the space is 7, (and then it is called the specialization order). 
Thus in any topological space, cl{y} = {x: x < y} and, by (2), N(x) = {y: x < y}. 
Also, the closed points (resp. open points) are precisely the minimal (maximal) 
elements. If a T)-space is finite, or more generally locally finite, then each point 
lies below a maximal element. Stated topologically this says that in such spaces the 
set of open points is dense in the following strong sense; we call it pointwise dense: 


U {cl{x}: {x} is open} = X. 


It is useful to represent the specialization order on X by making X a digraph 
(= directed graph) by putting an edge directed, say, from x to y whenever x < y. 
In this way each T,-topology determines a unique transitive digraph. Conversely, 
each partial ordering determines a unique Alexandroff topology. However, more 
general topologies in which N(x) need not be open are not determined by their 
specialization order. For example, on an infinite set, the discrete topology, the 
cofinite topology, and any topology between these two, all have the same special- 
ization order, namely equality. 

For finite X, however, this gives a one to one correspondence between T>-topol- 
ogies on X and partial orderings on X. If we drop the 7, axiom, it gives a one to 
one correspondence between topologies on X and transitive digraphs on X (where 
here it is understood that both x — y and y — x are possible). 

We also use the connectedness graph of a topology, an undirected graph in 
which an edge joins x and y iff {x, y} is topologically connected. This can be 
obtained from the digraph above by just forgetting the direction of the edges. In 
this context define the adjacency set W(x) of a point x as {y #x:{x, y} is 
connected}. In any topological space (don’t need J, here): 


A(x) = (N(x) U cl{x}) — {x}. (4) 


These concepts are interesting only in non-7,-spaces because: X is T, iff W(x) = 
@ for each x iff N(x) = {x} for each x iff the specialization order relation is 
equality. 

We conclude this section with two results on continuity of functions. We omit 
the easy proofs; see [19]. 


Proposition 2.1. A function defined on a T,-Alexandroff space is continuous iff 
it preserves specialization order. 


Proposition 2.2. If f is defined on X X Y and Y is locally finite, then f is jointly 
continuous iff it is separately continuous. 


3. The digital line and digital arc connectedness. The key to the topological 
construction of the digital plane is getting the right topology on Z. This definition 
was published in Russia by E. Khalimsky in 1970 [5] but not widely circulated 
outside Russia until recently. Figure 3 shows the topology (by showing each 
N(x)); observe that the points alternate being open and closed. (A more formal 
description is given in [7, §10].) It can be defined on Z or on an interval subset of 
Z. We refer to this topology, and the associated product topologies on Z”, as 
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Fic. 3. A portion of a digital line, showing the minimal! neighborhoods of each point. 


Khalimsky topologies; Marcus [18] independently constructed a topology which 
agrees with a Khalimsky topology on Z, but differs on Z’. 

There is a sense in which this topology is uniquely determined. Like the real 
line R, this space satisfies the following condition, which justifies regarding it as a 
digital line. 


DEFINITION 3.1. A connected ordered topological space (COTS) is a connected 
topological space X with this property: for any x1, X>, x, distinct points in X, there is 
an i such that x; and x, lie in different components of X — {x,}, where {i, j,k} = 
{1, 2, 3}. (There is no separation’ assumption made on X ). 


It can be shown [7, Theorem 7] that each COTS induces a total order on the 
underlying set such that: if x is not an end point, the components of X — {x} are 
{y: y > x} and {y: y < x} (call these sets U(x) and L(x) resp.)* This COTS order 
is unique up to inversion. Note that it differs from the specialization order; in the 
case of the real and digital lines the COTS order is the usual one. 

The next proposition shows that the topology given by FiGuRE 3 is unique in the 
sense that any locally finite COTS must have this form; in the finite case there is 
the question whether the first point is open or closed. In fact, every locally finite 
COTS (more generally, every Alexandroff COTS) with at least three points is 
homeomorphic to an interval subset of Z. 


Proposition 3.2 [7, Prop. 8]. In a COTS with at least three points,° each point is 
either open or closed. A point x is closed (resp. open) iff U(x) and L(x) are open 
(closed). If two points are order-adjacent then one of them is open and the other 
closed. If x has a successor, but no immediate successor, then x is closed. 


Proof. If a COTS has at least three points, then it has a point which is not an 
end point; we show that any such x is open or closed (a separate argument is 
required for end points). If A, B separate X — {x} then 


clA=A or dA=AU {x} 
and 


lB=B or clB=BU {x}. 


>We remind the reader that the term separation has two different standard meanings: separation 
axioms (7), 7,, etc.) and separation of a set which is not connected (see, for example, [3]). 

‘Conversely, any connected topological space with a total ordering which is related to the topology 
as in the previous sentence is a COTS. 

>From here on the usual order notations will be used for the COTS ordering. There should be no 
confusion with the specialization ordering because the latter will appear only in the digraph and 
connectedness graph which were constructed from it. 

This hypothesis is necessary because the two point indiscrete space is a COTS. 
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Both A and B must behave similarly (otherwise we have a separation of X). If 
A, B are both closed then {x} is open; otherwise cl A M cl B = {x} is closed, and 
A, B are open. 

If x, y are order-adjacent, say x = y”, the successor of y, we show one is open 
and the other closed. Say x is closed, so that U(x) is open. Then cl U(x) = U(x) 
U {x} = U(y) is closed, and {y} is open; here again a modification is necessary if x 
is an end point. A similar argument works if we assume one of the points open. 

For the last sentence; show U(x) is open by noting that, for each y > x, U(y) or 
U(y) U {y} is open, and U(x) is a union of these. 

In the classical theory a path (resp. arc) in a topological space is a continuous 
function (homeomorphism) from a closed real interval. We define a digital path 
(resp. digital arc) in a topological space to be the range of a continuous function 
(homeomorphism) from a finite COTS, i.e., from a finite interval in Z. If it is clear 
from the context, we sometimes omit the adjective digital. 

The following properties will be useful in Jordan curve theory. 


THEOREM 3.3 [7, Theorem 13]. A finite topological space is connected iff it is 
digitally arc-connected iff it is digitally path-connected. 

If C is a finite connected set containing distinct points x and y, then C is a digital 
arc with end points x and y iff |@x)|=|My)|=1 and |M(w)| =2 for any 
w EC — {x, y} (ie., iff there is no “extra” connectedness ). 


Proof. The first statement follows immediately from the following characteriza- 
tion of COTS-arc [7, Proposition 13(b)]: In any topological space a set is minimal 
among connected sets containing points x and y iff it is a COTS-arc with end 
points x, y. The second follows because, in a COTS, .“(w) is the set of points 
order-adjacent to w. The theorem Is actually valid more generally for Alexandroff 
spaces. 

A more complete discussion of COTS with detailed proofs can be found in [7, 
Sections 2,3]. Although it is not needed for the sequel, let us note how the last 
sentence of Proposition 3.2 relates this theory to a now classical theory of 
connected ordered T7,-spaces (see [9] for a recent survey): considering only T,- 
spaces is equivalent to considering only orderings without order-adjacent points 
(see also [7, Proposition 9]). 


4. The topology of the digital plane. A space X X< Y with the product topology, 
where X and Y are locally finite COTS (i.e., interval subsets of Z) with at least 
three points, is called a digital plane. Point (x, y) is called pure if {x} and {y} are 
either both open or both closed, mixed otherwise (i.e., one open and the other 
closed). Define border(X < Y) = {(x, y): either x or y is an end point}. The 
border of a finite plane will play an important role in the proof of the digital 
Jordan curve theorem. On the other hand, it is sometimes convenient to have a 
plane with no border, in which case we use Z X Z. 

Let us begin by considering the connectedness graph of a digital plane, a 
portion of which is shown in FiGurE 4. (We omit the details of the construction, 
but see §6 where they are given for the analogous construction in Z°.) Several 
properties will be useful. 

1. The two different kinds of pure points behave similarly with respect to 
connectedness. The adjacency set of a pure (resp. mixed) point which is not a 
border point has eight (four) points. 
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an +4 C) = (open, open) point 
@ = (closed, closed) point 
mm = (open, closed) point 
| = (closed, open) point 
2n 


2m 2m+ 4 


Fic. 4. Connectedness graph of a portion of a digital plane. 


2. In constructing a digital path one can follow any sequence of points as long as 
adjacent points on the path are connected in the graph. Thus, by the preceding 
theorem, the connectedness graph is sufficient to determine which sets are 
connected. 

3. For an arc, however, there can be no ‘“‘extra” connectedness (as noted in the 
preceding theorem); thus an arc cannot turn at a mixed point (since the pure point 
before the mixed point would be connected to the pure point after the mixed 
point). 

Although it will not be needed in the sequel, one can construct the digraph of 
the specialization order from the (undirected) connectedness graph by directing 
the edges as follows: edges go from closed points and towards open points. Since 
every edge contains a pure point, this suffices to direct all of the edges. Note that 
mixed points are saddle points in this ordering.’ 

Thus far we have considered digital lines as objects arising naturally from the 
COTS property and digital planes as products of such lines. It is important to note 
that there is an alternative construction of digital spaces as suitable quotients of 
real spaces. This will play an essential role in §6 when we use the Jordan-Brouwer 
theorem for real 3-space in a digital proof and in §7 when we consider how to 
display digital images. Define f: R > Z by f(2n) = 2n and f(x) = 2n + 1 when- 
ever 2n <x < 2n +2. In the quotient topology on Z odd points are open and 
even points are closed, so the quotient topology is the COTS topology. Now define 
g: R* > Z’ by g =f X f. Again, the quotient topology coincides with the digital 
topology. 

Alternative diagrams of this quotient topology on digital lines and planes are 
obtained by looking anew at the diagram of a partitioning of a line (FiGuRE 5) and 
seeing each dot as a closed point and each subinterval between two dots as an 


"For an Alexandroff topology, reversing the specialization order gives a new Alexandroff topology in 
which the new open sets are the old closed sets. These topologies are not homeomorphic in general. 
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Fic. 5. The digital line as the quotient of the real line. 


open point. Thus a set is open in Z if it “looks open’ i.e., the corresponding set is 
open in R. The corresponding diagram of the digital plane looks like FiGuRE 6. 
Here the open points are represented by the interiors of the rectangles, the closed 
points by their corners, and the mixed points by their open sides. One’s intuition 
about these digital structures can be improved by using FiGuReE 6 to derive the 
connectedness graph given in FiGuRE 4. 


———_@—_9__o eo __-e—_—_- 


Fic. 6. The digital plane—quotient of the real plane or product of digital lines. 


This gives further insight into why a digital arc should not turn at a mixed point: 
to exit from the interior of a square, an arc goes through an edge or a corner, but 
not an edge and then a corner. 


5. The digital Jordan curve theorem. We say that a digital Jordan curve is a 
finite connected set J with |J| > 4 such that J — {j} is a digital arc for each j € J. 
By [7, Prop. 17(a)] this is equivalent to saying that J # @ is connected with 
WL(j) AJ a two point discrete set for each j € J. For use in the next section we 
note that the second characterization extends to three dimensional space as well. 
We now sketch the proof of the digital Jordan curve theorem; for a more formal 
treatment with all of the details see [7, Section 4]. Let X = X, x X, be a finite 
digital plane. We will use the fact that, for any non-border point j, /(/) is a 
digital Jordan curve and .(j) — J has exactly two components. Since this proof 
utilizes the border, we begin by proving 


THeoreM 5.1. If C is an arc in X, then X — C and border(X) — C have the 
same number of components and these correspond by set inclusion. 


Proof. Let @ = border(X) and consider the map WV from the set of compo- 
nents of @ — C to the set of components of X — C defined by letting V(W) be 
the component in X — C which contains W. We show that WV is a bijection by 
showing: 

(i) Each component of X — C meets @ — C. 

(ii) If D is an arcin X that meets more than one component of @ — C, then D 
meets C. 
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To prove (i) by induction on |C|, assume that we have a counterexample in 
which C is as short as possible and suppose that the component of some point p 
does not meet @&. If f is an endpoint of C, then C’ = C — {f} is a shorter arc, so 
the component of p in X — C’ meets &. There is an arc D in X — C’ connecting 
p tosome b € @, and f must be on D. We get a contradiction by constructing an 
arc D’ in X — C also connecting p to b. We construct D’ from D — {f} by using a 
portion of <(f) — C to “bridge the gap” which arose when f was removed from 
D. Ficure 7 shows one possible configuration where f is a mixed point. To justify 
using Wf) — C in this fashion, note that |#/(f) A C| = 1 and that of) is a 
Jordan curve, so that ..°/(f) — C is an arc by [7, 17(a)]. The remaining cases and 
other details can be found in [7, Lemma 18]. 


_C ig f @ oS 
D' 


D 


Fic. 7. Construction of D’ for the case in which f is a mixed point not on &. 


If (ii) fails, let X,, X,,C, D be a minimal counterexample; note that minimality 
implies that C and D meet the border at precisely their endpoints. If |X,| = LX,| 
= 3 the result is easy to verify, so we may assume, say, |X,| > 3. Let y be the 
initial point of X, and consider a second example obtained by deleting the bottom 
row of the original plane; i.e., replace X, by X, — {y}. By the minimality, this is 
no longer a counterexample. This yields a contradiction, the details of which are in 
[7, Lemma 19]. 


THEOREM 5.2. If J is a digital Jordan curve in X which does not meet border(X ), 
then Z* — J has exactly two components. 


Proof. To show that X — J has at most two components, fix j € J and note that 
J — {j} is an arc which does not meet border(X). By the preceding theorem, 
X — (J — {j}) is connected; thus any point in X — J can be joined to j by an arc in 
X — J. Such an arc must meet .~/(j) — J. This shows that any point in X — J can 
be joined to Wj) — {j} by an arc in X — J. Since Wj) — J has two components, 
it follows that X — J has at most two components. 

To show that X —J has at least two components, we must find an “inside”’ 
point (i.e., a point which cannot be connected to border(X ) by an arc in X — J). 
To utilize the previous theorem, we shorten X, as we did in the proof of (ii), by 
“moving up” the bottom edge until it touches J. By modifying J, we then construct 
an arc C which separates the border of this shortened plane X’. By the previous 
theorem X’ — C has at least two components, and the existence of the desired 
inside point can be deduced from this. The many details missing from this sketch 
can be found in the proof of [7, Theorem 21]. 
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Although this theorem was formulated in a finite plane in order to use the 
border in the proof, as an obvious corollary we get an analogous theorem for 
(finite) Jordan curves in Z”. 


TuEoreM 5.2. If J is a digital Jordan curve in Z, then Z* — J has exactly two 
components. 


6. The Jordan surface theorem. There are also three dimensional questions in 
image processing; the third dimension can be spatial of course (e.g., X-ray 
tomography), but it can also be time (e.g., a “movie” is a sequence of changing two 
dimensional images) or a “gray scale function”, which assigns to each pixel in a two 
dimensional digital image a numerical value measuring the shade of gray assigned 
to that pixel. Since the topology for the digital plane was a product topology, there 
is such a topology for Z°. We use this to prove a digital analog of the classical 
Jordan-Brouwer theorem which asserts that every compact 2-manifold separates 
R°> into two connected components. 

Let X = X, xX X, X X;, where each X; is a finite COTS; let 7+ denote the 
product of the COTS topologies. A digital m-simplex in X is a set of m+ 1 
mutually connected points. It is convenient to use the terms dyad (resp. triad, 
tetrad) for the cases m = 1 (resp., 2, 3). 

As in the digital plane (§4), there are just two types of adjacency sets, for pure 
points and for mixed points (definitions analogous to those given earlier). This 
time we verify this by constructing .e( p) for the various types of non-border points 
p = (x,y, z): 


1. If p is closed then® 
N(p) ={x7,x,x7} X{y ,y,y*} X {z7, 2,27} 


and cl{ p} = {p}, so that |N(p) U cl{ p}| = 27 and Mp) has 26 points. 

2. If p is open, then the roles of N and cl are reversed, but the result is the 
same. 

3. If p is mixed, say (open, open, closed), then 


N(p) = {x} X ty} X {275 2,27} 


and 


cl{ p} = {x~,x, x} x {y ,y,y } x {z} 
so that .</(p) has 10 (= 2 + 8) points. 


The connectedness graph can again be used to determine which sets are 
connected. In Figure 8 we illustrate the connectedness relationships in a unit 
cube of X; with an appropriate rotation any unit cube of X can be reduced to this 
configuration. 

Define a digital Jordan surface in X to be a r-connected subset S of X which 
does not meet the border and such that <p) S is a digital Jordan curve for 
each p © S. The next lemma [14] gives some properties which begin to justify 
calling such sets surfaces. 


SNotation: x~ (resp. x*+) denotes the predecessor (successor) of x in the COTS-ordering. 
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@ = mixed point 


O = pure point 


Fic. 8. The connectedness graph of a unit cube in digital three space. 


LEMMA 6.1. If S is digital Jordan surface, then each dyad in S is contained in 
exactly two triads in S, and S contains no tetrads. 


We sketch the proof of the separation theorem for three dimensional space; full 
details can be found in [14]. The proof utilizes the construction of a polyhedral 
analog in R° of the digital Jordan surface S. This construction has been general- 
ized by Kong and Khalimsky [10] to arbitrary locally finite T)-spaces; we use their 
notation here. This separation theorem has been generalized to n-dimensional 
digital space by T. Yung Kong (personal communication). The concept of approxi- 
mating polyhedra by such finite spaces goes back to Alexandroff [1]. 


THEOREM 6.2. If S is a digital Jordan surface in X then X — S has exactly two 
T-components. 


Proof. If we regard X as a subset of the integer lattice points of R°, then each 
digital simplex in X determines a real simplex in R° (take the euclidean convex 
hull). Now each subset S of X is a union of digital simplexes, and in fact 
determines a simplicial complex K(S) in R°. We are interested in |K(S)|, the 
associated polyhedron. Let v denote the usual euclidean topology for R*. One can 
show that |K(S)| is a compact 2-manifold in R°. By the classical Jordan-Brouwer 
theorem, R° —|K(S)| has two v-components, J, and E,. Put T= XJ, and 
E=XOE,. 

It remains to show that J and E are the r-components of X — S. That J and E 
are T-separated follows readily from the fact that J, and E, are v-separated. We 
show that J is non-void and r-connected by constructing a continuous surjection f: 
(I,,v) > U, 7). (A similar argument works for E). We construct f as a composi- 
tion of p: U,,v) > (KU), v) and w: (|KCU)|, v) — U7). In §4 we constructed a 
quotient mapping R”? > Z*. The desired 7 is the restriction to |K(/)| of the 
analogous quotient mapping R°® > Z°. Since some points in J are 7-connected to 
points in S, we must retract J, onto |K(J)| before taking the quotient; see [14] for 
the details of this construction of p. 

For two dimensions our topological theory is equivalent to the graph-theoretic 
version in the following sense. Our Jordan curves (and Jordan surfaces) yield 
graphs which are “normal digital pictures” in the sense of Kong and Roscoe [12], 
and their approach gives a graph-theoretic proof for our Jordan curve theorem. 
Conversely, there is a topological proof of the Rosenfeld Jordan curve theorem 
(see §8 for details). In three dimensions, however, the topology yields a new Jordan 
surface theorem. Originally, 6, 18 and 26-adjacency (definitions analogous to those 
for 4 and 8-adjacency in two dimensions) were studied and Jordan surface 
theorems were developed [21, 20, 11] for (6, 26), (26, 6), (6, 18), and (18, 6) (see [13, 
§6] for further references). 
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7. Boundaries in digital planes. Let us look more carefully at the question of 
how pixels are represented mathematically. In a black and white image, a pixel 
takes one of two values, so pixels are often represented as points in Z? and the 
image as a {0, 1}-valued function on the set of pixels. But an actual pixel has area. 
How does one deal with points that have area? Some models regard all of the 
elements as points in Z”, while others regard a point in Z” as representing the unit 
square in R* of which it is the center. 

We now have a model in which some elements of Z” act like points (the closed 
points), some like segments (the mixed points), and some act like areas (the open 
points), but only the latter are actually displayed (i.e., correspond to pixels). As 
noted in §4, our topology arises as the quotient topology from a quotient mapping 
of the real plane onto the digital plane, in which a closed unit square in R’ is 
partitioned into 9 equivalence classes: the 4 corner points as singletons, the 4 open 
segments of the border, and, finally, all of the interior points as the ninth class. 

In order to consider the questions about boundaries from §1, we need some 
definitions. A scene in a digital plane X is a partition of X into connected subsets. 
A separator in X is either a digital Jordan curve which does not meet border(X ) 
or a digital arc which meets border(X ) at precisely its endpoints. We saw in §5 
that, if J is a separator, then XY — J has exactly two components; furthermore, it is 
not difficult to see that J is minimal with respect to this property. 

Example 8 of [8] shows that we need some conditions on the elements of a scene 
in order that their boundaries be unions of separators. To that end we define a 
robust scene to be a scene, each of whose elements has connected interior and is 
contained in the closure of its interior. 

Defining a cartoon to be a union of closed separators, we get a natural 
correspondence between robust scenes and cartoons. Before looking at this theo- 
rem, let us note some important properties of closed separators. By the boundary 
of set A we mean the topological boundary, i.e., cl A — int A, which we denote by 
bdry A. 


Proposition 7.1 [8, Proposition 10]. The following are equivalent for a separ- 
ator J: 

1. J is the boundary of one of the components of X — J. 

2. J is closed. 

3. J contains no open points. 

4. J is nowhere dense. 


The last two properties are actually equivalent for any subset of a locally finite 
space (recall that the set of open points is pointwise dense in such spaces). 


THEOREM 7.2 [8, Theorem 13]. Jf C is a cartoon, then there is a robust scene II 
such that 


C = U{bdry A: A € II}. 


Further, {int A: A © IT} is uniquely determined by C. Conversely, if II is any robust 
scene then 


C=X-Uf{int A: Ae} 
is the unique cartoon for which C = U{bdry A: A € II}. 


The fact that a cartoon does not uniquely determine the elements of a scene, 
but only their interiors, reflects a reality of vision: we do not ‘“‘see” the bounding 
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curves between adjacent regions; rather, we infer them from the regions they 
bound. This ambiguity about what to do with the bounding curves disappears when 
we consider the third question: how should one display these images? Our answer 
is to display only the subspace consisting of the topologically open points. Define 
the open screen # in a digital plane to be the subset consisting of all the open 
points. In this approach only points of the open screen are actually represented as 
pixels; other points are not displayed. 

Note that this also resolves a difficulty mentioned in §1; although the digital 
plane is not homogeneous, the open screen is, as the setting for a picture should 
be. Also, if we consider this from the perspective of the quotient mapping of the 
real plane onto the digital plane, we see that it is the grid lines in the real plane 
which correspond to the points not represented by pixels in the open screen. This 
grid is a set of measure zero. 

The trace of a robust scene on the open screen is called its display; more 
formally, if II is a robust scene, then 


Dy ={ANP: A € Th 


is its display. Our next theorem provides a unified answer to the three questions 
about boundaries raised in §1. 


THEOREM 7.3 [8, Theorems 18 and 19], 


1. Displays and cartoons of robust scenes uniquely determine each other. 

2. Each display of a robust scene is a partition of the open screen # into 
““4-connected”’ sets. Conversely, each such partition is the display of some 
robust scene. 


Since # is still a subspace of X, the concept “4-connected in #” requires a 
slightly different definition of 4-neighbor from that given in §1: the 4-neighbors of 
(x, y) are the points (x + 2, y) and (x, y + 2). 

A robust scene determines a unique cartoon and a unique display; furthermore, 
a display and a cartoon uniquely determine each other. However, a robust scene is 
not uniquely determined by either its display or its cartoon, because one must 
decide where to put the bounding curves. One method of constructing the robust 
scene in either of these cases: enumerate the connected sets in the partition (.e., 
the 4-connected sets (in the case of a display) or the topological components of the 
complement of the cartoon), say, Q,...Q,. Let A;=clQ,;— U{elQ;: j < i. 
Then in both cases A,,..., A, turns out to be an appropriate robust scene. 

To summarize then, we have three ways to specify a digital image, and they are 
equivalent in the sense that the classes determined by each of them are coextensive 
and each determines the other two essentially uniquely. One can specify the 
bounding curves (cartoons) or specify the connected regions, either as topologically 
connected sets in the digital plane (robust scenes) or as 4-connected subsets of the 
open screen (displays). 

This latter characterization is reminiscent of the Rosenfeld theorem stated in 
§1. We conclude by giving a topological proof of this [8, Theorem 20] in the next 
section, but before getting to that let us describe briefly another more general 
topological approach to some of these issues. Kronheimer [17] defines a grid A on 
a topological space S to be a decomposition in which the union of the open 
elements is dense and the projection S > §/A is open. (Thus the Khalimsky 
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topology forms a grid on R”.) A trace space is a space in which the set of all open 
points is dense; any trace space can be reduced to a unique minimal trace space: 
roughly, the quotient obtained by identifying as many non-open points as possible, 
subject to the identification map being open. Call a trace space digital if it is both 
minimal and semi-regular. It follows that any regular Hausdorff space with a dense 
discrete subset (e.g., any Hausdorff compactification of N) is digital. He then 
constructs, on an arbitary set E, a universal digital space (by topologizing the 
collection of all filters on E with a variant of the Wallman topology), universal in 
the sense that every digital space with a trace equipotent to E can be densely 
embedded in the universal space. 


8. A topological proof of the Rosenfeld Theorem. We prove the Rosenfeld 
theorem stated earlier by embedding a Rosenfeld plane, R = Z” with no topology, 
into a digital plane, X = Z? with the Khalimsky topology, via the slant map S (so 
called because the embedding uses a 45° rotation) defined by: 


S(x,y) =(x+y,-x+y). 


If U denotes the set of pure points in X then range S < U because (x + y) + 
(y — x) = 2y is always even. 

Recalling that connectedness in X means topological connectedness, whereas 
in R it means k-connectedness for k = 4 or 8, let us look at images under S of 4 
and 8-neighbors of (x, y) in R as shown in Ficure 9. 


‘(x+1,y+1) 


s S(x +1, y + 1) 
—_» e O 


(x + 1, y) 


e e O 
S(x + 1, y) 
@ 


O = mixed point 
not in range S 


Fic. 9. Images of 4 and 8-neighbors under the slant map S. 


Thus: qg is a 4-neighbor of p iff S(q) € W(S(p)) and gq is a proper 8-neighbor 
of p iff there is a unique mixed point on the (necessarily horizontal or vertical) line 
joining S(p) and S(q). 

If J is a 4-path in R, let J* = S(J/). If J is an 8-path, form J* by adjoining to 
S(J) the mixed point between S(p,) and S(p,;,,) whenever p,;,, is a proper 
8-neighbor of p, on J.” If J is a k-path (resp. k-arc, Jordan k-curve) in R then J* 


In the case where J is a Jordan 8-curve, one must also do this between the last point and the first 
point of J. 
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is a topological path (arc, Jordan curve) in X. If H,J are disjoint in R, with one a 
4-path and the other an 8-path, then note that H* and J* are disjoint in X. 

Now to sketch the proof of the theorem (full details may be found in 
[8, Theorem 20]), let J be a Jordan k-curve in R which has at least five points and 
does not meet the border. Since X — J* has exactly two components, J and E£, it 
remains to show that S~'(/) and S~'(E) are the k'-components of R — J. 

We first show that S~'(/) and S~‘(E) are k’-separated. If not, there is a k’-arc 
H which joins them and is disjoint from J. But then H* and J* are disjoint in X, 
which contradicts the separation of J and E. 

To show S~‘(/) non-void it suffices to verify that J contains a pure point, which 
is seen by using the hypothesis |J| > 4. 

To show that S~‘(J) is k'-connected, we let p,q € S~'() and construct a 
k'-path joining them. Now S(p), S(qg) € J, so there is a topological path C in J 
joining them. Since C does not meet J*, S~1(C) is an 8-path in S~‘(J) joining p 
and g, and we are done if k’ = 8. If k’ = 4, we must “square the corners” of 
S~'(C) to make it a 4-path. For the 8-neighbors on S~‘(C) illustrated in FiguRrE 
10, we need to know that either u or v is available (i.e., is in S~'()), so that it can 
be adjoined to the path. Since S~'(J) and S~‘(E) are k'-separated, u,v € S~'(E). 


S(u) 


Sv) 


Fic. 10. To convert S~'(C) from an 8-path to a 4-path we ‘square the corner’ by adjoining either 
u OF U. 


If u,v € J, then S(u), Sv) € J* and the mixed point m must be on J*, which is 
not possible since C and J* are disjoint. Thus either u or v can be adjoined to 
S-—(C) to “square the corner”. This can be done for each pair of proper 
8-neighbors. This shows that S~'(/) is k’-connected. Since a similar argument 
works for S~!(E), the proof is complete. 
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Linear Algebra by Analogy 


Scott H. Hocuwa _p, University of North Florida, Jacksonville, FL 32216 


Scott H. Hocuwa.p: I discovered the beauty of mathematics while I was a 
member of a Brooklyn Tech High School class that spent three years under 
the guidance of Lawrence J. Zimmerman. I learned how to do mathematics 
at Caltech, where Richard Dean and Fred Dashiell played key roles in that 
phase of my education. I received my Ph.D. in mathematics from U.C. 
Berkeley in 1984 under the direction of Donald Sarason. Since that time, I 
have been on the faculty of the University of North Florida. My main 
research interests are operator theory and linear algebra. 


1. Introduction. Whenever I see a result about real or complex numbers, I ask 
myself whether it is also true for linear transformations. I developed this habit as a 
result of reading books like [9] and articles like [6]. 

One reason I wrote this article is so that others may pick up this habit. Another 
reason I wrote it is to emphasize aspects of linear algebra that suffer from 
underexposure. 

My use of the phrase “linear transformations’ as opposed to the word 
“matrices” is deliberate. A great deal can be said about linear algebra without 
appealing to matrices. In fact, except for 2 x 2 matrix examples and counterexam- 
ples, I could have written this article without mentioning matrices. I chose not to 
do this so that insights were not obscured by techniques. Those who want to fill in 
any coordinate-free gaps or who want to see a coordinate-free treatment of linear 
algebra should read [9]. 

In order to make this article accessible to as large an audience as possible, the 
underlying vector space is restricted to being finite-dimensional. Much of what is 
true in the finite-dimensional case is also true in the infinite-dimensional (com- 
plete) case, and what isn’t true is what makes the study of infinite-dimensional 
spaces interesting. Therefore, Ill occasionally note some of the complications that 
occur in infinite-dimensions. 


2. Introducing the main players. Let L(V) denote the set of all linear transfor- 
mations on a complex finite-dimensional inner product space V of dimension n, 
where n > 1. Let C denote the set of complex numbers and (-,: ) denote the inner 
product on V. 

In what follows, all 2 x 2 matrices are written with respect to the orthonormal 
basis {(2), (°)} for the inner product space consisting of the set of all ordered 
pairs of complex numbers with the dot product. 

If S € L(V), then S*, the adjoint of S, is the unique linear transformation that 
satisfies (Sx, y) = (x, S*y) for all x, y in V. If one fixes an orthonormal basis for V 
then the matrix of S* with respect to this basis is the conjugate transpose of the 
matrix of S with respect to it. It follows directly from either of the last two 
statements that for S,T in L(V): (S + T)* = S* + T*, (ST)* = T*S*, (S*)* = S, 
and (kS)* = kS* if k € C. 

In an algebraic sense, L(V) with the adjoint operation looks very much like C 
with the conjugation operation. (This is the raison d’étre of this article.) Their 
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algebraic differences occur because multiplication in L(V) is not commutative and 
there are nonzero linear transformations that do not have multiplicative inverses. 
One of these conditions has a much greater impact than the other. Care to make a 
conjecture? 

Given any complex number z, there exist unique real numbers a and b such 
that z =a + be. In order to transfer this basic observation regarding complex 
numbers into one concerning linear transformations, the “real” linear transforma- 
tions must be identified. Since the complex number z is real if and only if z = Z, 
we say a linear transformation S is Hermitian (‘real’) if S = S*. Since 


z+Z 
Re(z) = 


z—-Z 
and Im(z) = for a complex number z, 


we define the real and imaginary parts of a linear transformation S$ by 


S* S — S* 
and Im(S) = 
2¢ 


It then follows that both Re(S) and Im(S) are Hermitian and S = Re(S) + ¢ Im(S). 
If S is Hermitian, then (Sx, x) is real for all x in V, because (Sx, x) = (x, Sx) 
= (Sx, x). The converse of this observation is also true. That is, if (Sx, x) is real 
for all x in V, then S is Hermitian ((9, p. 139]). 
It is easy to see that the set of Hermitian linear transformations, like the set of 
real numbers, is closed under addition. However, the situation is different for 
multiplication. For example, if 


_ 1 -1 _{1 1 
S = 5) and 7 (| ar 


Re(S) = 


then S and 7 are both Hermitian, but ST = (° 4) is not. It is because multipli- 
cation is not commutative that we get this counterexample. If S and T are 
Hermitian linear transformations and S$ commutes with T, then ST is Hermitian, 
simply because (ST)* = (7S)* = S*T* = ST. 

Now that the “real” linear transformations have been identified, it follows 
naturally that we look for the “pure imaginary” linear transformations, the 
“nositive” linear transformations, and the “negative” linear transformations. 

Since a complex number z is pure imaginary if and only if z = —Z, we say that 
a linear transformation S is skew-Hermitian (“pure imaginary”) if S = —S*. 

A moment’s thought about connections between real and pure imaginary 
numbers and a few simple manipulations lead us to conclude that a linear 
transformation S$ is skew-Hermitian if and only if ¢S is Hermitian. Thus, the set of 
skew-Hermitian linear transformations is closed under addition and, as expected, it 
is not closed under multiplication. 

The definition of a ‘“‘positive” linear transformation is not as clear cut. It is best 
approached by first defining a “nonnegative” linear transformation, although this 
also has its complications. On the one hand, since a real number p is nonnegative 
if and only if p = Zz for some complex number z, we should say that a Hermitian 
linear transformation S is nonnegative if S = T*T for some linear transformation 
T. On the other hand, since a real number p is nonnegative if and only if p = b? 
for some real number b, we should say that a Hermitian linear transformation S is 
nonnegative if S = R* for some Hermitian linear transformation R. 
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It turns out that it is not necessary to decide which definition is better because 
they are equivalent. (One way to show this is to use the Diagonalization Theorem 
in section 4.) 

If § © L(V) and § = T*T for some T in L(V), then S is Hermitian. So, it 
makes sense to simply say, “S is a nonnegative linear transformation,’ and to 
denote this by S > 0. 

A linear transformation S is said to be nonpositive if —S > 0. 

If S and T are Hermitian linear transformations and $ — T > 0, then we write 
S > T. This natural partial order turns out not to be a total order. The following 
proposition and a “natural counterexample”’ illustrate why this is the case. In 
addition, the proposition establishes a direct connection between ‘“‘> ” for non- 
negative real numbers and “> ” for nonnegative linear transformations. 


PROPOSITION 2.1. The linear transformation S is nonnegative if and only if 
(Sx, x) > 0 for all x in V. 


See [9, p. 166] for a proof of the hard direction. 


If 
“(1 0) Jf _ (0 
S ( 4). x (3) andy (‘). 


then (Sx, x) = 1 and (Sy, y) = —1. It then follows from Proposition 2.1 that S$ + 0 
and 0 + S. Thus, unlike the set of real numbers, the set of Hermitian linear 
transformations cannot be totally ordered by the natural partial order. 

Another immediate consequence of Proposition 2.1 is that the set of nonnega- 
tive linear transformations is closed under addition. 

Again, the situation is different for multiplication. For example, if 


s-( )-(2 (0 7 


7 (5 ‘) _ (5 alt | 
1 5 1 2)/\0 2)’ 
then ST = (° 4), This shows that the set of nonnegative linear transformations is 
not closed under multiplication. Once again, this is a consequence of the fact that 
multiplication is not commutative. That is, if two nonnegative linear transforma- 
tions commute, then their product is nonnegative ((9, p. 167]). 

In view of the discussion concerning nonnegative linear transformations, it 
seems reasonable to say that a linear transformation, S, is positive, in symbols 
S > 0, if (Sx,x) > 0 for all x in V. Jf S is positive, then S is invertible and 
nonnegative. This implies that S = T*T for some invertible linear transformation T 
and S = R? for some invertible Hermitian linear transformation R. These two 
sentences are in italics because even when “properly modified” they are not valid 
when V is infinite dimensional. “‘Properly modified” means that the phrase “linear 
transformation” is replaced by the phrase ‘‘bounded linear transformation” (see [9, 
p. 176] for a definition). 

The reason for starting the above discussion by identifying the nonnegative 
linear transformations rather than the positive linear transformations can now be 
revealed. A nonnegative real number is positive if it is nonzero; this is equivalent 


and 
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to its being invertible (with respect to multiplication). However, these two terms, 
nonzero and invertible, are not equivalent when used to describe linear transfor- 
mations. Thus, the translation of the statement, “‘a real number p is positive if 
there exists a nonzero complex number z such that p = zz” to a statement about 
linear transformations is ambiguous. It now appears that ‘nonzero complex 
number” corresponds to “invertible linear transformation.” 

The final subset of the complex plane that has a role in the analogies is the unit 
circle. A convenient way to describe this set.so that the corresponding set of linear 
transformations can be identified is {z: z € C and Zz = 1}. Thus, we say that an 
invertible linear transformation U is unitary if U*U = J. (This is not redundant 
when V is infinite-dimensional.) If U and V are unitary linear transformations, 
then (UV )*UV = V*U*UV =I. Hence, the set of unitary linear transformations, 
like the unit circle, is closed under multiplication. 


3. Natural questions. This section contains answers to many questions that 
arise naturally when L(V) is viewed in the light of the previous section. Most of the 
results in this section just appear as statements of theorems. To fully appreciate 
these results their proofs must be sampled. | 

The following theorem demonstrates that nonnegative linear transformations 
behave like nonnegative real numbers with respect to taking roots. 


Root THEorREM. /f S is a nonnegative linear transformation and n is a positive 
integer, then there exists a unique nonnegative linear transformation T such that 
T” = § (so S'/" = T). 


See [3] for a proof. (Another way to prove this is to use the Diagonalization 
Theorem in section 4.) 

If z is a complex number, then there is a unique nonnegative real number 
r(r = |z| = (Zz)'””) and a real number 6 in [0,27) such that z = re’®. If z is 
nonzero, then r is positive and @ is also uniquely determined. 

The next analogous result places a different emphasis upon this decomposition. 


PoLAR DECOMPOSITION THEOREM ([15], [9, p. 169]). If S € L(V), then there is a 
unique nonnegative linear transformation P, where P = (S*S)'/*, and a unitary 
linear transformation U such that §S = UP. If S is invertible, then P is positive and U 
is also uniquely determined. 


There is more to say about this decomposition but the statements and details 
must wait until section 4. 

If S = LY), let |S| denote (S*S)'/*. The next five results need no added 
explanation. 

THEOREM 3.1 ([5]). If S € L(V) and (Re(S))* = |S \7, then S is Hermitian. 

THEOREM 3.2 ((6]). If S € L(V) and Re(S) = |S|, then S is nonnegative. 

THEOREM 3.3 ((6]). Jf S € L(V) and |Re(S)| = |S|, then S is Hermitian. 

THEOREM 3.4 ((14]). Jf S, T © L(V) and S => T = 0, then S'”* > T'”? > 0. 


THEoREM 3.5 ((1]). Jf S,T € L(V), T is invertible, and S => T= 0, then S is 
invertible and T~' > S~! > 0. 
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The ‘‘properly modified” version of Theorem 3.3 is unresolved in the infinite- 
dimensional case. 

Nice proofs of Theorems 3.4 and 3.5 and much more in the spirit of this article 
can be found in [4]. Those interested in [4] may also be interested in [7] and [8]. 

We now break the trend established by the last few theorems and show that 
some inequalities about real numbers cannot be translated to inequalities about 
Hermitian linear transformations. 

There exist linear transformations S and T such that S > T > 0, but S* # T?. 
Let S = (_! “1 and T = (° °), It is easy to see that TJ > 0, and we already know 
that S > 0. Since 


sr-(} “1 -(1 9h “tp 


it follows that S > 7. Since 


gor.(_2 -3) 


—3 4 
and (S$? — T*)x, x) = —.5 when x = (*5], it follows that S* # T°. 

The story is different if S and T commute. If §$ > 7 > 0 and S commutes with 
T, then S? > T’. Here are the details. Since both § + T and S — T are nonnega- 
tive and they commute with each other, it follows that S* — T? =(§8 + T\S — T) 
> 0. 


There exist Hermitian linear transformations S and JT such that S$ > 0 and 
S>T2> —S but S ¢ |T|. Let 


-(2 2) = valle | 
2 2 0 y2}\v2 v2 
and T=(! _°}. It then follows that 


v2 (02 | s+7=(° 0 (2 1) 


S-T= 
v2 1}/\0 1 1 0 


but 


S —|T| = (5 *) and ((S — |7|)x, x) = —1 when x = | i}: 

Once again, if S and 7 commute, then S$ > 0 and § > 7 > —S imply that 

S > |T|. This follows from the facts that S$? > T* and S = ($7)! > (T?)!7? = 
|T|. 


4. Normal linear transformations. This brief section explores another impor- 
tant effect of commutativity and completes a discussion that began in section 3. 
Even though the following sets are characterized by the commutativity of different 
linear transformations, they are equal. 

A ={S € L(V): S commutes with $*} 
B = {S € L(V): Re(S) commutes with Im(S)} 
C ={S € L(V): S = UP, a Polar Decomposition, and U commutes with P} 

A few simple calculations show that A = B. A couple of different calculations 
show that C CA. The following shows that A CC. If SS* = S*S and S = UP isa 
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Polar Decomposition of $, then UPPU* = PU*UP. This shows that UP* = P7U. 
Since U commutes with P?, it follows that U commutes with P, the unique positive 
square root of P”. See [9, p. 170] for more details. 

It is easy to see that 


D = {8 € L(V): (Re(S))’ + (Im(S))” = [S17] 


also belongs on the list of sets. 

We say that a linear transformation S is normal if SS* = S*S. Nonnegative, 
Hermitian, skew-Hermitian, and unitary linear transformations are normal. 

The following fact is even more remarkable than the equality of A, B, C, 
and D. 


DIAGONALIZATION THEOREM ([13, p. 304]). The linear transformation S is normal 
if and only if there exists an orthonormal basis for V consisting of eigenvectors of S. 


We are now able to complete the discussion concerning Polar Decompositions. 
Basically, we show that unitary linear transformations have a form analogous to 
the “e’®” form for complex numbers on the unit circle. 

If J is a normal linear transformation, then there exists an orthonormal basis 
{u,} for V and complex numbers 4, such that Tv, = A,v; for j = 1,2,...,n. Tne 
linear transformation e’ is defined by e’v, = e*/u,. 

It is actually true that e’ makes good sense even when T is not normal. 
Formally, e7 = J + &°_,T”"/n!. It can be shown that this infinite series converges 
to a linear transformation and, moreover, the two definitions coincide when T is 
normal. 

If U is a unitary linear transformation and x is an eigenvector of U, then 
(Ux, Ux) = (U*Ux, x) = (x, x) = 1 shows that the set of eigenvalues of U is a 
subset of the unit circle. 

It is now easy to prove the following result. 


THEOREM 4.1. Jf U is a unitary linear transformation, then there exists a Hermi- 
tian linear transformation T such that U = e°'. 


Theorem 4.1 remains valid in the infinite-dimensional case (see [16, p. 317]) as 
long as it is properly modified. However, the Polar Decomposition Theorem must 
be more than properly modified to remain valid in the infinite-dimensional case 
(see [10, p. 74)). 


5. The spectrum and the trace. I’m sure that it is now apparent that the source 
of a lot of the interesting algebraic phenomena in L(V) is commutativity—or the 
lack thereof. This point of view has some surprising consequences when applied to 
the spectrum and the trace. 

If § € L(V), let speciS), the spectrum of S, denote the set of eigenvalues of S 
and let ¢tr(S), the trace of S, denote the sum of the eigenvalues (counting 
multiplicities) of S. Equivalently, tr(S) = the sum of the diagonal entries of any 
matrix representation of S. 


We begin the investigation by concentrating on spec(-). Let spec(S)* denote 
{A: A € spec(S)} and if k € C, let k spec(S) denote {kA: A € spec(S)}. 

If A € spec(S), then there exists a unit vector x such that (Sx, x) = (Ax, x) = 
Mx, x) =A. This implies that spec(S) ¢[0,) if S is nonnegative, spec(S) C 
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(—o,0) if § is Hermitian, and spec(S) C {pure imaginary numbers} if S$ is 
skew-Hermitian. 

It is also easy to see that if k € C, then spec(kS) = k spec(S). 

Another satisfying result concerns the relationship between spec(S) and 
spec($*). 


Proposition 5.1. If S € L(V), then spec(S*) = spec(S)*. 

See [12, p. 57] for a proof. 
Another more remarkable result along these lines concerns products. 
Proposition 5.2. If S and T are in L(V), then spec(ST ) = spec(TS). 


The details for this can be found in [17]. This statement needs slightly more than to 

be properly modified when V is infinite dimensional. See [10, p. 43] for details. 
Proposition 5.2 shows that, in some sense, spec(:) allows any two linear 

transformations to commute. Here are some of the consequences of this. 


THEOREM 5.1. Jf S and T are nonnegative linear transformations, then spec(ST ) 
C [0, ~). 


Proof. Since S = 0, there exists a nonnegative linear transformation R such 
that R* = S. It then follows that spec(ST) = spec(R°T) = spec(RTR). If x € V, 
then (RTRx, x) = (TRx, Rx) = (Ty, y) > 0. Therefore, RTR > O and _ thus, 
spec(RTR) C [0, °°). m 


For a different proof of Theorem 5.1 and interesting related results see [18]. 
The proof in [18] must, however, be modified so that it works for noninvertible 
linear transformations. 

Even though it is true that the product of commuting Hermitian linear transfor- 
mations is Hermitian, it is not possible to replace “nonnegative” in Theorem 5.1 by 
“Hermitian” and ‘{0,)” by “(—,)’. Consider for example, S = (° '] and 


T= (: °h. It is easy to see that S and T are Hermitian, ST = (° 3), and 


spec(ST) = {¢,— ¢}. 

Nevertheless, there is something salvageable from this situation. If S and T are 
Hermitian linear transformations, then spec(ST) = spec(TS) = spec((ST)*). This 
implies that A © spec(ST) if and only if A € spec(ST). This is equivalent to saying 
that spec(ST) is symmetric about the real axis. 

Notice that spec(ST) € (—~, ) if one of S or T is Hermitian and the other is 
nonnegative. 

Can Theorem 5.1 be extended from two factors to three? No. Let 


2 -¢\)_{1 -—¢\f1l 0 1 -1 1 1 
r= (2 ‘ - t Alp i}: s=( 5 5). and = f s}: 
It then follows that R, S, and T are nonnegative, RST = ( ~ 8-94), and 
spec( RST) contains two complex numbers whose sum is 9 — 5¢. We expect this 
because, unlike the situation for two linear transformations, spec(-) ‘‘doesn’t 
allow” three linear transformations to mutually commute. In fact, the above 
example shows that spec(7SR) contains two complex numbers whose sum is 9 + 5¢ 


(spec(TSR) = spec(RST)* since TSR = (RST )*). 
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A positive result along these lines is: If R, S, and T are nonnegative linear 
transformations and RST is Hermitian, then spec( RST) Cc [0,). In fact, these 
conditions imply that RST is nonnegative. See [18] for details. 

The time has come to let tr(-) enter the picture. 

Clearly, tr(S) > 0 if S is nonnegative, tr(S) € (—~,) if § is Hermitian, and 
tr(S) € {pure imaginary numbers} if S is skew-Hermitian. 

In light of what has been mentioned concerning spec(:) and/or by definition it 
is now easy to see that if § and T are linear transformations and k is a complex 
number, then tr(kS) =k tr(S), and tr($*) = tr($), tr(S7) = tr(TS), and if in 
addition, § and T are nonnegative, then tr(ST) > 0. 

A perusal of the beginning of this section shows that we have a perfect match in 
results up to this point. The first discrepancy occurs when we consider tr(S7') for 
Hermitian S and T. In this case, tr(ST) € (—,) but all that can be said for 
spec(ST ) is that it is symmetric about the real axis. 

Formally, we can account for this “better” behavior of tr(-) with a proof. If 
S =[s,,] and T = [t,,] are matrix representations for S and T with respect to the 
same basis for V, then s,, = 5,,, t;; = ¢,; and the sum of the diagonal elements of 
ST is U7, L415; ;t;;. The following calculations show that this sum is indeed a real 
number. 


n n nol n 
2 u Sibi = 2 | Sj jbji + iu L S, bj; + iu Sib ij 


The last expression is the sum of a complex number, its conjugate and a real 
number. 

Actually, more is true. There is, in fact, a basis for V such that the matrix 
representation of ST with respect to this basis consists only of real numbers. See 
[12, Theorem 4.1.7] for the details. 

Informally, additivity might explain this “better” behavior. That is, tr(S + T) = 
tr(S) + tr(T) and it seems that it is not possible for spec(-) to be additive. 
However, if A and B are sets of complex numbers, then 4+ B={a+b:aEA 
and b € B} defines addition for sets. Clearly, spec(S + T) = spec($) + spec(T) is 
too much to hope for. Is spec(S) + spec(7) C spec(S + 7) or spec($ + T) ¢c 
spec(S) + spec(T )? No. If S = (° ‘| and T = (° °), then spec(S) = spec(T) = {0} 
and spec(S + T) = {—1, 1}. Once again, the situation changes when commutativity 
enters the picture. That is, if S and JT commute, then spec(S + T) C spec(S) + 
spec(T ). See [12, p. 92] for the details. Thus, tr(-) is additive and spec(-) isn’t. 


Final remarks. Any discussion of linear algebra that mentions the trace usually 
puts a greater emphasis on the determinant. Therefore, I feel that I must justify 
the fact that section 5 has a discussion of the trace without mentioning the 
determinant. 
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Since det(ST) = det(S)det(7), all the results in section 5 become trivial when 
stated properly for the determinant. However, since det(-) is not additive, we 
should investigate det(S + T). Once again, results become trivial because S$ + T is 
nonnegative when S and T are nonnegative and § + T is Hermitian when S and T 
are Hermitian. 

It is possible to restate Theorem 5.1 in a way that raises an interesting question. 
Namely, if S is a nonnegative linear transformation, then spec(ST) ¢ [0,) for all 
nonnegative linear transformations T. 

Is the converse of this theorem true? That is, if spec(S7) c [0,) for all 
nonnegative linear transformations T, does it follow that S is nonnegative? The 
answer is yes and it is a consequence of Theorem 4 in [2]. Also, [11] has an 
approach to this question and related questions that is different from the approach 
in [2]. 
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An Obvious Proof of Burnside’s Lemma 


KENNETH P. BoGaRt™ 
Department of Mathematics and Computer Science, Dartmouth College, Hanover, NH 03755 


The Burnside-Cauchy-Frobenius (B-C-F) formula [2, 4], which is commonly 
called Burnside’s lemma, allows us to compute the number of orbits (equivalence 
classes) induced by a group G of permutations on a set X (where x and y are 
equivalent if gx =y for some g in G). The usual proof is an elementary 
computation involving the relationship between an orbit and the cosets of the 
subgroup fixing one of its elements [1, 2, 3]. The purpose of this note is to give a 
proof which makes the result an obvious corollary of the product principle. 

The product principle for sets says that the union of n disjoint sets each of size 
m has m-n elements. For multisets the product principle says that the multiset 
union of n disjoint multisets each of size m is a multiset of size mn. 

The intuitive idea that a multiset is a set with multiplicities and its size is the 
sum of the multiplicities should suffice for reading this paper. It is, however, worth 
noting that a function f from a set S to a set T defines a multiset of size |S| 
chosen from JT in which the multiplicity of t € TJ is the number of s € § with 
f(s) = t, ie., | f(t). In particular, given a group G of permutations of a set X 
and an element x of X, the function g — gx from G to X determines a multiset 
denoted by 


Gx =[gexlg eG 


of size |G| chosen from X. We call this multiset a multiorbit of x. Observe that, as 
with ordinary orbits, two multiorbits Gx and Gy are either identical or disjoint. As 
a consequence we have the following Lemma. 


LemMaA. The multiplicity of an element x in a multiorbit is the number of elements 


of G that fix x. 


Proof. If x is in Gy, then the multisets Gy and Gx are equal. Clearly the 
multiplicity of x in Gx is the number of elements that fix x. M& 


By the product principle, if there are n distinct multiorbits, then their union has 
size n|G|. The size of this union is the sum of the multiplicities of the elements of 
X in this union. Since each member lies in one and only one multiorbit, we obtain 


nlG|= 2, i{g: gx = x}| 


X>xXxEX 
=|{(«, 8): gx =x}| 
= ) |{x: gx =x}}. 
g:g6€G 
*Work supported by ONR Contract N00014-88-K-0065 
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Poisson Integrals of Riemann Integrable Functions 


J. KRAL 
MU CSAV, Zitnd 25, 115 67 Praha 1, Czechoslovakia 


W. F. PFEFFER 
Department of Mathematics, University of California, Davis, CA 95616 


Dedicated to Professor Jan Marik for his 70th birthday 


Throughout, R and C denote, respectively, the sets of all real and complex 
numbers. By a function, we always mean a complex-valued function. We let 


D={zeC:|z|<1} and T={ze€C:|z|= 1}. 


An infinitely differentiable function u defined in an open set 0 c @ 1s called 
harmonic in Q, if it satisfies the Laplace equation 


d*us 0*u 
ax? ay? 
at each point z = (x, y) of 0. The Poisson kernel is the positive function 
1 1-r’ 
PU) = 95 T—dreost +r? 


defined for0 <r <1landt<€R. If f is a Lebesgue integrable function defined in 
[—7, 7], then 


u(re'®) = fo P.6 — 1) f(t) dt, 


where 0 < r < 1 and 6 € R, defines a harmonic function u in D, denoted by P « f 
(see [6; Theorem 11.7, p. 234]). Moreover, if f is continuous and f(7) = f(—7), 
then P * f extends continuously to D U T and P« f |} T =f. In this case, P * f is 
the unique solution of the following Dirichlet problem: find a continuous function 
defined in D U T which is harmonic in D and whose restriction to T equals f (cf. 
[2)). 

Thus given a class & of Lebesgue integrable functions defined in [—77, 7], it is a 
natural task to characterize the functions u harmonic in D such that u = P * f for 
an fe &. When &= /#”, such characterizations are well known (see [6, Theo- 
rems 11.16 and 11.30, pp. 239 and 247] if 1 < p < ~, and [3, Chapter I, Section 3.2, 
p. 43] if p = 1). In this note we give a simple characterization for the class #& of 
all Riemann integrable functions defined in [—7, 7]. The proof, which is elemen- 
tary and reasonably self-contained, is based on the classical result that a bounded 
function is Riemann integrable if and only if it is continuous almost everywhere 
(see [5, Theorem 11.33(b), p. 323)). 

A direct calculation shows that 


[oP —t)dt=1 (1) 
09-7 


for each 6,6, € Rand 0 <r < 1. As every Riemann integrable function f defined 
in [—7, 7] is bounded by definition, if follows from (1) that u = P * f is bounded 
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in D. Thus it suffices to consider only the class h® of ail bounded harmonic 
functions in D (cf. [1; Section 1.1, p. 2]). 

If E C(—7,7), we denote by @, the family of all bounded measurable 
functions defined in [—7, 7] which are continuous at each point ¢ € E. 


Proposition. Let u € h®, and let u(0) = u(re’®) for O<r<1 and OER. 
There is an f€ @, with P* f =u if and only if the family {u,: 0 <r < 1} is 
equicontinuous at each @ © E. 


Recall that the family {u,: 0 <r < 1} is equicontinuous at 0, € R whenever 
given « > 0, there is a 6 > 0 such that |u,(@) — u,(@,)| < « for each r € [0, 1) and 
6 € (0, — 6,0) + 8). 


Proof. Suppose first that u = P * f foran f € &,. If 6 € E thenlim, ,,_ u,(@) 
= f(@) by [5; Theorem 11.23, p. 244], and the equicontinuity of the family 
{u,.: 0 <r < 1} at 6 follows. Nonetheless, for completeness we shall prove this 
directly. With no loss of generality, we may assume that f(z) = f(—7) and view ! 
as periodically extended to R with period 277. Choose an a > 0 with |f(t)| < 
for each t © R. Given a 6) € E and « > 0, find a positive 6’ < 7/2 so that 
| f(t) — f(@o)| < &/4 for each ¢ € R with |t — 6,| < 26’. Clearly, lim, ,,_ P.(t) = 0 
uniformly in t when 6’ < |t| < 7 + 6’. Thus there is a positive p < 1 such that 


[ P(6—t) dt <— (2) 
28’ <|t-—O9l<a7 8a 

whenever p <r < 1 and |6 — @,| < 6’. In view of (1) and (2), for these r and 0 we 
have 


14,8) — F(o)| < f-"""P.(0 — £)| f(t) ~ Fo) a 


0 


< 2af P(@—t)dt+ al P(@—t)dt 
26'<|t-—Oy9l<ar —0)|<28' 
E 
< — 
2 


and consequently, |u,(@) — u,(@,)|<«. Since u is uniformly continuous in 
{z € C: |z| < p}, there is a 6” > 0 such that |u,(@) — u,(@,)| < « whenever 0 <r 

<p and |@ — 6,| <6”. Now letting 6 = min{6’,6"}, we see that the family 
(u.: 0 <r < 1} is equicontinuous at 6). 

Conversely, suppose that the family {u,: 0 <r < 1} is equicontinuous at each 
6 € E, and choose a sequence {r,} in [0, 1) converging to 1. By the Ascoli theorem 
(see [4; Chapter 9, Corollary 34, p. 179]), the sequence {u.. 3 contains a convergent 
subsequence, still denoted by {u, a which converges at each 0 € E. We set 


f(t) = lim inf u, (¢) 


for each t © R. Then f is bounded and measurable (see [4; Theorem 20, p. 67]), 
and the equicontinuity of {u, 3 at each 6 € E implies the continuity of f at each 
6 < EF. Thus f © &@, and it suffices to show that u = P + f. To this end, observe 
that for a fixed integer n > 1, the function z — u(r,z) is harmonic in D and 
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continuous in D U T. By [6, Theorem 11.9, p. 235], 


u(r,re’) = [i Po — t)u,(t) dt (3) 


for each re’® in D. Since u is bounded, it follows from [1, Corollary 1, p. 5] that 
f(t) = lim, .,..u,(t) for almost all ¢ © R. Thus fixing r and 6, we have 


lim P,(6 — t)u,(t) = P,(@ — t) f(t) (4) 


for almost all ¢ € R. Moreover, in [—7, 7] the sequence {P.(@ — t)u,(t)} has an 
integrable majorant bP.(@ — t) where b = sup{|u(z)|: z € D}. Consequently, by the 
continuity of u and the dominated convergence theorem (see [4; Theorem 15, p. 
88]), equations (3) and (4) imply that 


u(re®) = [P.O —t) f(t) dt 
and the proposition is proved. 


Coro.iary. Let u © h®, and let u,(0) = u(re®) for O<r<1 and OER. 
There is an fe & with P * f =u if and only if there is a set E C(—7,7) of 
measure 27 and such that the family {u,: 0 <r < 1} is equicontinuous at each 
GEE. 


Indeed, A= U,¢, where the union is taken over all sets E C (—7, 77) whose 
measure is 277. 
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1. Weak continuity and connectedness. As a generalization of a continuous 
function, Levine [2] gave the following definition of a weakly continuous function. 


Definition 1. A function f: X — Y is weakly continuous at x € X if given any 
open set V in Y containing f(x), there exists an open set U in X containing x 
such that f(U) Cc V. If this condition is satisfied at each x € X, then f is said to 
be weakly continuous. 
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1. Weak continuity and connectedness. As a generalization of a continuous 
function, Levine [2] gave the following definition of a weakly continuous function. 


Definition 1. A function f: X — Y is weakly continuous at x € X if given any 
open set V in Y containing f(x), there exists an open set U in X containing x 
such that f(U) c V. If this condition is satisfied at each x € X, then f is said to 
be weakly continuous. 
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Another generalization of a continuous function was given in [3] by Andrew and 
Wittlesy. 


Definition 2. A function f: X — Y is closure continuous at x € X if given any 
open set V in Y containing f(x), there exists an open set U in X containing x 
such that f(U) C V. If this condition is satisfied at each x € X, then f is said to 
be closure continuous. 

Obviously closure continuity implies weak continuity. The converse is not true. 


Example 1. Let X = {x, y, z,w} with topology {@, {x, y, z}, {z},{z, w}, X}. Let 
Y = {a, b,c, d} with topology {2@, {a, b}, {b}, {d}, {b, d}, {a, b, d}, {b, c, d}, Y}. Define 
f(x) = a, f(y) = b, f(z) = c, f(w) = d. Then f is weakly continuous since {a, b} 
= {b} = {a, b, c}, {b, c, d} = Y, {d} = {c, d}. But f is not closure continuous since 
{x, y, z} =X. 

Theorem 1 in [1] states that a closure continuous function f: X —> Y is 
connected; i.e., if C is a connected set in X then f(C) is a connected set in Y. The 
editor of this MONTHLY has forwarded to us communication from Professor J. 
Ferrer, pointing out that this assertion is false. He gave the following counterexam- 
ple. 


Example 2. Let X = R, the real line with the usual topology and let Y = the set 
of non-negative integers with the co-finite topology; i.e., a set is open in Y if it is 
empty or its complement is finite. Since the closure of each non-empty open set is 
all of Y, any function f: X — Y is closure continuous. Now let f: X — Y be 
defined by f (rationals) = 0; f (irrationals) = 1. Then fCX) = {0,1}, a discon- 
nected set in Y. Hence, f is not connected. 

The space Y in.the above example is 7, but not 7,. One might wonder whether 
a closure continuous function into a Hausdorff space is connected. The answer is 
no even if the range space Y is Urysohn, as the following example shows. 


Example 3. Let P be the upper half plane and L be the x-axis. Let X = PU L. 
If 7 * is the half-disc topology on X, then (X, 7 *) is a Urysohn space (see [4], 
Example 78, p. 96). Let Y be the relative topology that X interits by virtue of 
being a subspace of R*. The identity function f: (X, 7) > CX, 7*) is closure 
continuous but f takes the connected set L in (X, 7 ) onto the discrete set L in 
(X, Z *), showing that a closure continuous function can fail to be connected even 
in the presence of strong separation properties. 

Theorem 1 in [1] should be replaced by the theorem below. 


THEOREM 1. Let X be a connected space and let f: X — Y be a weakly continuous 
function from X onto Y (Y = f(X)). Then Y is connected. 


Proof. Change the statement “ f(U ) CV” in the proof of Theorem 1 in [1] to 
“#(U) C V”. 

Actually we have a little stronger conclusion: if f: X — Y is weakly continuous 
and C is a connected set in X such that Y = f(C), then Y is connected. 


2. Weak continuity and continuity. The following result is Theorem 1 in [2]. 


THEOREM 2. The function f: X — Y is weakly continuous if and only if for each 
open set Vin Y, f-'(V) Cc Int f-'(Y). 


CoROLLARY 1. If f: X — Y ts weakly continuous, then for each open set V in Y 
we have f—'(V) c f-'V) A W, where W is an open set in X. 
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Proof. Take W = Int f~'(V). 
Corollary 1 suggests the following generalization of continuity. 


Definition 3. A function f: X — Y is relatively continuous at x € X if given an 
open set V in Y containing f(x), the set f~'(V) is an open set in the subspace 
f~'Y). If this condition is satisfied for each x € X, then f is said to be relatively 
continuous. 

If a function f: X — Y is continuous, then for each open set V in Y, f~'(V) is 
open in X. Since f~'(V) Cf~'\(V), we know that f~'(V) is an open set in the 
subspace f~'(V). Therefore the following theorem has been established. 


THEOREM 3. A continuous function f: X — Y ts relatively continuous. The 
converse of Theorem 3 is not true. 


Example 4. Let X = R be the real line with usual topology. Let Y = {a, b} with 
discrete topology. Define f(x) = a for x € (—~,0] and f(x) = b for x € (0, +). 
Then f is relatively continuous since f~ '({a}) = f7'({a}) = (—~, 0] = f- Ca) n 
(—0,1) and f~'({b}) = f~\({b}) = (0, +). It is easily seen that f is not weakly 
continuous at x = 0. 

The significance of relative continuity is that it yields a decomposition of 
continuity with weak continuity as the other factor. 


THEOREM 4. A function f: X — Y is continuous if and only if it is weakly 
continuous and relatively continuous. 


Proof. The necessity is trivial. We prove the sufficiency. 

Let V be an open set in Y. By relative continuity, f- '(V) = f- (V) A W, where 
W is an open set in X. To prove the openness of f~'(V) in X, let x € f-'(V). 
Then f(x) € V and x € W. By weak continuity of f, there exists an open set U in 
X containing x such that f(U) C V. This means U c f~'(V). Since the open set W 
contains x, we may assume U Cc W. It now follows that x © Ucf-'(V) NW= 
f-'V). This shows that each point x € f~'(V) is an interior point and so f~'(V) 
is open in X. 


Theorem 4 answers a question raised in [1]. 


3. Weak continuity and interiority. In this section we give a sufficient condition 
for continuity, which involves weak continuity. 


Definition 4. We say that a function f: X — Y satisfies the interiority condition 
if Int f° '(V) cf '(V) for each open set V in Y. 
Continuity does not imply interiority condition. See the example below. 


Example 5. Let X = Y = the set of natural numbers with the co-finite topology. 
The identity function f: X — Y is surely continuous. But if V is any proper 
non-empty open set in Y, then V = Y and so Int f- (V) = X while f- (V) = V. 
Hence Int f- '(V) ¢ f-1(VY). 


THEOREM 5. Weak continuity plus interiority condition implies continuity. 


Proof. Let f: X — Y be a weakly continuous function satisfying the interiority 
condition. Let V be a given open set in Y. By weak continuity, we have f~ '(V) C 
Int f~'(V). By interiority condition, we have Int f~'(V) c f~ (V). Hence f~(V) 
= Int f-'(V), it is open in X. 
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Relative continuity plus interiority condition does not imply continuity. See 
Example 4. 

Interiority condition does not imply either weak continuity or relative continu- 
ity. See example below. 


Example 5. Let X = {0,1} be endowed with the co-countable topology, i.e., a 
set is open if it is empty or its complement is countable. Let Y = {0*, 1*} with 
discrete topology. Define f: X — Y by f (rationals) = 0*, f (irrationals) = 1*. 
Then f satisfies the interiority condition, but f is neither weakly continuous nor 
relatively continuous. 


4. Between weak continuity and continuity. We conclude with two results which 
indicate the extent to which weak continuity approximates continuity. 


THEOREM 6. Let f: X — Y be a weakly continuous function from a space X to a 
Hausdorff space Y. Then f has closed point inverses. 


Proof. Let q € Y. We must show that f~'(q) = {x © .X: f(x) = q} is closed in 
X, or equivalently A = {x € X: f(x) # q} is open in X. To this end, let x € A. 
Since f(x) # q and Y is Hausdorff, there exists an open set V in Y containing 
f(x) such that q ¢ V. By weak continuity of f, there exists an open set U in X 
containing x such that f(U) c V. We can prove U CA. If U ¢ A, there exists 
x’ € U such that f(x’) = q. Since f(U) C V, it follows f(x’) = q € V, contradict- 
ing the choice of V. 

Example 2 shows that Hausdorffness can not be replaced by 7, in Theorem 6. 

Recall that a subset of a topological space is called closure compact if each 
open cover of the set contains a finite subcollection whose closures cover the set. 


THEOREM 7. Let f: X — Y be weakly continuous and let K be a compact set in X. 
Then f(K) is a closure compact subset of Y. 


Proof. Let VY be an open cover of f(K) and set W’ = {(V © V: VN f(K) # @}. 
Then #’ is an open cover of f(K). For each k € K, f(k) € V, for some V, € 
By weak continuity of f, there exists an open set U, in X containing k such that 
fU,) ¢V,. The collection {U,: k € K} is an open cover of K and so there is a 
finite subcover {U,: k © Ky}, where K, is a finite subset of K. Clearly 
{V,: k © Ko} covers f(K), or f(K) is a closure compact subset in Y. 


Added in the Proof. Let X = R* — {origin} with the usual topology and Y = R? 
with the usual topology. It is well known that the familiar function (from complex 
variables) f(z) = In|z| +i Arg z where —7a7 < Arg z <7 fails to be continuous 
on the negative reals. We leave it to the reader to verify that f is not even weakly 
continuous. However, f does satisfy the inferiority condition and f is relatively 
continuous. 
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On the Product of Two Alternating Matrices 


Dracomir Z. Doxovié 
Department of Pure Mathematics, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1 


We consider n by n matrices A = (a; p) over a commutative ring R. By A’ we 
denote the transpose of A. We say that A is alternating if A’ = —A and a,,; = 0 
for i = 1,...,m. Our main objective is to prove the theorem stated below. The 
article may be also of interest to the readers who are not familiar with the purely 
algebraic method of specialization which can often be used as a substitute for the 
usual topological arguments in the setting of finite dimensional vector spaces over 
R or C. 


THEOREM. Let A = BC where B and C are alternating n by n matrices over R, let 
f(A) be the characteristic polynomial of A, and assume that n is even, say n = 2m. 
Then there exists a monic polynomial p(A) over R of degree m such that 


f(A) =p(A)’, — p(A) = 0. (1) 


REMARK. When R is a field of characteristic # 2 it was shown in an Elemen- 
tary Problem in [2] that A has no simple eigenvalues. In fact the solution of that 
problem given by O. P, Lossers shows that the geometric multiplicity of each 
eigenvalue of A is at least 2. Both of these results are obvious consequences of our 
theorem. 

We recall some properties of pfaffians and refer the reader to Lang’s book [1] 
for more details. Let X be an alternating n by n matrix over R. If n is odd then 
det(Y) = 0. If n = 2m is even then det(X) = Pf(X)* where Pf(X), the pfaffian 
of X, is a homogeneous polynomial of degree m in the entries of X. Consequently 
we have Pf(X’) = Pf(—X) = (— 1)” Pf(X). If we assume in addition that X is 
invertible then X~! is also alternating. Indeed XX~' = I, implies that (X7~')X’ 
=], and so (X~')' = —X7'. Furthermore all diagonal entries of X~' are zero 
since the cofactors of X corresponding to diagonal entries are zero. 


Lemma. If X is an invertible alternating n by n matrix over R, n = 2m, then 
Pf( X~') - Pf(X) =(-1)". 

Proof. If Y and Z are n by n matrices over R and Y is alternating, then 
Pf( ZYZ') = det(Z)Pf(Y), 


see [1, XIV, §10, Theorem 7]. By setting Y= X7!, Z = X, we obtain Pf(X’) = 
det(X) - Pf(X~!). Since det(X) = Pf(X)’, the assertion of the lemma follows. 
|| 


We are now ready to prove the theorem. 


Proof of the Theorem. Let x;; and y,;,1<i<j <n, be independent indetermi- 
nates over the ring of integers Z. Let S$ denote the corresponding polynomial ring 
ZX; ;, Vij; 1 <i <j <n]. Denote by X resp. Y the n by n alternating matrix over 
S whose (i, j)-th entry is x;; resp. y,, for 1 <i <j <n. Assume that there exists a 
monic polynomial g(A) over S of degree m such that det(AL, — XY) = q(A)* and 
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q( XY ) = 0. Let ¢: S > R be the homomorphism which sends each x; > a,; and 
each y,, > b,,. Then 6-X) = B, d(Y) = C and if p = 6(q) we see that (1) holds. 

Hence it suffices to prove (1) in the special case where R= S, B = X, and 
C = Y. We may view X and Y as matrices over the field of fractions K of S$. Then 
X is invertible, X~' is alternating, and we have 


det(AI, — XY) = det(.X) - det(AX7! — Y) = P(X)? Pf(AX71- YY)’. 


Hence, if p(A) = (—1)"PfLX) - PA X7' — Y), then f(A) = p(A)’ and, by the 
lemma, p is a monic polynomial in A of degree m. Next we need Gauss Lemma 
and the fact that R is a unique factorization domain, see [1, V, §6]. Since 
p(A) € K[A] and f(A) € RIA], the equality f(A) = p(A)* and the Gauss Lemma 
imply that p(A) € R[A]. This proves the first assertion of (1). 

We claim that all roots of p(A), in some algebraic closure E of K, are simple. 
For this purpose it suffices to show that some specialization of p(A) has only 
simple roots. For instance we can take the specialization @: R — Z such that 


s(x) = (9 ee 2 | 


_f{ O 1 0 2 — 0 m 
sry = (9) e( 2 5) ° o(S 0) 
Then p(A) > (A + 1A + 2)°+: (A +m) and our claim is established. 
At this stage of the proof we know that every eigenvalue, say uw € E, of A = XY 
has algebraic multiplicity 2 and so 


1 < dimker( pl, — A) < 2. 


Since wl, — A = X(wX~'— Y) and wX~' — Y is alternating, we conclude that 
ker(ul,, — A) = ker(uX~' — Y) and that ker(wJ,, — A) has even dimension. Con- 
sequently the geometric multiplicity of 4 is 2 and so p(A) = 0. Thus the second 
assertion of (1) also holds. 4m 


The reader may now wonder what can be said about the case when n is odd. 
The answer can be obtained easily from the theorem. 


CoroOLiary. Let A, B,C and f(A) be as in the theorem except that now n is odd, 
say n = 2m + 1. Then there exists a monic polynomial p(A) over R of degree m such 
that 


f(A) =Ap(A)’, — Ap( A) = 0. (2) 


Proof. Let B, = B ® (0), C, = C © (0), and A, = B,C,. By the Theorem we 
have 
det(Al, 41 — Aj) = pa) 


and p,(A,) = 0 where p,(A) is a monic polynomial over R of degree m + 1. Since 
the last row of A, is zero, we have A|p,(A)* and consequently p,(A) = Ap(A) and 
f(A) = Ap(A)’. From p,(A,) = 0 it follows that p,(A) = 0. Thus (2) holds. m 
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Two of the most useful elementary facts about probability are these: 
P(A OB) = P(A)P(B) (1) 
if A and B are independent events; 
P(A UB) =P(A) + P(B) —-P(ANB) (2) 


always. Numerous exercises based on these ideas are found in every introductory 
text. What is not widely recognized, however, is that one can get a lot of mileage 
from these (or related) elementary equations even in complex situations if they are 
used recursively in a proper manner. Recursion and a few elementary facts about 
probability can provide a basis for calculating the risks inherent in very compli- 
cated structures. 

The need to assess the risks found in complex technological ventures has 
become readily apparent with disasters such as Three Mile Island, the Union 
Carbide accident at Bhopal, India, the space shuttle Challenger’s explosion in 
1986, and most recently the Exxon Oil spill at Valdez. Engineers have long 
wrestled with such reliability problems using techniques quite different from the 
ones we will present here. Our discussion will lead to some contemporary topics in 
discrete mathematics such as trees and computational complexity and toward 
implementation of a simple reliability algorithm in Pascal, illustrating in the 
process the advantage that Pascal has over FORTRAN as a language for discrete 
mathematics. A brief concluding commentary will indicate the impact that the 
ideas described here have had on reliability research during the past decade. 

FiGuRE | provides an illustration of a type of model widely used. This figure is a 
simple example of a fault tree. The leaf nodes are nodes 5, 7, 8, 9, 10, and 11, and 
the other nodes are internal nodes. The tree is a full binary tree in that each node 
has either two children or no children. The components of the fault tree are of 
three types: AND gates (intersections), OR gates (unions), and basic events. (The 
basic events correspond to the leaf nodes.) The purpose of the fault tree is to 
represent the top event (labeled “system failure” in this case) in terms of unions 
and intersections of basic events, which are assumed to be mutually independent 
and whose probabilities of occurrence are assumed to be known. In real applica- 
tions (such as nuclear or chemical reactors or the space shuttle), fault trees are 
used to model failure modes of very complex systems, and in such cases evaluation 
of top-event probability is an important measure of the reliability of the system. In 
the case of FiGURE 1, system failure occurs if there is both a power outage and 
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cooling system failure. A power outage may be caused by either sabotage or by a 
transformer failure, whereas the cooling system fails if a valve sticks or the pump 
fails. Transformer failure and pump failure are in turn defined as unions or 
intersections of other basic events. For simplicity we shall assume throughout this 
discussion that fault trees are represented in binary form. This involves no loss of 
generality since unions or intersections of more that two events may be rewritten 
in terms of binary operations simply by introducing parentheses. (In other words, a 


non-binary fault tree can always be redrawn as a binary tree by introducing 
additional nodes.) 


System 

Failure 
Power Cooling System 
Outage Failure 


Sabotage Transformer Valve Pump 
Failure Stuck Failure 
Power Circuit Breaker Equipment Lo 
_| Surge Fails Failure P 


ss of 
ower 
(9) 8) 7) 
QO OR gate (union) 


C\ AND gate (intersection) 
C) Basic Event 


Fic. 1. A simple fault tree. 


If it is known that the leaf nodes in the fault tree are mutually independent, 
then an easy bottom-up hand calculation suffices to calculate the probability of the 
top event. Starting at the bottom one works up the tree calculating the probability 
of every event encountered using equations (1) and (2). 

For reasons that will become apparent shortly, however, it’s much preferable to 
take a top-down point of view and to think about the problem in terms of 
recursion. To do so let’s first denote by p, the probability that the event 
corresponding to node n in the fault tree does in fact occur. Then the goal is to 
evaluate p,, and a recursive top-down calculation can proceed as follows: 


P, = P2P3 = (P4 + Diy — PaP1)( Ds + Do — DsDo)- 
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The computation is finished when p, and p, are replaced by 
P4 = PoP 10 
Do =P7 + Dg — D7Dz. 


If instead of writing p,, we write p(n), then we can view the above calculation as 
evaluation of a single value of the recursive function p defined by: 


(i) p(n) = probability of node n if this node is a basic event, 
(ii) p(n) = p(j) p(k) if n is an AND gate with children j and k, 
(iii) p(n) = p(j) + p(k) — p(j) p(k) if n is an OR gate with children j and k. 


To eliminate the duplication of function evaluations in (iii), it is preferable in this 
case to write 


Gii) p(n) = 1 — 1 — p(y) — plk)). 

If this were the end of the story, then fault trees would be a trivial concept. By 
the procedure just described, the probability of the top event could be determined 
by calculating a single value of a recursive function, and the nature of the 
recursion requires that one recursive function call be made for each node in the 
fault tree. This fact is implicit in G)—-(iii) if one removes the duplication in (iii) as 
suggested. Therefore the amount of computation required to determine the 
probability of the top event is directly proportional to the number of nodes in the 
tree. This is a “linear time” algorithm and represents a happy state of affairs. It’s 
equally easy to observe that the bottom-up approach also requires visiting each 
node in the tree only once, so in terms of the amount of calculation required the 
two approaches are equivalent. 

What makes the real world infinitely more interesting (and complex) than this 
simple example is the fact that events often are not independent. Consider FIGURE 
2 for example. The only feature that is changed from FiGureE 1 is found in the 
contributing factors to POWER OUTAGE, the event labeled node 2 in the fault 
tree. SABOTAGE has been eliminated as a cause, and LOSS OF POWER now 
appears. This basic event LOSS POWER is the same event that leads to PUMP 
FAILURE, event number 6. So now the mutual independence of different parts of 
the tree is destroyed. (Notice that the two “loss of power” events are now given 
the same label “7.” This is necessary since basic events with distinct labels are 
assumed to be mutually independent.) The number of distinct basic events is 
reduced from six to five, but even if we consider these five still to be mutually 
independent, the fact that one of them appears at two different locations in the 
tree means that neither the bottom-up nor the top-down procedure previously 
used will now be valid. The reason is that both procedures assume that whenever 
an AND or OR gate appears in the tree, the children will represent independent 
events. This is no longer the case for the top node in the tree. Since LOSS OF 
POWERR is a contributing factor to both event 2 and event 3, the children of event 
1 are no longer independent. 

We mention at this point that duplication of interior node labels at multiple 
locations in the fault tree is perfectly acceptable as long as the entire subtrees 
rooted at each location are identical. For reasons of convenience in the complexity 
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System | 
Failure 


Power Cooling System 
Outage Failure 


2) 3) 


Loss of ~ Transformer Valve Pump 

Power Failure Stuck Failure 
Power Circuit Breaker Equipment Lo 
Surge Fails Failure P 


ss of 
ower 
(9) (8) 7) 
QO OR gate (union) 


C\ AND gate (intersection) 


C) Basic Event 


Fic. 2. Appearance of node 7 at two locations in the tree means that AND gates and OR gates no 
longer always have independent children. 


analysis given later, however, we will assume that distinct interior nodes have 
distinct integer labels. 

The net effect of duplicated leaves in the fault tree is to render the bottom-up 
approach useless and to force reconsideration of the top-down approach. Equation 
(1) no longer offers a valid means for treating AND gates. Equation (2) is still valid 
since it doesn’t require independence. So OR gates can be treated in terms of 
equation (2) if we can decide how to handle AND gates. To better understand the 
situation, we introduce some appropriate notation: 


Notation. E, denotes the event corresponding to the node labeled “i” in the 
fault tree. Is S is a set of node labels in the fault tree, then the probability that all 
of the events E,, i € S, occur is denoted by 


Ds = P| Nz). 


iecS$ 


For an AND gate n with dependent children j and k, it is no longer true as in 
equation (1) that the probability of the AND gate is the product of the probabili- 
ties of its children. Using the notation just introduced, however, it is true that 
Pony = Pr, ky: More generally, suppose now that S is a set of node labels, that n € § 
is the label of an AND gate in the tree, and that j and k are the children of node 
n. If we let C denote the intersection of the collection of events {E;: i € S and 
i#n}, then we have available to us the following recursive mechanism for 
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eliminating AND gates from the set S: 


Ds =P a E,| = P(E, OC) = P(E,NE,NC)=DPsug.g-m (3) 


ieS 


If n were an OR gate rather than an AND gate and if C denotes the same set as 
above, then the manner in which we work our way down the tree is a bit different: 


ps = P( () B,) = P(E, NC) = P(E, UE) OC] 


iecS$ 
=P|(E,NC)U(E,NC)| 
= P(E,NC)+P(E,NC)—-P(E,NE,NC) 
= Psu T Psutey-tny ~ Psug-eny (4) 


The value of equations (3) and (4) is that they do not require that the events E, 
and E, be independent. This enables us to proceed recursively all the way down 
the tree from the top event to the basic events without concern as to whether the 
basic events occur at multiple locations in the tree. When we have finally 
eliminated all AND gates and OR gates in this manner, we are left with a large 
number of terms of the form p, where T is a set of node labels representing basic 
events in the tree. By assumption these represent mutually independent events, so 
each such term p, may be immediately evaluated as 


Pr = LT P(E,). (5) 


As an illustration, calculation of the probability of system failure in Figure 2 
might begin as follows: 


Pay ~ Pe,3) 
= Pg,4 + P37 — Pe.4.7 
= Pg,9,19 + (Ps,7 + Po. ~ Ps,6,7) ~ PB,7,9, 10): 


In this sample computation, node 1 Is first replaced by its children using equation 
(3). Then node 2 is removed using equation (4). Next the term Pg, 4, is replaced by 
Pp,9,10) USing equation (3), Pa. is replaced by ps» + Do,7 — Ps,6,7 by using 
equation (4), and py 47 becomes py 7,9 19, by again using equation (3). In the 
term Ds 7 in the last expression, each element of the set {5,7} is a label 
corresponding to a basic event. Therefore p,; 7 may be evaluated as the product of 
the probability of .7, times the probability of E,. The remaining terms of the last 
expression require further replacement of AND or OR gates by their children 
using equations (3) and (4). The entire computation is mapped in FicureE 3. The 
arithmetic itself is not shown in the figure, and only the subscript set S is shown 
for each term p, encountered during the computation. The purpose is to show the 
manner in which each term is evaluated by reference to other terms until finally all 
AND and OR gates have been removed from every term and equation (5) can be 
used. 
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3,7 3,4 3,4,7 


59,10 
35.10 | 


56,910 


Fic. 3. One possible recursion tree generated by recursively applying equations (3) and (4) to calculate 
the probability of the top event in the fault tree of FiGure 2. 


FIGURE 3 is a recursion tree. While the fault tree itself is binary, each node in 
the recursion tree is either a leaf, a node with one child, or a node with three 
children. These cases correspond respectively to these cases in which: 


e all nodes in the set are basic events; 
e an AND gate is used for pivoting via equation (3); 
e an OR gate is used for pivoting via equation (4). 


The recursion tree will not be uniquely determined unless one specifies a rule for 
choosing the precise node to use in the decomposition process at each step. 

The shape of the recursion tree is not as important as its size, however, for the 
size of this tree reflects the amount of computation that must be performed to 
evaluate the probability of the top event. One measure of size is simply to count 
the number of nodes in the recursion tree. A more convenient measure, however, 
is to count only the leaf nodes. One reason that it is preferable to count only the 
leaves is that this eliminates the quandary over how to terminate the recursion. For 
example, one could terminate the recursion by expanding the product in equation 
(5) whenever a set of basic events is encountered. Alternatively, one can continue 
to think recursively and when S is a set of basic events, compute p, as ps = 
PinyPs —(n) where n © S. In the latter case we continue the recursion until S is 
empty, in which case p, = 1. If we showed this additional recursion in the 
recursion tree of FIGURE 3, each leaf in the tree as shown would have a “branch” 
descending down from it in which the basic events are themselves eliminated 
one-at-a-time until only the empty set remains. Notice that this causes no change 
in the number of leaves in the recursion tree. So whether this kind of “tail 
recursion” is used or not will be irrelevant if we measure the complexity of the 
computation by the number of leaves. Another benefit to counting only the leaves 
is that this measure of computational complexity is independent of the manner in 
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which the nodes are selected for “pivoting” via equations (3) and (4) throughout 
the recursive process. 


THEOREM. Suppose equations (3) and (4) are used recursively to determine 
top-event probability for a full binary fault tree in which only basic events appear at 
multiple locations in the fault tree. Let ys be the recursive function defined on the set 
of node labels in the fault tree as follows: 


(1) wW(n) = 1 if n is a basic event (leaf node), 
(2) wn) = W(j)W(k) ifn is an AND gate with children j and k, 
(3) Wn) = WC) + WK) + W(DW(K) if n is an OR gate with children j and k. 


Then for every node n in the fault tree, W(n) gives the number of leaves in the 
recursion tree produced by using equations (3) and (4) recursively to evaluate 
P(E,,) = Dyy. Un particular the number of leaves in the recursion tree generated by 
calculating the probability of the top event does not depend on the order in which 
AND gates or OR gates are selected for decomposition via equations (3) and (A4).) 


Proof. If S is a set of nodes in the fault tree and m and n are interior nodes 
(not leaves) in S, it is easy to check that the terms generated by elimination of m 
and n from S using equations (3) and (4) do not depend on which node (m or n) is 
eliminated first. This is true whether both are AND nodes, both are OR nodes, or 
there is one of each. From this observation it follows that the final collection of 
terms that remain after all interior nodes in S are eliminated in this manner 
(corresponding to the leaves in the recursion tree) does not depend on the order of 
application of equations (3) and (4) or the choices of nodes to which these 
equations are applied. 

As for the function yw, first note that if n is a basic event in the fault tree, then 
Py is known and recursion terminates immediately (1 leaf node). So the number of 
leaves generated in evaluating p,,, is 1, which is (7) by definition. 

Suppose now that n is an AND gate with children j and k. Suppose also that it 
is known that the desired property holds for nodes j and k, that is W(j) and W(k) 
represent the number of leaves in the recursion tree generated by evaluation of 
Py, and py, respectively. The evaluation of p,,, begins with the fact that p,,. = 
Pyj,~) Furthermore, if node & in this set is ignored while j and all its descendents 
that are not basic events are eliminated via use of equations (3) and (4), the 
number of terms generated is then (j). (The process is the same as it would be if 
k were not in the set. If k were not present, then each term so generated would be 
a leaf in the recursion tree generated by evaluation of p, Fe) Since k is present, 
however, the remaining terms will all now consist of k together with some set of 
basic event node labels. (This is the point in the argument at which the fact that 
interior nodes have distinct labels is important. If this were not the case, k might 
have been eliminated from some of the terms in the process of eliminating j and 
its descendents.) In each remaining term k and all its non-basic-event descendents 
must be removed, and the manner for doing so is identical to what it would be if k 
were now the only element in each such set. Therefore each term obtained in 
removing j and its non-basic descendents leads in turn to W(k) leaves in the 
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function P(S) 
input: A set S of positive integers corresponding to node lables in the fault tree 
output: The probability that all the events corresponding to node labels in S do occur 
begin 
if S is empty then 
P<- | 
else if there exists an AND gate nm in S with children j and k then 
P— P(S U {j,k} — {n}) 
else if there exists an OR gate n in S§ with children / and & then 
P — P(S U {yj} -— (n}) + PCS U {k} — {n}) — PCS U {j,k} — {n)}) 
else let nm denote any basic event label in S then 
P = P(S — {n}) < probability of event n 
end 


Fic. 4. Pseudo-code for a recursive function to incorporate use of equations (3) and (4). The probability 
of the top event of a full binary fault tree is obtained by evaluating P(S) where S contains the single 
node number corresponding to the top event. This pseudo-code is easily translated into Pascal since 
Pascal supports both set operations and recursion. The requirement that the tree be full binary is not 
essential, but it simplifies the description of the manner in which AND gates and OR gates are treated. 


recursion tree, for a total of W(j)w(k) = w(n) such nodes. Therefore (1) has the 
desired characterization. 

The argument for OR gates is similar. One then starts instead with the equation 
Pony = Poy + Py — Py.xy Uf one assumes that the children of n have the desired 
property, then w(/) is the number of leaves generated in evaluation of Py; W(k) is 
the number of leaves generated in evaluation of p,,, and as argued in the 
preceding paragraph, w(j)y(k) is the number of leaves generated in evaluation of 
Pyj, 4: Thus evaluation of p,,) leads to w(j) + W(k) + W()W(k) = Wn) leaves. 

Because the number of terms generated by use of equations (3) and (4) is so 
great, hand calculations are not feasible even for relatively small trees. The beauty 
of equations (3) and (4) is that they represent an easily programmed recursive 
approach, especially if the programming language supports both recursion and 
standard set notation (as does Pascal). In fact, the entire manner for evaluating the 
probability of the top event based on repeated use of equations (3) and (4) is 
wrapped up in the recursive function definition shown in Figure 4. To implement 
this function in Pascal requires only a few appropriate type declarations to define 
the data structures used to represent nodes in the fault tree and a simple utility 
function to return the node number of an AND gate, OR gate, or basic event in S. 
To evaluate the probability of the top event, one simply issues the initial call to 
evaluate P(S) where S is the set containing the single node number corresponding 
to the top event. A Pascal program to evaluate top-event probability based on this 
recursive function definition consists of fewer than 100 lines of source code, and 
most of that is used to read the fault tree from an input file. (An easy way to 
represent the fault tree is to let each line of the input file represent a node in the 
fault tree and consist of a character indicating node type followed by left and right 
children in the case of an AND or OR gate or the probability in the case of a basic 
event.) 

The algorithm based on equations (3) and (4) appears in [3]. Its role in fault-tree 
research is somewhat akin to Euler’s method for numerically solving differential 
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equations. Because of its intuitive appeal and simplicity, its value is primarily 
pedagogical. Euler’s method is shunned for serious work because other methods 
are much more accurate. In the case of the fault tree algorithm, the limiting factor 
is the computational complexity as described by the recursive function w of the 
theorem. An attractive feature of the function w is that evaluation of yw requires 
only a number of computations proportional to the number of nodes in the fault 
tree. This means that we can use the recursive function & as a means of 
pre-calculating the number of steps required to apply equations (3) and (4) as a 
means of evaluating top-event probability in any fault tree. 


An Illustration. Figure 5 has appeared frequently in the scientific literature 
[1,2,4, 5]. Once it was in fact used to demonstrate a special electronic apparatus 
called ESCAF developed under the auspices of the French Atomic Energy Agency 
[2]. Notice that there are numerous replicated basic events making large modules 
in the fault tree dependent upon one another. 

The top event is node 21, and this tree is still small enough that a manual 
calculation of (21) proceeds quickly. Alternatively, it is a simple matter to embed 
the recursive function w in a simple program and to execute the program on the 
fault tree of Figure 5 and learn that #(21) = 1,000,447. Therefore use of equa- 
tions (3) and (4) to evaluate the probability of the top event would generate a 
recursion tree with 1,000,447 leaves. 


& 


Fic. 5. Fault tree from [1} which has been used to demonstrate a number of algorithms. 


How big is big? Fault trees of much greater size and complexity than the one in 
FiGuRE 5 are routinely encountered in real applications. Is there no hope for using 
recursive methods on such trees? Fortunately there is, but the moral of the story is 
that independence in the fault tree is ignored only at great peril. Whereas the first 
simple approach examined requires that independence be present everywhere in 
the fault tree (that is, no duplication of basic events is permitted), the approach 
based on equations (3) and (4) makes no use of independence whatsoever except in 
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An “QOceans of Zeros” Proof That a Certain Non-Liouville Number 
is Transcendental 


M. J. KNIGHT 
P. O. Box 732, Lompoc, CA 93438 


This note contains a simple new proof of the transcendence of the number F: 


F= ¥ (1/B), (1) 
n=0 
for any positive integer B > 1, and is well suited to a classroom presentation. It is 
an argument using only the definition of a transcendental number and the 
representations of numbers in the bases 2 and B. The powers of F, when written 
in base B, become more and more difficult to describe, but may be described well 
enough to yield transcendence. 

The first numbers proven transcendental were the Liouville numbers, character- 
ized by having orders of approximation of arbitrarily large sizes (see [3] for 
background material and additional references on transcendental numbers). The 
proof that they are transcendental uses Liouville’s theorem that the order of 
approximation possible for an algebraic number of degree m is less than m + 1. 
Although Baker calls the Liouville theorem the first practical criterion for proving 
transcendence [1], the argument given here could have been used by Liouville to 
prove that F and many other numbers are transcendental. 

The number F looks like it might be a Liouville number, since taking partial 
sums in (1) leads to an infinite number of very good rational approximations to F. 
In fact, these give approximations of order 2, not nearly good enough to prove F a 
Liouville number, nor even good enough to satisfy the very strong transcendental 
result of Roth [4]. The number F is not a Liouville number, because the continued 
fraction for F is known [6] and has bounded partial quotients, while the continued 
fraction for any Liouville number has unbounded partial quotients. The number F 
is an example of a Fredholm number; such numbers have been shown to be 
transcendental by different methods [2, 5]. 

The proof is by contradiction, so assume that F is algebraic and satisfies the 
polynomial 


c,F’+c,_,F°' +++: +c, =0 


with c; € Z, and c, > 0. Let H be the maximum of the absolute values of the 
coefficients of the polynomial. Rearranging, we have 


C,Fo +++) =), FS + °° (2) 


with all coefficients on both sides nonnegative. Think of this as an equation 
between two base-B representations, with the required carries not yet performed. 

The basic trick used is that, in base B, each of the powers of F with exponents 
less that e have a common, arbitrarily long, span of all zeros, i.e., an “ocean” of 
zeros, whereas the same span for the eth power has an isolated nonzero “island.” 
By picking such a span long enough, based on e and H, an isolated nonzero island 
will remain in the expression on the left of equation (2), but the same span will be 
all zeros on the right. Thus, the equality cannot hold. 
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We now specify the location of the ocean of zeros and the nonzero island 
(consideration of the effect of carries will follow). 
Let 


f(x) = Lex™, 
n=0 


so that F = f(1/B), and let a(r, k) denote the coefficient of x” in f(x)*. Observe 
that a(r, kK) is equal to the number of ways that r can be written as a sum of k 
powers of 2, where different orderings of the terms are counted as distinct. 


LEMMA 1 (Oceans of Zeros and the Island). Fix positive integers e and m, and 
consider the exponents k for which 1 < k <e. For all r in the span N — (2"7' — 1) 
<r<N+(2” — 1), with N = (2° — 1)2”, ar, k) satisfies 


a(r,k) = (6 ifr = Nand k =e 
0, otherwise. 


Proof. The binary expansion of N is 111...111000...000, with e 1s and m Os. 
If r is in the given interval and not equal to N, then r has more than e 1s in its 
expansion and a(r, k) is 0 by the remark before the lemma. Suppose r = N. Then 
N can be written as a sum of k powers of 2 only if kK = e, and in that case it can be 
done in e! ways, since all e! permutations of the distinct powers of 2 have to be 
counted. This completes the proof. 


Consider equation (2) represented in base B, still without carries, and use 
Lemma 1 to describe the powers of F for the span of digits given by Lemma 1], 
with the value of m to be chosen later. The left-hand side will have this span be all 
zeros, except at digit N. The right-hand side will have this span consisting only of 
zeros. Both sides will have nonzero entries at some digits beyond N + (2” — 1). 

Now consider all possible carries. The term at digit N, on the left-hand side of 
equation (2), is c,e!, which is independent of m, so for large m all of its carry will 
remain in the left half of the span, forming the island. 

The carries from all locations beyond the span, for either side of equation (2), 
will not reach position N. This will follow from Lemma 2, which shows that no 
single power of F will have carries that reach that far, for the sum on either side 
consists of at most e + 1 terms, each with a positive coefficient bounded by H. But 
since both e and A are independent of m, the effects of multiplying by H and 
adding no more than e + 1 terms may be made insignificant by choosing m large. 


LEMMA 2 (carries for the kth power of F’). Let e, m and N be as in Lemma 1, 
with m large, and let R = N + 2”. Then 
°c a(r,k 1 
Lu BNar 
r=R B 


Proof. First observe that a(r,k) is bounded by the product of k! with the 
number of all possible choices of k powers of 2 that are not greater than r. This 
product may be bounded by (1 + log, r)*k!, which, for large r, is bounded by r. 
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Thus the sum in the lemma is bounded by 


oor f =) Ip ea 
— = —_ —o- __ —_— + _ 
r R R+1 |° 
oR B B B B** 
This equality may be shown by multiplying the series on the left by (1 — B~')* and 
telescoping the result. For m large, the expression on the right is less than 
B-+*D. as required. 

Thus these carries do not affect the island. The carries, in fact, are insignificant 
(around m places versus 2” places needed to reach the island). The nonzero entry 
from a(N, e) will not be affected by other terms on the left-hand side and not 


matched by terms on the right-hand side. Therefore, (2) cannot hold and F is 
transcendental. 


Acknowledgement. This paper has benefited greatly from the valuable comments of the referees. 
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Simple Tests for Classifying Critical Points 
of Quadratics with Linear Constraints 
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1. Introduction. Optimization texts (e.g., [5, pp. 234—-5]) frequently give second- 
order tests for classifying (perhaps constrained) critical points of general smooth 
functions in terms of “‘accessory problems” involving quadratic forms 


q(x) = x’Ax, x? =[x,x, °°: x,] (1.1) 


(perhaps subject to linear constraints of the form Cx = 0), where A is n Xn 
symmetric and C is m Xn of rank m <n, all scalars being real. Advanced texts 
generally provide no algorithms for this classification, however, and the standard 
determinant tests used in most elementary texts (including those on multivariable 
calculus) are incomplete, inefficient, and difficult to motivate. The pivot algorithms 
presented below are simple to understand and to implement, and the sign rules are 
obvious. I believe that they provide a nice application of the Gauss-type proce- 
dures taught extensively to undergraduates. 
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Thus the sum in the lemma is bounded by 


e oF f ane ea 
—={1-—] |—+—- |. 
r R R+1 
oR B B B B** 
This equality may be shown by multiplying the series on the left by (1 — B~'!)* and 
telescoping the result. For m large, the expression on the right is less than 
B-N+tD as required. 
Thus these carries do not affect the island. The carries, in fact, are insignificant 
(around m places versus 2” places needed to reach the island). The nonzero entry 
from a(N, e) will not be affected by other terms on the left-hand side and not 


matched by terms on the right-hand side. Therefore, (2) cannot hold and F is 
transcendental. 


Acknowledgement. This paper has benefited greatly from the valuable comments of the referees. 


REFERENCES 


1. A. Baker, Transcendental Number Theory, Cambridge University Press, New York, 1975. 

2. J.H. Loxton and A. J. van der Poorten, Transcendental and algebraic independence by a method of 
Mahler, chapter 15 of A. Baker and D. W. Masser, Transcendence Theory: Advances and 
Applications, Academic Press, New York, 1977. 

3. Ivan Niven, Irrational Numbers, The Carus Mathematical Monographs, no. 11, The Mathematical 
Association of America, 1967. 

4. K. F. Roth, Rational approximations to algebraic numbers, Mathematika, 2 (1955) 1—20. 

5. Th. Schneider, Einfuhrung in die Transzendenten Zahlen, Springer, Berlin, 1957. 

6. J. Shallit, Simple continued fractions for some irrational numbers, Journal of Number Theory, 11 
(1979) 209-217. 


Simple Tests for Classifying Critical Points 
of Quadratics with Linear Constraints 


PAUL BINDING* 
Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada T2N 1N4 


1. Introduction. Optimization texts (e.g., [5, pp. 234—-5]) frequently give second- 
order tests for classifying (perhaps constrained) critical points of general smooth 
functions in terms of “accessory problems” involving quadratic forms 


q(x) = x’Ax, x’ =[x,x, °°: x,] (1.1) 
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symmetric and C is m Xn of rank m <n, all scalars being real. Advanced texts 
generally provide no algorithms for this classification, however, and the standard 
determinant tests used in most elementary texts (including those on multivariable 
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obvious. I believe that they provide a nice application of the Gauss-type proce- 
dures taught extensively to undergraduates. 
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In sections 2—4 we discuss a complete definiteness test for (1.1) without 
constraints, for a symmetric matrix A. If A is definite, i.e., g(x) = 0 only if x = 0, 
then a simple recursive reduction to the form 


q(x) =p,yj + +++ +D,y, (1.2) 


is possible for suitable linear combinations y, of the x, avoiding expensive 
eigenvalue calculations. The p, (which all have the same sign) may be calculated 
via nested subdeterminants of A, via successively completing the square, or as the 
pivots in the Gauss reduction of A to row echelon form. Completing the square is 
rejected in favor of determinants in [4], presumably because of the symbolic 
algebra involved. On the other hand, row-reduction of A needs only about n°/6 
multiplications to reduce the pivots. The only way determinants can compete is if 
they are evaluated as a nested set via pivoting, so one might as well use pivots from 
the outset, and avoid setting up the determinants. 

The situation is more clear-cut when some pivots vanish, since the determinant 
tests then need significant modification. Most authors who discuss the issue 
require, either implicitly or explicitly, the earlier nested determinants correspond- 
ing to all permutations of x,,...,x,; see e.g., [1,2]. This is computationally 
prohibitive for large n. There is an analogous pivot test for nonnegative definite- 
ness in [7, p. 257], but it is not clear how it applies when all the a,;, are zero, for 


example when A = |° ) , 


In Section 5 we admit the constraint Cx = 0 by applying the results of Sections 
3 and 4 to a “reduced” matrix 


R=N'AN (1.3) 


where the columns of N form a basis for the null space of C. Since N may be 
computed directly from the row echelon form of C, the complete algorithms are 
easily implemented by beginners. Moreover it is also easy to see why the method 
works, in contrast with standard determinant methods, which usually involve the 


“bordered” matrix B= | ° “|, and which we review in Section 6. When pivots 


vanish, expensive variable permutations are unnecessary, and the pivot method 
then has a considerable advantage, as we illustrate with a numerical example in 
Section 7. 


2. Basic pivoting. We start with some observations on reduction of the matrix 
A of (1.1) by elementary operations. If a,, #0 then n—1 elementary row 
operations convert each of a,,,..., a, to zero, and leave the first row unchanged. 
These operations correspond to premultiplication by elementary matrices, and 
taken together they convert A to L, A, say, where L, is lower triangular. (Iterating 
this procedure, we obtain the familiar LDU decomposition of A; see [7, p. 25] and 
Section 3.) By symmetry of A, the corresponding n — 1 column operations convert 
Ai2,+++,@,, tO Zero, SO 


T 
ai 0 


L, ALT = 
0 A 


, (2.1) 


>1 


where A,, is the result of applying the row and column operations to the bottom 
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right (n — 1) X (m — 1) submatrix of A. Combining (2.1) with (1.1) we have 
a(x) = ay yi +xS,As\xsy, (2.2) 
where y, is the first component of L>’x and x4, =[x,,..., x,]. 


Remark. The column operations do not affect A. ,, so we may obtain it by row 
operations alone. In fact if we first divide row one of A by a,, and next create the 
zeros underneath, then we obtain 


,, {1 ef 
na-[ s] 29 
where L’, is lower triangular and y, = [1 e’]x. Thus all the information in (2.2) 
may be obtained from LA, provided we store the “pivot” p, = a). 

Iterating the procedure used for (2.1), we obtain 


L,aut=| © 2.4 
i i 0 A,, ( . ) 

and 

i 
q(x) = ye PEYE + XS {AS (X5;; l<i<n (2.5) 
k=1 

where U, is upper triangular, D, = diag(p,,..., p;), the y, are components of 
L; "x, and x4, =[x,,,,...,x,]. This reduction is possible provided each of the 
“pivots” p, # 0, and it is equivalent to “completing the square” in q(x) to isolate 
the terms in x,,...,x;. When i = n, we obtain (1.2). Alternative arguments may 


be found in, say, [4, $5.11] or [7, $6.2]. 


3. Unconstrained definiteness. As we have seen, calculation of the right side of 
(2.1) is a little more efficient (and familiar) via (2.3). Moreover (2.1) also shows that 
A. , is symmetric, and this can be used as a check on the arithmetic. Alternatively 
we can complete the A., by symmetry, saving nearly half the work: indeed it can 
be shown that only n(n — 1)(1 + 4)/6 multiplications are needed to reduce A to 
an upper triangular matrix U with unit main diagonal. With reference to (2.5), the 
successive pivots are the p,, and y = Ux, so perhaps the simplest method of 
computing the row echelon form of A also yields (1.2). Moreover y = Ux shows 
that the y, are linearly independent combinations of the x,, so q is positive 
definite (PD) if the p, are all positive. Further a;, = q(e;) where e; is the ith 
column of J. Thus p, = a,, > 0 if g is PD, and q cannot be PD if any a,; < 0. 
Similar reasoning applies to the other A, ,, and leads us to the following. 


PD Test. 


0. set i = 1. 

1. Test each a,, and stop if < 0: q is not PD. Stop if i = n: q is PD. 
2. Calculate A.,, as in (2.3)—necessarily a,, # 0. 

3. Replace A by A,,, i by i + 1, and return to 1. 


The top left i x i determinants a; of A can be shown to satisfy 


D; = , where ay = 1, (3.1) 
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p; as usual being the pivots for A. Thus PD and ND (negative definiteness) can 
also be tested via the a,. Note that the pivot sign rule for ND is obvious, but the 
determinant sign rule depends on (3.1) or an equivalent argument. Computation is 
also more direct via the p,, as we discussed earlier. 


4. A general definiteness test. Recall that (2.4) depends on the pivots p, not 
vanishing. If some p, = 0 then the first row of A.,, either vanishes or else has a 
nonzero element. In the former case, (2.4) is independent of x;,, and we may set 
p; = 0 and continue with A, (;,,1), which is defined by suppressing row and column 
one of A.,. In the latter case, we may write 


(A.;)1, =O0F (Ac) = (4.1) 
say. With x,,, = 1 and x,,, = 0 for the remaining positive k # j, we obtain 
i+] itk 
x. AS jXs; = 2yX;41 + (A3s))i 


which takes both signs if |x,,,| is large. Thus A is indefinite, and we are led to the 
following algorithm for positive semidefiniteness (PSD). 


PSD Test. As for the PD Test, but replace 1 by: 


1. Test each a,, and stop if <0: q is not PSD. Stop if i = n: q is PSD. Stop if 
a,,=0#a,, for some j: q is not PSD. If each a,; = 0, delete row and 
column one to obtain A,,, and go to 3. 


Corresponding determinant tests need caution, as the following shows: 


Example 1. A = I? |. Then a, = a, = 0 in G.1), but g may be PSD or NSD, 
depending on the sign of 6. 

A standard test (e.g., [6, p. 257]) is that g is PSD if all principal minors of A 
(obtained by deleting some subset of rows and corresponding columns) are 
non-negative. Although elegant, this implicitly amounts to permuting the x, in 
q(x), hence to permuting the rows and columns of A. Such tests are explicit in, for 
example, [1, §203] and [2], but are computationally prohibitive if n is large. 

A complete definiteness classification is easily constructed for the PSD test, in 
contrast with most tests in the literature (though see [6]). 


General Test. If the PSD test fails, either the pivot is negative, and then we may 
continue with Step 2, or else (4.1) holds. Thus either all the p, may be obtained, 
and the definiteness status may be read from (1.2), or else at some stage (4.1) 
forces qg to be indefinite. 


5. Constrained pivot tests. The first objective is to determine a basis of solu- 
tions to Cx = 0, and this may be accomplished simply once C is converted to 
reduced row echelon form by elementary row operations. Explicitly, we may write 


LC=[I E|P (5.1) 
where L is invertible, m < m, and lower triangular; J is m Xm; P is ann Xn 
permutation matrix; and FE is m xX 1. Here we write | = n — m and we recall that 
C is m Xn with rank m. It follows that the general solution to (1.3) may be 
written in “parametric form” as 


Px = at (5.2) 
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where t € R’ is a vector of parameters, and so 


x = Nt where N = P| |. 

We may now substitute this into (1.1) to obtain an unconstrained quadratic 

q(x) = t’Rt, in just / variables, where R= N/’7AN. In order to calculate R 
efficiently, we may partition 


F G 
Pl4p = | | 53 
G' H (9:3) 
where F is m Xm, G is m Xl and H is 1X1. Then 
R=E'FE-E'G-G'E+H (5.4) 


and we have the following 
Constrained Tests. 


1. Calculate E and P from (5.1), and R from (5.3), (5.4). 
2. Carry out the tests of Sections 3 and 4 with A replaced by R. 


6. Comparison with border tests. Early tests were of eigenvalue type, and 
evolved into bordered determinant tests; see [3, pp. 114-116], [2] for further 
references, and [2, p. 297 footnote] for a version involving the top left i xi 


subdeterminants B, of B = | o cI. Many recent authors have called this version 
the bordered Hessian test, since the accessory problems of Section 1 involve the 


Hessian A of the Lagrangian. It can be shown that 


Pi = Bisom/Pi+2m—1 1 < l < l, (6.1) 


where p, are the pivots of R tested in Section 5. It follows that g is PD (resp. ND) 
if 


0< (—1) "Bis om (resp. (—1)'"""Bis om); 1 <1< l. 


These sign conditions can confuse beginners, and even though Valiaho [8] removes 


the factor of (—1)” by using |’. | instead of B, the problem of motivation 


remains. 

It might be noted that some proofs of (6.1) use (5.4), but the computational 
aspect seems to have been ignored, except by Luenberger [5, §10.6]. His method 
takes longer to compute than ours, requiring Gram—Schmidt orthonormalisation 
and eigenvalue calculations. On the other hand it gives more detailed information 
than definiteness, aimed at convergence rates for critical point search algorithms. 


7. Zero pivots. If some pivots p; vanish, then the determinant methods of 
Section 3 may still be applied to the analogue of R with the t; permuted. Since 
t; =x,,,; by (.2), this is tantamount to recalculating the B;,.,, of (6.1) but for a B 
obtained by permuting the x,,,;. Spring [6] has derived this form of the border 
test, improving previous tests (e.g., [2], [8]) which implicitly or explicitly require 
permutation of all the x,;. Even Spring’s test is computationally prohibitive, 
however, if / is large. By contrast the tests of Section 5 are usually easier to 
calculate if some p, vanish. 


954 PAUL BINDING 


Let us illustrate this with the following. 


Example 2 (6, Ex. 3]. Classify q(x) = —x7 + x3 + x{ — x2 subject to x, — x3 = 
0. Spring calculates the top left 3 x 3, 4 x 4 and 5 < 5 minors of 


0 1 0 -!1 0 
1 -1 0 0 0 
B= 0 0 1 0 0 
—1 0 O 1 0 


0 OO 0 0 -1 


followed by those obtained from permuting the final three rows and corresponding 
columns. Although these calculations are simple, they are numerous, and we 
emphasize that this is the most efficient of the border tests. 

The test of Section 5 starts with A = diag(—1,1,1, —1) and C = [1, 0,—1, 0] so 


E=[0,-1,0],P =I and PUP =A. 
The partitioning (5.3) of P’AP gives 
F={|-1],G=[0,0,0] and HA = diag(1,1, —-1) 
SO 
E'FE = diag(0, —1,0), G'E = E’G =0 


and finally (5.4) gives R = diag(1,0, —1). The pivots p; are therefore 1, 0 and —1 
and so q(x) is indefinite subject to the constraint. Only two nontrivial multiplica- 
tions are required, both for E’ FE. 
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ELEMENTARY PROBLEMS 


E 3469. Proposed by Huseyin Demir, Middle East Technical University, Ankara, 
Turkey. 


Suppose P is a point in the interior of triangle ABC and suppose AP, BP, CP 
meet the lines BC,CA, AB respectively at the points D, E, F. Prove that the 
centroids of the six triangles PBD, PDC, PCE, PEA, PAF, PFB lie on a conic if 
and only if P lies on at least one of the three medians of the triangle. 


E 3470. Proposed by B. M. M. de Weger, University of Twente, Enschede, The 
Netherlands. 


Let S, be the nth Schréder number, defined as the number of polygonal paths 
in the Cartesian plane that start at (0,0), end at (n, n), contain no points above the 
line y =x, and are composed of steps taken from the set {(0, 1), (1, 0), , D}. 
E 3343 [1989, 734; 1991, 367] asserted that S, = 0 (mod 3) when n is even. 
Determine §S, modulo 3 when n is odd. 


E 3471. Proposed by William Calbeck, Florida International University, Miami, 
FL, and Bruce Reznick, University of Illinois at Urbana-Champaign. 


Let P, be the set of all integer valued polynomials of degree at most k}..e., the 
set of all polynomials p of degree at most k such that p(n) € Z for n € Z. (It is 
known that p € P, if and only if 


p(x) =a, + a,(7 | + a,(3} +o +a,( 5]. 
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where dp, 4,4 ,...,a, are integers.) Let r(k) be the smallest power of 2 strictly 
greater than k. 
(a) If p € P,, show that the sequence {(—1)?“}"_, is periodic with period r(k). 
(b) Show that any given sequence of plus and minus ones with period 2” occurs 
for some p in P5n_, 


FE 3472. Proposed by Hunter Snevily, California Institute of Technology, Pasadena. 


Suppose / and k are relatively prime positive integers and n =h +k. Show 
that for each j there are h™ (" ri | k-element subsets of {1,2,...,” — 1} with sum 
congruent to j modulo h. 


E 3473. Proposed by Lawrence J. Wallen, University of Hawaii at Manoa, 
Honolula. 


Suppose 0 < 0, <6, < ++: <6, <7. Let A be the n by n matrix whose entry 
in the :th row and jth column is sin|@, — 6,|. Show that 


|| All < cot(aw/2n) 


and that the estimate is best possible. Here || Al] is defined as sup |AX|, where the 
supremum is taken over all column vectors X in R” with Euclidean norm 1. 


E 3474. Proposed by Andrew Lenard, Indiana University, Bloomington. 


Let S be the set of nonnegative real numbers whose expansion to the base 4 
involves only the digits 0 and 1. 

(a) Prove that if x € S, y © S, and x #y, then(x +y)/2€S. 

(b) Suppose T is a set of nonnegative real numbers properly containing S. Prove 
that there exist x € T, y © T, x # y such that (x + y)/2 € T. 


SOLUTIONS OF ELEMENTARY PROBLEMS 


Path-Connected Homeomorphic Decompositions of the Disc 


E 3220 [1987, 680]. Proposed by Erwin Kronheimer, Birkbeck College, University 
of London. 


(a) Is it possible to decompose a closed 2-disc into two disjoint homeomorphic 
path-connected subsets? 

(b) Into four such sets? 

(The corresponding question for three sets seems difficult.) 


Solution I by the proposer. Let D = {(x, y)|x* + y” < 1} and let A be the union 
of the open upper half disc {(x, y)|x* +y* <1 and y> 0} with part of its 
boundary, as described below. 

In mostly counterclockwise order, the part consists of a point P on the open 
diameter {(x,0)| — 1 <x < 1}; a sequence of disjoint open intervals G, for n > 1 
on that diameter, converging monotonically to P from the right as n — ; the 
open semicircle G, = {(x, y)|x* + y* = 1 and y > 0}; a sequence of disjoint open 
intervals G_, for n > 1 on the diameter, converging monotonically to a point P’ 
from the left, where P’ is to the left of P; a sequence of disjoint closed intervals 
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Slowly Converging Sets 


E 3367 [1990, 150]. Proposed by John Kieffer, University of Minnesota, Min- 
neapolis. 


Let A, and A, be disjoint infinite sets of positive integers whose union is the 
set Z* of all positive integers. For each infinite subset S of Z* let f(S) be the set 
obtained from S by removing the least element s of S and then adjoining all the 
elements of A,_, less than s if s isin A,. Let f*(S) = f(f(S)), f°GS) = ff 7(S)), 
etc. Given any positive integer k, prove that for sufficiently large n either 


KEf"( Ay) Of"(A,) or KEF"™(A_) UF"(A,). 


Solution by David Callan, University of Wisconsin, Madison, and Douglas B 
West, University of Illinois, Urbana. For a given partition of Z* into two infinite 
sets A and B, we obtain the stronger result of computing the exact minimum value 
g(k) such that f"(A) and f"(B) agree on k (both contain or both omit) for all 
n > g(k). In terms of k alone, g(k) is bounded by the Fibonacci number F,, 
define by 


Let C; be the jth maximal block of consecutive integers intersecting only one of 
{A, B}, written in increasing order. Using concatenation to write sets in increasing 


order, we have Z*=C,C,C,C, °-:, and we may assume A=C,C3 °°: and 
B=C,C, +++. Let c; = |Cjl, and let z; be the largest element in C;; we compute 
g(k) from k and c,,..., oe where k € C,. Let m(S ) denote the smallest element 
of aset S, and let S’ = S — {m(S)}. Let D, = Cy +++ C;_C;, where j’ = j — 2j/2!] 


and C, = O(z, = 0). We have f(S) = D,S' for any set Ss with m(S) € C,,4. 

Define a sequence a, by a_,; = 4) = 1 and a, = a,_, + c,a,;_, for j > 1. For 
any set S with m(S) € C,,,, we claim f%(S) = S’ and, further, each intermediate 
set f‘(S) intersects [m(C j+v,% in S’, and the sequence of initial segments 
{f'(S) A C,C, +++ Che; <a, depends only on j. For j = 0, this is clear. Suppose 
j > 1 and mS) ECuy We have f(S) = DS’ = f(m(C;_)C,;S) [m(C,) vanish- 
ing] and so by the inductive hypothesis, f%(S) = f9%-'*%-2m(C,_)C,S') = 
fo%-(C,S') = S'. The other statements also hold by induction on j since they are 
true for i = 1. It follows that the set U = U(S,k) ={n: 1 <n<a, k ef"(S) 
depends only on k for k < z, and is empty for z; <k < mS ), 

Now let E; = CjCj42Cj+4 77° - We claim that (A, B} reaches {z, FE’, ,, E;, 2} by 
a;—-1 simultaneous iterations, and we track the iterates to ascertain on which k 
they agree. For j= 1, f"(A)=E, and f°(B) =z,E, as desired. We have 
(2, Fi .4, Ejay.) 2 CE +1 Ci 41 E42) after a;_, iterations. The former pair disagrees 
on z, and the latter. pair on z, + 1. After ia, further steps, for 1 <i <c,,,, the 
resulting pair disagrees on z, + 1 + 7 and the ‘final pair is (E,, 3, 2;,,E),2). This is 
because each of these groups of a; steps erases the current first term of each set, 
by the first result of the previous paragraph. (Note the reversal of order in the final 
pair.) Thus by the inductive hypothesis {A, B} reaches {z,,,E\.5, E;,3} in (a; — 1) 
+ a,;_, + (c;,, — Ida; = a,;,,; — 1 steps as required. Observe that the entire se- 
quence (z,F),,, FE...) > (E,43, 2;4;E}42) agrees on all k <z,: for the first a;_, 
steps by inspection, thereafter by the results of the preceding paragraph. Further, 
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agreement holds (for the same reasons) on z, + i at all pairs following the noted 
disagreement, for 0 <i <c,. This determines g and yields g(z,) = a,, g(z, + i) = 
a;._, + la; for 1 <i <c,,\. 

Finally, we prove g(k)<F, for every partition A,B. From the formula 
g(z; +i) =a,;_, + ia;,, we have g(k) = g(k — 1) +a, if z;+1<k <z,,,, and 
g(k) = g(k — 1) +a4,_, if k =z, + 1. Since a, = g(z;) for all j and the sequence 
g is increasing, we have g(k) < g(k — 1) + g(k — 2), with equality if and only if 
kK=z,+2ork =z,+1=2z,_, + 2. In particular, g(k) = F,, if and only if c; = 1 
forl <j <k —2. 

The original problem was solved by R. J. Hendel, Central Michigan University Problem Group, 


Western Maryland College Problems Group, and the proposer. Two incomplete solutions were also 
received. 


Reversing a Recurrence Relation 


E 3374 [1990, 239]. Proposed by N. J. Fine, Deerfield Beach, FL. 


Suppose J is a function on the non-negative integers such that J(O) = J(1) = 2 
and 


J(n +1) =J(n) + |J(n —1)/2] forn > 1. 
Prove that 
(v3 -—1)(J(n) + 1) =J(n—- 1) forn>1. 


Solution by David Callan, University of Wisconsin, Madison. The result follows 
by induction on n, using the following lemma. 


Lemma. If b is a positive integer, a = |(b + 1(V3 — DI, and c=b+ la /2], 
then \(c + 1IXV3 — DI =b. 


Proof. By hypothesis, a < (b + 1(V3 - 1)<a+41. Since a is an integer, 
b+(a-—1)/2<c<b+a/2. The inequalities c<b+a/2 and a<(b+1) 
x(V¥3 — 1) imply c + 1 < (b/2)(73 + 1) + (V3 + 1/2, which upon multiplica- 
tion by V3 — 1 transforms to (c + 1(V¥3 - I) <b +1. 

The inequalities (b + 1(V3 — 1) <a+4+1 and b+(a+/2<c+1 imply 
(b/2)/3 + 1) + (V3 — 1)/2 <c + 1, which by the same multiplication as before 
transforms to b + (V3 — 1)?/2 <(c¢ + D(¥3 — 1). Hence b < (c + 1XV3 — 1, 
which completes the proof. 


Solved also by J. Anglesio (France), R. J. Chapman (Great Britain), D. Doster, J. Guillera (Spain), 
C. Hill, A. A. Jagers (The Netherlands), P. Karson (student), O. P. Lossers (The Netherlands), L. E. 
Mattics, T. S. Norfolk, M. Riazi-Kermani, H. D. Ruderman, J. H. Steelman, R. Stong, J. S. Summer, M. 
Vowe (Switzerland), J. T. Ward, H. C. Wei, and the Central Michigan University Problem Group. One 
partial solution was submitted. 


A Complex Sequence That Is Nonzero Infinitely Often 


E 3384 [1990, 427]. Proposed by Jacob Sturm, Rutgers University, Newark, NJ. 


Let {a,}°_, be a sequence of distinct complex numbers such that Lla,| con- 
verges. Let {c,}”_, be a bounded sequence of non-zero complex numbers. Prove 
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agreement holds (for the same reasons) on z, + i at all pairs following the noted 
disagreement, for 0 <i <c,. This determines g and yields g(z,) = a,, g(z, + i) = 
a;_, + la; for 1 <i <c,,\. 

Finally, we prove g(k)<F, for every partition A,B. From the formula 
g(z; +i) =a,;_, + ia;,, we have g(k) = g(k — 1) +4, if z;+1<k <z,,,, and 
g(k) = g(k — 1) +a4,_, if k =z, + 1. Since a, = g(z;) for all j and the sequence 
g is increasing, we have g(k) < g(k — 1) + g(k — 2), with equality if and only if 
kK=z,+2ork =z,+1=2z,_, + 2. In particular, g(k) = F,, if and only if c; = 1 
forl1 <j <k —2. 

The original problem was solved by R. J. Hendel, Central Michigan University Problem Group, 


Western Maryland College Problems Group, and the proposer. Two incomplete solutions were also 
received. 


Reversing a Recurrence Relation 


E 3374 [1990, 239]. Proposed by N. J. Fine, Deerfield Beach, FL. 


Suppose J is a function on the non-negative integers such that J(O0) = J(1) = 2 
and 


J(n +1) =J(n) + |J(n — 1)/2| forn > 1. 
Prove that 
(v3 — 1)(J(n) +1)| =J(n- 1) forn > 1. 


Solution by David Callan, University of Wisconsin, Madison. The result follows 
by induction on n, using the following lemma. 


Lemma. If b is a positive integer, a = |(b + 1(V¥3 — DI, and c=b+ la /2], 
then \(c + 1XV3 — 1)] =b. 


Proof. By hypothesis, a < (b + 1(V3 - 1)<a+41. Since a is an integer, 
b+(a-—1)/2<c<b+a/2. The inequalities c<b+a/2 and a<(b+1) 
x(V¥3 — 1) imply c + 1 < (6/273 + 1) + (V3 + 1/2, which upon multiplica- 
tion by V3 — 1 transforms to (c + 1(V¥3 - 1) <b +1. 

The inequalities (b + 1(V3 —- 1) <a+1 and b+(a+1)/2<c+1 imply 
(b/2)/3 + 1) + (V3 — 1)/2 <c + 1, which by the same multiplication as before 
transforms to b + (V3 — 1)?/2 <(c + 1V3 — 1). Hence b < (c + 13 — D, 
which completes the proof. 


Solved also by J. Anglesio (France), R. J. Chapman (Great Britain), D. Doster, J. Guillera (Spain), 
C. Hill, A. A. Jagers (The Netherlands), P. Karson (student), O. P. Lossers (The Netherlands), L. E. 
Mattics, T. S. Norfolk, M. Riazi-Kermani, H. D. Ruderman, J. H. Steelman, R. Stong, J. S. Summer, M. 
Vowe (Switzerland), J. T. Ward, H. C. Wei, and the Central Michigan University Problem Group. One 
partial solution was submitted. 


A Complex Sequence That Is Nonzero Infinitely Often 


E 3384 [1990, 427]. Proposed by Jacob Sturm, Rutgers University, Newark, NJ. 


Let {a,}°_, be a sequence of distinct complex numbers such that Lla,| con- 
verges. Let {c,}”_, be a bounded sequence of non-zero complex numbers. Prove 
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(b) Find the largest constant c such that 


sin x \? tan x 3 
+ >2+ cx” tan x 
x x 


for 0 <x < 7/2. 


Editorial remark. The editors overlooked the fact that the following solution of 
(b) is much shorter than the published solution by Jean Anglesio. Other solutions 
of (b) were received from Richard A. Groeneveld, Kee-Wai Lau (Hong Kong), 
J. Ernest Wilkins, Jr., and the Lamar University Problem Solving Group. 


Solution II of (b) by David Callan, University of Wisconsin, Madison. We must 
determine the infimum on the interval (0,7 /2) of the function @ defined on the 
interval (0, a) by 


(*) h(x) = {(x7' sin x)” + x71 tan x — 2)/{x? tan x} 


= 27-'x-> sin2x + x74 — 2x73 cot x. 
We claim that @ is decreasing on (0, 77/2), so that 


inf nf h*) = $(7/2) = 16/m%*. 


0<x< 


Thus 16/7* is the largest value of c satisfying (b). 
On the interval (0, 7) we have the two Maclaurin expansions 


sin2x = 2x — 4x°/3 + > (—1)"(2x)7"*?/(2n + 5), 


n=0 
xcotx=1-—x«7/3 — Y) 27"*4/B, xe" t4/(2n + 4)3; 
n=0 
the latter is easily deduced from the fact that the Bernoulli numbers B, can be 


defined by the power series expansion t/(e' — 1) = Y°_,)B.t'/r! for |t| < 27. 
Hence for 0 <x < 7 we have 


Go 2n+5 (-1)" , 
d(x) = aera Bones + my} . 
Thus, if W(x) = 6(yx) for 0 <x < 7”, we have 
8x 16x? mo 2ent (—1) 
WO) =~ 5a ties + Loss! Bansal + Gy als 


Since 


2(2n + 5)|Bony4l = 4(2n + 5) "(20)" * Y ko 2-4 
k=1 


> 4(2n + 5)"2r) "> 1 


for n > 3, we have Bon +4| > 1/(4n + 10) for n > 3. Thus W(x) > 0 on the 
interval (0,777) and so w’ is increasing there. On the other hand, from (*) we 
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readily obtain 
W ((7/2)’) = 2o'b' (7/2) = (2/7)"(1 — 10/77”) < 0. 


It follows that w(x) is negative throughout the interval (0,(7/2)”) and so w is 
decreasing there. Hence ¢ is decreasing on the interval (0, 7/2) and our claimed 
result is established. 


Rationally Independent Subsets of the Reals 


E 3319 [1989, 356; 1990, 854]. Proposed by Paul Erdés, Hungarian Academy of 
Sciences, Budapest, and Janos Surdnyi, Eétuds Lordnd University, Budapest. 


Suppose S is a set of 2m + 1 irrational real numbers. Prove that S has a subset 
T of cardinality n + 1 such that no nonempty subset of T has a rational sum. 

Editorial remark. Professor Ginter Rote of the Technical University of Graz, 
Austria, has pointed out to us that the second solution published in the November, 
1990 issue is flawed, in that the hyperplane mentioned in the solution should 
contain the set Q of rational numbers rather than merely the origin 0. A corrected 
version of this solution is as follows. 


Solution IT by an anonymous contributor. Let V be the finite-dimensional vector 
space over @ generated by 1 and the elements of S. Since S is a finite set of real 
numbers none of which is contained in Q, there exists a hyperplane H through the 
one-dimensional subspace of V generated by 1 such that H contains no elements 
of S. Clearly n + 1 of the elements of S fall on the same side of H. These n + 1 
elements of S form a set T having the property stated in the problem. In fact no 
nontrivial linear combination of the elements of 7 with nonnegative rational 
coefficients is in H, so that in particular no non-empty subset of 7 has a rational 
sum. 


Sum-Free Sets Modulo n 


E 3346 [1989, 735; 1991, 368]. Proposed by Dean S. Clark, University of Rhode 
Island, Kingston. 


Call a subset T of Z mod n sum-free if the sum of two distinct elements of T is 
not in JT. Let s(n) be the maximum cardinality of a sum-free subset of Z mod n. 

(a) Prove that s(n) = n/2 if n is even. 

(b) If n is odd, prove that s[n] > [n/3] + 1. 

(c)* If n is odd, is s(n) = |n/3] + 1? 


Editorial Comment. For odd n several solvers (correctly) conjectured that 
“s(n) = |n/3] + 1 if n has no prime factors congruent to 5 modulo 6, but 
s(n) = n(p + 1)/@p) if p is the smallest prime factor of n congruent to 5 mod- 
ulo 6.” 

Fred Galvin has remarked to us that the preceding conjecture follows easily 
from the following known result: If s*(n) is the maximum cardinality of a subset 7 
of Z mod n which does not contain the sum of any two elements of T, distinct or 
not, then 

n 
i} 


(*) s*(n) = max { 


d\n 


d+1 
3 
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readily obtain 
W' ((m/2)’) = 7714" (/2) = (2/7)*(1 — 10/27?) <0. 


It follows that #’(x) is negative throughout the interval (0,(7/2)) and so & is 
decreasing there. Hence ¢ is decreasing on the interval (0, 7/2) and our claimed 
result is established. 


Rationally Independent Subsets of the Reals 


E 3319 [1989, 356; 1990, 854]. Proposed by Paul Erdés, Hungarian Academy of 
Sciences, Budapest, and Janos Surdnyi, Eé6tvds Lordnd University, Budapest. 


Suppose S is a set of 2m + 1 irrational real numbers. Prove that S has a subset 
T of cardinality n + 1 such that no nonempty subset of T has a rational sum. 

Editorial remark. Professor Ginter Rote of the Technical University of Graz, 
Austria, has pointed out to us that the second solution published in the November, 
1990 issue is flawed, in that the hyperplane mentioned in the solution should 
contain the set Q of rational numbers rather than merely the origin 0. A corrected 
version of this solution is as follows. 


Solution IT by an anonymous contributor. Let V be the finite-dimensional vector 
space over @ generated by 1 and the elements of S. Since S is a finite set of real 
numbers none of which is contained in Q, there exists a hyperplane H through the 
one-dimensional subspace of V generated by 1 such that H contains no elements 
of S. Clearly n + 1 of the elements of S fall on the same side of H. These n + 1 
elements of S form a set J having the property stated in the problem. In fact no 
nontrivial linear combination of the elements of T with nonnegative rational 
coefficients is in H, so that in particular no non-empty subset of T has a rational 
sum. 


Sum-Free Sets Modulo n 


E 3346 [1989, 735; 1991, 368]. Proposed by Dean S. Clark, University of Rhode 
Island, Kingston. 


Call a subset T of Z mod n sum-free if the sum of two distinct elements of T is 
not in T. Let s(n) be the maximum cardinality of a sum-free subset of Z mod n. 

(a) Prove that s(n) = n/2 if n is even. 

(b) If n is odd, prove that s[n] > |n/3] + 1. 

(c)* If n is odd, is s(n) = |n /3] + 1? 


Editorial Comment. For odd n several solvers (correctly) conjectured that 
“s(n) =|n/3] + 1 if mn has no prime factors congruent to 5 modulo 6, but 
s(n) = n(p + 1)/@p) if p is the smallest prime factor of n congruent to 5 mod- 
ulo 6.” 

Fred Galvin has remarked to us that the preceding conjecture follows easily 
from the following known result: If s*(7) is the maximum cardinality of a subset T 
of Z mod n which does not contain the sum of any two elements of T, distinct or 
not, then 

n 
i} 


(*) s*(n) = max { 


d\n 


d+1 
3 
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Proofs of (*) may be found in [1] and [2]. To deduce the assertion in quotation 
marks from (*), it suffices to prove the following lemma given by Galvin. 


Lemma. If T is a sum-free set of Z modn in the sense of the problem and if 
2a € T for some a € T, then |T| < |n/3] + 1. 


Proof. lf 0€T, then JT = {0}. Thus we may assume a #0. Let T= 
{a,2a,X1,...,%X,-»}, where t=|T|. Then the 2(t — 2)+ 1 elements 3a,a + 
Xiye++9€4tX,_5,24 +%X,,...,2a + xX,_, are all different and none of them be- 
longs to T. Hence 2(t —- 2) + 1<n-—t,ie,t<n/3 +1. 

It follows from the lemma that s(n) = max{s*(n),|n/3] + 1}. This gives the 
following evaluations. If n is even, s(n) = s*(n) = n/2. If n is odd and has no 
prime factor congruent to 5 modulo 6, then s*(n) = |n/3] and s(n) = |n/3] + 1. 
If n is odd and has a prime factor congruent to 5 modulo 6, then s(n) = s*(n) = 
(p + 1)n/(3p), where p is the smallest such prime factor. 


REFERENCES 


1. Palahenedi Hewage Diananda and Hian Poh Yap, Maximal sum-free sets of elements of finite 
groups, Proc. Japan Acad., 45 (1969) 1-5. 
2. Anne Penfold Street, Sum-free sets, Springer Lecture Notes in Mathematics, 292 (1972) 123-271. 


ADVANCED PROBLEMS 
6673. Proposed by Paresh J. Malde and Allen J. Schwenk, Western Michigan 
University, Kalamazoo. 


Let c, =e, — d,, where e, is the number of connected labeled n-vertex graphs 
with an even number of edges and d, is the number of connected labeled n-vertex 
graphs with an odd number of edges. For example, c, = 0 — 1 = —1 and c, = 3 
— 1 = 2. Find a general formula of c,,. 

6674. Proposed by Paul Erdés, Hungarian Academy of Sciences, Budapest. 


If n is an integer greater than 1, let P(n) denote the largest prime factor of n. 
Prove that n|P(n)! for almost all n, i.e., prove that if 


S(x) ={n<x:nt P(n)}}, 
then 


lim [S(x)|/x = 0. 


6675. Proposed by Alain Tissier, Montfermeil, France. 
Define 
00 ( _ 1) n—1 x 1/n 


f(x)= bh 


n=1 n 


for x > 0. Determine the asymptotic behavior of f(x) as x > 0. 
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Proofs of (*) may be found in [1] and [2]. To deduce the assertion in quotation 
marks from (*), it suffices to prove the following lemma given by Galvin. 


Lemma. If T is a sum-free set of Z modn in the sense of the problem and if 
2a € T for some a € T, then |T| < |n/3] + 1. 


Proof. If 0€T, then JT = {0}. Thus we may assume a #0. Let T= 
{a,2a,X1,...,%X,-»}, where t=|T|. Then the 2(t — 2)+ 1 elements 3a,a + 
Xyy22-,44+X,_5,24 + X,,...,2a +X,_, are all different and none of them be- 
longs to T. Hence 2(t — 2) + 1<n—-—t,ie., t<n/3 +1. 

It follows from the lemma that s(n) = max{s*(n),|[n/3] + 1}. This gives the 
following evaluations. If n is even, s(n) = s*(n) = n/2. If n is odd and has no 
prime factor congruent to 5 modulo 6, then s*(n) = [n/3] and s(n) = |[n/3] + 1. 
If n is odd and has a prime factor congruent to 5 modulo 6, then s(n) = s*(n) = 
(p + 1)n/(3p), where p is the smallest such prime factor. 


REFERENCES 


1. Palahenedi Hewage Diananda and Hian Poh Yap, Maximal sum-free sets of elements of finite 
groups, Proc. Japan Acad., 45 (1969) 1-5. 
2. Anne Penfold Street, Sum-free sets, Springer Lecture Notes in Mathematics, 292 (1972) 123-271. 


ADVANCED PROBLEMS 
6673. Proposed by Paresh J. Malde and Allen J. Schwenk, Western Michigan 
University, Kalamazoo. 


Let c, =e, — d,, where e,, is the number of connected labeled n-vertex graphs 
with an even number of edges and d, is the number of connected labeled n-vertex 
graphs with an odd number of edges. For example, c, = 0 — 1 = —1 and c, =3 
— 1 = 2. Find a general formula of c,,. 

6674. Proposed by Paul Erdés, Hungarian Academy of Sciences, Budapest. 


If n is an integer greater than 1, let P(n) denote the largest prime factor of n. 
Prove that n|P(n)! for almost all n, i.e., prove that if 


S(x) ={n<x:nt P(n)}}, 
then 


lim |S(x)|/x = 0. 


6675. Proposed by Alain Tissier, Montfermeil, France. 
Define 
00 ( _. 1)" "x17" 


f(x)= bh 


n=1 n 


for x > 0. Determine the asymptotic behavior of f(x) as x —> 0. 


966 PROBLEMS AND SOLUTIONS [December 


SOLUTIONS OF ADVANCED PROBLEMS 
A Set of Points Related to Nonnegative Cosine Polynomials 


6608 [1989, 652]. Proposed by Oliver D. Anderson, University of Western Ontario, 
London, Canada. 


Let S, be the set of all points (x,,..., x,,) in R” such that 


n—-k n 
x,= vt] S t? (k =1,2,...,n), 
j=0 j=0 
where fp, ¢,,...,¢,, are arbitrary real numbers not all zero. It is easy to see that S, 
is bounded and contains the origin as an interior point. 

(a) When n = 2 prove that S, is the convex hull of the set consisting of the 
point (0, — 4) and the points of the ellipse 2x? + (4x, — 1)* = 1. 

(b)* Is S,, convex for all n? 

(c)* Is S, closed for all n? 


Editorial remark. The version of the problem originally published specified that 
t) = 1. The symmetrical version given above is more convenient to work with. The 
two versions are equivalent. For suppose P(t), ¢,,...,¢,,) is the point in R” whose 
kth coordinate 1s 


n—k n 
titer) Dt (k= 1,2,...,n), 
jJ=0 j=0 


where fp, ¢,,...,¢, are real numbers not all zero; then 


P,(tostys--+5tn) = P,(1, tito, to/tos.--st,/ty) if ty #0 
and 


Pi(tostis-+-5tn) = PACA, tea 1 /tys tra o/ ts +++, t,/t,,0,...,0) 
ift,= > =¢,_,=0,¢t, #0. 
Solution to (b) and (c) by Zoltan Sasvari, Technische Universitat, Dresden, 
Germany. The answer to both questions is yes. A lemma of L. Fejér and F. Riesz 
({1], [2], [3]) asserts that a cosine polynomial A, + A, cost +--+: +A, cosnt with 
real coefficients is nonnegative for all real ¢t if and only if it can be expressed in the 
form 


Ag + Aycost + +++ +A, cosnt = |W + mye + +> +y,e'"l", 


where jo, 41,.--,, are real numbers. In other words the set of nonnegative 
cosine polynomials A, + A, cost + +--+ +A, cos nt can be parametrized as follows: 


a n—-r 
Ao = BK A, = 2D) MiMi 4, (r= 1,2,...,n), 
j=0 j=0 
where (Ho, My,---,M,) © R"*?. 
The definition of S,, the parametrization just given, and an obvious normaliza- 
tion show that (x,, x2,..., x,) € S, if and only if 1 + 2D%_,x, cos kt is nonnega- 


1991] PROBLEMS AND SOLUTIONS 967 


tive for all real ¢, i.e., if and only if 


1+2Re >) x,z* >0 ( *) 
k=1 


for all complex z of absolute value 1. The set of points (x,,...,x,) € R” 
satisfying (*) for all z of modulus 1 is obviously closed and convex, so that 
assertions (b) and (c) follow. 


REFERENCES 


1. L. Fejér, Uber trigonometrische Polynome, J. Reine Angew. Math., 146 (1916) 53-82, page 64. 

2. G. Pélya and G. Szegé, Aufgaben und Lehrsdtze aus der Analysis, Springer-Verlag, 1925, Volume 2, 
Part VI, Problems 39—41. 

3. F. Riesz and B. Sz.-Nagy, Functional Analysis, Ungar Publishing, 1955, pp. 117-118. 


Solution of (a) by the editors. Since the lines through (— 1/2, 0) with slopes +1 
are tangent to the ellipse 2x? + (4x, — 1)* =1 at the points (+2/3,1/6), it 
suffices to show that (x,, x,) € S, if and only if —1/2 <x, < 1/2 and 


Ix,|<x,+1/2 if-1/2<x,<1/6, 
Ixy] < (4x, — 8x3) if 1/6 <x, < 1/2. 


From the above solution of (b) and (c) we note that (x,, x,) € S, if and only if 


1+ 2x, cost + 2x, cos2t > 0 (*) 


for all ¢ in [0, 7). Putting v = tan?(t/2), we see that (*) holds for all ¢ in [0, 7) if 
and only if 


1+ 2x,(1—v)/(1 + v) + 2x,{2(1 - v)’/(1 +. v)’ - 1} > 0 
for all nonnegative real v. Thus (x,, x,) € S, if and only if 
1+ 2x, + 2x, + 2(1 -— 6x,)v + (1 — 2x, + 2x,)v? > 0 


for all v > 0. Now if a,b,c are given real numbers, av* + bv +c > 0 for all 
nonnegative v if and only if a > 0, c > 0, and either b > 0 or ac — b*/4> 0. 
Thus (x,,x,) € S, if and only if 1+ 2x, + 2x,>0, 1—2x,+2x,>0, and 
either 1 — 6x, > 0 or (1 — 6x,)* — (1 + 2x,)? + 4x? < 0. Thus (x,, x5) € S, if 
and only if |x,|<x,+1/2 and either x, <1/6 or x7 + 8x3 — 4x, < 0. The 
assertion of (a) follows. 


Editorial comment. Anderson informs us that (b) and (c) also follow from a 
result well known to time-series analysts, namely that the sum of two independent 
moving average processes is also a moving average. 


Parts (b) and (c) were solved also by V. M. Joshi and the proposer. Direct solutions of (a) (i.e., 
solutions not using the Fejér—Riesz characterization of nonnegative cosine polynomials) were obtained 
by Everett Dade and Bruno Haible. 
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Zeros of Sections of the Zeta Function 


6627 [1990, 252]. Proposed by G. Boese, Munich, Germany 


If n is a positive integer, put 
n 
LAs) = Liv’. 
j=l 


It is known that there are infinitely many n such that £, has zeros in {s: Res > 1}. 
However, prove that 


(i) if n = 1,2,3,4,5, then Ref (s) > 0 for Res > 1; 
(ii) if n = 6,7, 8,9, 10, then |f,(s)| > 0 for Res > 1. 


Solution of (i) by the editors. For n = 1,2,3 we have Re f(s) >1-1/2-1/3 
= 1/6 for Res > 1. 


For the cases n = 4,5 it is helpful to consider the function f defined by 


f(s) =1+27°+4+ 47°. If Res > 1, we have 2~* = x + iy, where x and y are real 
and x* + y* < 1/4. Hence for Res > 1 we have 


Re f(s) =14+x4+x?—-y?=7/8 + Ux +1/4)° -x?-y? 
>7/8-1/4=5/8, 


with equality if and only if x = —1/4 and x*+y*=1/4, ie., if and only if 
s = 1+ 27in/log 8, where n is an integer not divisible by 3. Thus if n = 4 or 5, 
we have 


Re 2,(s) > 5/8 — 1/3 — 1/5 = 11/120 
for Res > 1. 


Solution of (ii) by the proposer. The cases n = 6,8,9,10 can be handled very 
simply. Since 


(1 —-27°)G(s) =1+4+37°%4+57%—-8°* — 107 — 127°, 
we have 
(1 — 2>°)g,(s)| > 1 — 37! — 57! - 8-1 — 107! — 127! = 19/120 
for Res > 1. Since 
(1— 27*)O.(s) =14+37°+35°% + 77% — 107% — 127° — 147° — 16 
we have 
(1 = 2) g4(8)| 21 -3°b S87 
— 107! — 127' — 147' — 167'=11/1680 


for Res > 1. Similarly, it is easy to check that the sum of the absolute values of 
the 15 terms of the Dirichlet series for 


(1 — 27°)(1 — 37°)g(s) - 1 
is at most 1249/1260 for Re s > 1, so that 


(1 — 27°)(1 — 37) £(s)| > 1 — 1249/1260 = 11/1260 


for Res > 1. Finally the sum of the absolute values of the 15 terms of the 
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Dirichlet series for 
(1—2°°)(1— 3 )eio(s) — 1 
is at most 233/252 for Re s > 1, so that 
(1 — 2°*)(1 — 37) E495) | > 1 — 233/252 = 19/252 


for Res > 1. 
The case n = 7 is less straightforward. Let 


Z(u,v,w,t)=1+utu*t+ou(l+u)t+wete. 


Since Z(27°,37°,57°, 77°) = £.(s), it suffices to prove that |Z(u,v,w, t)| > 0 in 
the 4-disc 


{(u,v,w,t):|ul < 1/2, |v] < 1/3, lw] < 1/5, |t| < 1/7}. 
< < 


For fixed values of v, w, t with |v| < 1/3, |w| < 1/5, |t| < 1/7 we shall use 
Rouché’s theorem to compare the number of zeros of Z(u, v, w, t) in {u: |u| < 1/2} 
with the number of zeros of Z(u,0,0,0) = 1 +u+u? there. To apply Rouché’s 
theorem it suffices to know that for |u| = 1/2 we have 


loi+u)t+wte| <llt+ul/3+1/54+1/7<ll+ustu’. 
If we put x = 2Reu =u + U, where uu = 1/4, then 
tu =lt+ut+ait+ui=x+5/4 
and 
tutu? =1+(u+i) + (u +i) — ui + (u + f)ua + (uil)’ 
= (x + 5/8) + 27/64. 
Thus it suffices to show that if 


2 
} 


F(x) = {(x + 5/8)° + 27/64} “" — {x + 5/4}'7°/3 — 12/35, 


then F(x) > 0 for x € [—1,1]. 

The positivity of F on [—1, 1] can be verified in many different ways, including 
the following: (a) The minimum of F on [—1, 1] can be determined by differentiat- 
ing and then solving the resulting cubic equation. (b) Since it is easy to show that 
|F’(x)| < 1 for x € [-—1, 1], it suffices to calculate that F(x) > 0.02 whenever x is 
an integer multiple of 0.04 in [—1, 1]. (c) By two squarings the inequality F(x) > 0 
for x € [—1,1] can easily be shown to be equivalent to the positivity of a certain 
quartic polynomial on [—1, 1]. 

Once the positivity of F on [0,1] has been demonstrated, Rouché’s theorem 
shows that for fixed v,w,t the function Z(u, v, w, t) has the same number of zeros 
in {u: |u| < 1/2} as the function Z(u, 0,0, 0), namely none. Thus |Z(u, v, w, t)| has 
a positive lower bound for |u| < 1/2, |v| < 1/3, |w| < 1/5, |t] < 1/7 and so |£,(s)| 
has a positive lower bound for Res > 1. 


Editorial comment. Most of the results of this problem were obtained in a more 
computational manner by Robert Spira, ‘‘Zeros of Sections of the Zeta Function 
II,’ Math. Comp., 22 (1960) 163-173. Spira also showed that ¢,, does have zeros 
with real part greater than 1. 


No other solutions were received. 
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Schmidt’s Query Becomes Lindsey’s Theorem 


6636 [1990, 535]. Proposed by F. Schmidt, Bryn Mawr College, Bryn Mawr, PA. 


If G is a finite group, let f(G) be the average order of an element of G, i.e., 


f(G) =|IGl" & ord(g), 
geEG 
where ord(g) is the order of the group element g and |G| is the number of 
elements in G. 
(i) Show that if G is a nilpotent group of order n, then f(G) < f(Z,,), where Z,, 
is the cyclic group of order n. 
(ii)* Is it true that if G is any group of order n, then f(G) < f(Z,,)? 


Solution of (i) by Richard Stong, University of California at Los Angeles. Observe 
that f(G, X G,) = f(G,) f(G,) provided ord(G,) and ord(G,) are relatively prime. 
Therefore, since a nilpotent group is a direct product of p-groups, it is enough to 
show the result for p-groups. One easily calculates that 

p"*} + pu" 1 
f (Zn) Pare] > pr. 
[See the following solution of (ii) for details. ] 

Further if G is any other group of order p”, then G contains no element of 
order p”. Hence no element of G has order exceeding p"~' and f(G) < p"~}. 
Thus if G is a nilpotent group of order n, then f(G) < f(Z,,) with equality if and 
only if G is cyclic. 


Solution of (ii) by John H. Lindsey II, Ft. Myers, Florida. The answer is yes. Let 
a be a set of primes, let G, = {g: g € G and |g| is divisible only by primes in 7}, 
and let f. (G) = IG, "Leec. ord(g). Let g, be the p-part of an element g in G, 
i.c., if ord(g) = p'm, where p + m, then g, =g°”, where 1=am + bp’ with 
a,b = Z. Let n = [1,x, be the decomposition of 1 into prime powers. Then 


nf(Z,) = » ord(g) = » ord(II, g,) 


gEZ,, gEZ,, 
= )) IlL,ord(g,)=[] )  ord(g,), 
geZ, pin g E(Z,), 


where (Z,,), is the p-primary component of Z,. Thus f(Z,,) = I1,,,f(Z,),). 
Furthermore, 


F(Z) p) = F(Z,,) 
np {1 + p(p — 1) + p’p(p —-1)+-:: +n,(n,/p)( Pp — 1)} 
ny {1 + p(p — 1)(n3 - 1)/(p? - 1)} 
pn, 1 
pt+l “(p+)” 


If n, > 1, then 
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Let 7, be the set of odd primes dividing n and let 7 be any finite set of primes. 
Put h(n) = (2/3)'™~™'f(Z,,), so that h(n) = I1,,,4,,(n,). Then (ii) follows by 
taking 7 = 7, U {2} in the following theorem. 


THEOREM. If G is a group of order n, then f_ (G) < h(n). 


Proof. We may as well replace 7 by 71 (7, U {2}). Then we use induction 
first on |G| and next on |7|. We first show that 4 (7) is increasing in the sense that 
(h_(m) < h(n) if m\n. In fact, if m =n/q for g a prime, then 


h(m) _ Me(Ma/4) 


h(n) — h,(q) 
Furthermore, if 1, > q, then 
9 a(n4/4) 
hin(mo/4) _ S2Znya) 8 att 9 o 
h(n) f(Z,,) Mo 8q * 16" 
q+1 
Ifn, = 4, 
h_(n 1 1 3 +] 
n(%q/4) _ - 34+) 


h(n) h(a) ~ 3f(Z,) 2 aq” 


which is at most 


if g is odd. If g = 2, 
1 1 1 2 


h(a) f(Z,) 3/2 3 
In any event, 
hn(Mg/a) _ 2 
h(n) 3. 
If w is empty, f.(G) =1=h,) <h,(n), by monotonicity. For the rest of this 


paragraph, suppose 7 = {2}. Suppose n =n. If G is cyclic, f,(G) = f(Z,) = 
h_(n). If G is not cyclic, 


2 
f,(G) <5 < 241 


Finally, suppose that p is an odd prime dividing n. By monotonicity 
D 2 

—————— ——— Fl 

pt 1P241°7 


<h_(n). 


2 
h(n) > h,( pn) = h,(P)ha(m2) > 3 


23 3 2 G 
>>73-n,=n,2f, ; 

5 3g = M2 > fp(G) 

Now, suppose that p is an odd prime in 7. Define a class to be the set of all 
a-clements having the same p-part. It suffices to show that the average order over 
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each class is at most h,(n). Suppose that the w-element g = xy has p-part: x. 
Then ord(g) = ord(x)- ord(y) and y is a (7 — {p})-element in C.(x) = C(x). 
Hence y > yx) is 1-1 ord( )-preserving correspondence between such y and the 
(ar — {p})-elements of H = C(x)/(x). Thus the average of ord( ) over this class is 
ord(x)f,,_,,, 4). Suppose ord(x) <,. Using the induction hypothesis (since 
lar — {p}| < |r|) and monotonicity, 


ord( x) f,,-¢ (1) < ord(x)h,_,,(IH|) < ord(x)A,_,,,(1Gl/ord(x)) 
= ord(x)h,_.("p/ord(x))h,(n/n,) 


2 
= ord(x) Zoran) 


Dp 
3 Spel od(a) 


<h(n,)h,(n/n,) =h,(n). 


Therefore we are through unless there exists x of order n,. Suppose C(x) € G. 
Then for some prime q # p, |C(x)|||GI/q. Using the induction hypothesis and 
monotonicity, we see that the average of ord( ) over this class is 


(1) <n,h,- (Hl) =1,h, (Hl) <2,h,(IGl/(an,)) 


h,(n,/q) 
h,(Nq) 


<h,(n,)h,(n/n,) =h,(n). 


The only case where our class-average proof breaks down is when ord(x) = nN, 
and C(x) = G. Then, by Schur-Zassenhaus, G = (x) X H for some p’-group H. 
By a calculation similar to that for nf(Z,,) and by the induction hypothesis applied 
to H, we find that 


FAG) =f( Zora )fa( 1) = h,(ord(x)) f, (4) 
h_(ord(x))h_(IHl) =A, (1G). 


(We do not really need Schur-Zassenhaus, just the bijection on G_: g > ( Lys gy) 
where independently g, runs over (x) and g, runs over the p’-elements of G, 
which are in 1-1 correspondence with (G/(x))_.) This completes the proof. 


< ord(x ne h,(n/n,) < —~ 4 n,h(n/n,) 


Ny a —{p} 


2 p 
=n,h,(n/n,) < nph,(n/ny) = < pap ete("/np) 


No other solutions were received for Part (ii). Part (i) was solved also by David Callan, O. P. 
Lossers, the late David Richman, John Henry Steelman, and the proposer. 


UNSOLVED PROBLEMS 


EDITED BY RICHARD GUY 


In this department the Monthly presents easily stated unsolved problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied by 
relevant references (if any are known to the author) and by a brief description of known partial or 
related results. Typescripts should be sent to Richard Guy, Department of Mathematics and 
Statistics, The University of Calgary, Alberta, Canada T2N 1N4. 


MONTHLY UNSOLVED PROBLEMS, 1969-1991 
RICHARD K. Guy 


This is the eleventh of the two-yearly articles that have appeared since the 
inception of this section of the Monthly. It is written in June, but, as I prepare to 
serve under my sixth editor, I make a New Year’s resolution to keep the articles 
flowing more regularly. In part this depends on you to keep me supplied with 
suitable problems, and these are not easy to find [1971, 1113-1114; 1973, 
1120-1121; 1981, 755-756; 1983, 122]. 

References in brackets are to year and page numbers of this Monthly, while 
dates in parentheses refer to publications listed at the end, and other items are 
labelled (tbp) if they are likely to be published formally, or as written communica- 
tions (wrc) if publication plans are not presently known. Dates and pages in 
brackets are also appended to items in the bibliography indicating where the 
problem originally appeared in the Monthly. 

We mention here some other collections of unsolved problems. Croft, Falconer 
and Guy have recently published Unsolved Problems in Geometry (1991) [UPIG] 
and I have edited collections of unsolved problems in number theory (1989, 1990), 
as well as the informal collections issued in connection with the annual Western 
Number Theory conferences, and I hope that the second edition of the book 
[UPINT | will be available by the time this article is being read. A collection of 37 
unsolved problems in combinatorial games appears on pages 183-189 of Guy 
(1991). 

It would seem that Joseph Gallian [1990, 133], with his survey paper (1989) and 
two other papers (1987, 1988) has become keeper of the records for the Kotzig- 
Ringel tree-labelling problem [1969, 1128]. My bibliography, which needs to be put 
into electronic form, contains 185 items. 

In discussing the famous Sylvester problem and its relation to the magic 
configuration problem of Murty [1971, 1000] I misquoted [1989, 903-905] a 
conjecture of Smyth. It should have read “éIf a configuration contains more than 
five special points, then all special points are collinear?” A counterexample, for 
which Smyth did not have the source, is provided (for k > 2) by the set T of the 
vertices and the centre, c, of a regular (4k + 2)-gon, together with the set 
S = {s,,..., 85,4} of points at infinity on the sides (and diagonals) of the polygon. 
The special points are c US which are 2k + 2 in number and not all collinear. 
Smyth’s other conjecture: “iThe only nontrivial configuration in which the number 
of special points exceeds the number of ordinary points is the failed Fano plane?” 
remains uncontradicted. 

Fuiredi (1991) has extended to w < y3 the solution of Fejes Téth’s [1971, 528] 
problem asking for the densest packing of points (resp. circles of unit diameter) in 
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an infinite strip of width w (resp. w + 1), no two points being distance less than 1 
apart. 

For those of us who have struggled with crossing-number problems [1973, 52] it 
is some comfort that Bienstock (wrc) has shown that some of them are provably 
hard. He and Dean (wrc) have also described, for each k > 4, a class of graphs 
with crossing number k, but unbounded rectilinear crossing number. An interest- 
ing breakthrough has been made by Douglas Woodall (tbp) towards verifying what 
may be called Zarankiewicz’s conjecture, that the (minimal) crossing number of 
K,, ,» the complete bipartite graph on m + n vertices, is equal to the known upper 


bound 
“(Kno < [5] [5] 


by showing its truth in the cases v(K, 7) = 81 and v(K,,) = 144. A counting 
argument then implies that the formula holds for (m,n) = (7, 8), (8, 8), (7, 10) and 
(8,10). Unfortunately, the argument used by Kleitman (1970) to deduce the case 
(5,n) from his result for (5,5) doesn’t seem to work for (7,7), so the smallest 
unknown values are for (7,11) and (9,9). The counting argument shows that 
v(K, ,,) > 220 and a parity argument shows that the value is odd, so it is 221, 223 
or 225. Similarly, v( Ky 4) is 248, 250, 252, 254 or 256. Which? 

Gerver (wrc) has produced a “‘sofa’” [1976, 188] of area 2.2195 ... which can be 
moved round a right-angled corridor of width 1, which beats the earlier record of 
2.215649 ...[1977, 811 and see UPIG, G5] and which he conjectures to be best 
possible. 

Students of Conway’s game of Sylver Coinage [1976, 634] should consult the 
tables on pp. 171-172 of Richard Nowakowski’s chapter in Guy (1991), where the 
results of George Sicherman’s extensive calculations are incorporated. Nowakowski 
is also the natural person to appoint as repository for any additional information 
about this game. 

We announced [1987, 963; 1989, 905] that Tunnell had effectively solved the 
“congruent number” problem [1980, 43]: Monsky’s (1990) paper on this topic has 
since appeared. Table 1 [1981, 759] is now complete and will appear as Table 7 in 
the second edition of UPINT. 

Butler (1991) answers all three of Piranian’s [1980, 555] questions—see (a), (b), 
(c) at foot of [1991, 139]. 

The interesting sidelight thrown by Lagarias (1990) on the notorious 3x + 1 
problem [1983, 35; 1985, 3-23] has now appeared. He is the natural repository for 
information on this problem, which he writes that he is not done with. He plans a 
joint paper on the heuristics for the total stopping time. There are two distinct 
probability models: a random walk based on mod2 behavior, and a branching 
process based on mod 3 behavior (going backwards). Both models can be solved 
exactly in principle and predict a maximum excursion of (c, + 0(1))In N as 
N — o, where cy = 41.1...is the same constant for both models. 

At a recent Western Number Theory conference Basil Gordon asked if it is true 


that for every positive integer n there is a one-one map L of {1,2,...,n} onto 
{0,1,...,7 — 1} such that L(ab) = L(a) + L(b) whenever a, b and ab are all in 
{1,2,...,n}. It is easy to confuse this problem with that of Forcade, Lamoreaux 


and Pollington [1986, 119; see also 1989, 905] where the mapping is to be onto a 
group (called an FLP group below). Gordon’s problem is a special case, so the 
counterexamples found by Forcade and Pollington (1990) for their problem also 
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serve as counterexamples for Gordon’s. Contrast the two examples for n = 10: 


1 2 3 4 5 6 7 8 9 10 
Lp O 1 8 2 4 9 7 3 6 5 
Le 0 1 4 2 6 5 9 3 8 7 


The former is obtained by using 2 as a primitive root of n + 1 = 11 and the 
entries, which are exponents (logarithms) are the elements of the additive group of 
residue classes mode 10. However L(3) + L(3) = 8 + 8 #6 = LG X 3) unless the 
addition is so interpreted. We can satisfy Gordon’s requirements by rearranging 
the entries as in the second example. Forcade and Pollington gave the counterex- 
amples 195 and 255 to their own problem and 195 is the least such. Perhaps 
n = 105 (but not n = 35) is the smallest counterexample to Gordon’s problem; 
John Selfridge has done a backtrack on this by hand, but it should be confirmed by 
machine. Chandler (1988) has shown that every odd order FLP group is commuta- 
tive. 

Tomaszewski [1986, 280] considered n real numbers a,,..., a, satisfying L”_,a? 
= 1 and asked if, of the 2” sums of the form ) + a,, it is possible that there are 
more with |i +a,|> 1 than there are with |} +a,| <1. Ron Holzman (wrc) 
proves that the proportion with smaller sum is at least 63/169 > 0.37278. 

It seems to be very hard to determine whether or not there is a point in the 
plane of a unit square which is at a rational distance from each of its corners [1988, 
33 and see UPINT, D19], but Bremner and Guy (1989 and tbp) have in some sense 
found all tilings of the square with four rational triangles. 

In [1988, 35] we also asked for integer solutions of the simultaneous equations 

xe+y?+z7*=a* +d? +c? = x*y2z?2 =a’b*c? 
and later [1989, 907] gave some history. Because of our backlog, John B. Kelly’s 
neat general solution (1991) has taken longer to appear than expected. 

Murray Klamkin [1988, 845] conjectured that if a curve subtended a constant 
angle from the points of each of two concentric circles, then it was itself a circle. 
Johannes Nitsche (1990) proved this, as did Gerd Baron, and suggested that it 
might be of interest to consider similar questions in higher dimensions. 

A certain amount of correspondence concerning Shyam Sunder Gupta’s [1989, 
425] quest for decimal numbers whose sum and difference with their reversals are 
both squares, has yielded no examples other than those reported in [1989, 908]. 
Are there infinitely many non-palindromic examples? Are there any odd, and 
therefore necessarily non-palindromic, examples, besides 65 + 56 = 117,37? 

Martin Aigner writes concerning Bill Sands’s guessing game [1990, 314] that if 
an unknown element x from [1,..., 7] is to be deduced from (truthful) answers of 
form “x = k” or “O < |x —k| < d” or “x — k| > d” to questions “Does x = k?” 
and if L(n) is the number of answers in the worst case, and 2"~* <d < 2""', 
then, for n > 2d(h + 1) + 2h, 


n—-2"—h 
+h forn#2"-*4+1, n>4 
my alt 2dtl 
MM) V0 oh 
+h forn=2"774+1, n>5 
2ad+ 1 
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with larger values for L,(n) if n < 2d(h + 1) + 2h. Howard Cary Morris and 
others also obtained partial results. Variations in which there is, or may be, an 
untruthful answer, may be worth considering. 

A. Zulauf of the University of Waikato, New Zealand, has proved that the first 
player can win the game of Barranca [1990, 315] by first choosing 2 and eventually 
achieving a score of 224, 216 or 210 (it is easy for each player to prevent the 
opponent from obtaining a perfect score of 220). The complete analysis is too 
extensive to publish here. 


REFERENCES 


Daniel Bienstock, Some provably hard crossing number problems (preprint) [1973, 52]. 

Dan Bienstock and Nate Dean, Bounds for rectilinear crossing numbers (preprint) [1973, 52]. 

Andrew Bremner and Richard K. Guy, Delta-lambda configurations in tiling the square, J. Number 
Theory, 32 (1989) 263-280; MR 90g: 11031 [1988, 33]. 

Andrew Bremner and Richard K. Guy, Nu-configurations in tiling the square, Math. Comput., 58(1992) 
(to appear) [1988, 33]. 

James P. Butler, The perimeter of a rose, this Monthly, 98(1991) 139-143 [1980, 555}. 

K. A. Chandler, Groups formed by redefining multiplication, Canad. Math. Bull., 31(1988) 419-423; 
MR 89m:20021 [1986, 119]. 

Hallard T. Croft, Kenneth J. Falconer and Richard K. Guy, Unsolved Problems in Intuitive Mathemat- 
ics, Vol. II, Unsolved Problems in Geometry [UPIG], Springer-Verlag, 1991. 

R. W. Forcade and A. D. Pollington, What is special about 195? Groups, nth power maps and a 
problem of Graham, in R. A. Mollin (editor) Number Theory, Proc. Ist Conf. Canad. Number Theory 
Assoc., Banff, 1988, de Gruyter, 1990, pp. 147-155 [1986, 119]. 

R. Frucht and J. A. Gallian, Labeling prisms, Ars Combin., 26(1988) 69-82; MR 90a:05178 [1969, 1128]. 

Zoltan Fiuredi, The densest packing of equal circles into a parallel strip, Discrete Comput. Geom., 
6(1991) 95-106 [1971, 528]. 

Joseph A. Gallian, Labeling prisms and prism related graphs, in Proc. 18th SE Internat. Conf. Combin. 
Graph Theory, Comput., Boca Raton, Cong. Numerantium, 59(1987) 89-100; MR 89d:05162 [1969, 
1128]. 

Joseph A. Gallian, A survey—recent results, conjectures and open problems in labeling graphs, 
J. Graph Theory, 1311989) 491-504; MR 90k:05132 [1969, 1128]. 

Joseph L. Gerver, On moving a sofa around a corner (preprint) [1976, 188]. 

Richard K. Guy (editor), Unsolved Problems, in R. A. Mollin (editor) Number Theory and Applica- 
tions, Proc. NATO ASI, Banff 1988, Kluwer, Dordrecht, 1989, 593-602. 

Richard K. Guy, Canadian Number Theory Association unsolved problems, 1988, in R. A. Mollin 
(editor), Number Theory, Proc. Ist Conf. Canad. Number Theory Assoc., Banff, 1988, de Gruyter, 
1990, pp. 193-206. 

Richard K. Guy, Unsolved Problems in Intuitive Mathematics, Vol. I, Unsolved Problems in Number 
Theory [UPINT ], 2nd edition, Springer-Verlag, 1992. 

Richard K. Guy (editor), Combinatorial games, Proc. Symp. Appl. Math., 43, Amer. Math. Soc., 
Providence, 1991 [1976, 634]. 

Jeffery C. Lagarias, The set of rational cycles for the 3x + 1 problem, Acta Arith., 56(1990) 33-53 
[1983, 35]. 

Ron Holzman, On the product of sign vectors and unit vectors (preprint) [1986, 280]. 

John B. Kelly, Two equal sums of three squares with equal products, this Monthly, 98(1991) 527-529 
[1988, 35]. 

Daniel J. Kleitman, The crossing number of Ks ,,, J. Combin. Theory, 9(1970) 315-323; MR 43 #6123 
[1973, 52]. 

Paul Monsky, Mock Heegner points and congruent numbers, Math. Z., 204(1990) 45-67; MR 91e:11059 
[1980, 43]. 

Johannes C. C. Nitsche, Isoptic characterization of a circle (proof of a conjecture of M.S. Klamkin), this 
Monthly, 9711990)45-47; MR 90j:51029 [1988, 845]. 

Douglas R. Woodall (tbp), Cyclic-order graphs and Zarankiewicz’s crossing-number conjecture [1973, 
52]. 


REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 
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xix + 710 pp. 
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A book of a mere 151 pages entitled Geometric Inequalities, by O. Bottema, 
R. Z. Djordjevic, R. R. Jani¢é, D. S. Mitrinovi¢é, and P. M. Vasi¢, published in 1969, 
contained about 400 inequalities and listed 225 authors, and aroused a great deal 
of interest. The giant of a book under review derives from this sturdy infant, 
contains 750 names of those who have worked on inequalities up to the year 1986, 
and lists thousands of inequalities. It also describes conjectures, unsolved prob- 
lems, and research dating back to the 19th century. Publications in Chinese, 
Japanese, Serbo-Croatian, Bulgarian, Rumanian, Hungarian and Dutch are re- 
viewed, besides, of course, those in English, French, German and Russian. The 
scope of this work dazzles the imagination. The series editor, Michiel Hazewinkel, 
ends the first Preface in the book with a challenge: 


Well, if you know of a better ’ole, go to it! 
Bruce Bairnsfather 


The reviewer was a child in WWI, and remembers the cartoon by Bruce 
Bairnsfather, grimly designed to reassure the British public that trench warfare 
could be made bearable. The cartoon showed a rubicund Cockney smoking a pipe, 
cooking sausages over a paraffin stove, quaffing a beer, in a comfortable-looking 
‘trench illuminated by candles. The series editor also gives a short quotation from a 
G. K. Chesterton novel, another from The Chinese Maze Murders by a R. van 
Gulik, states the title of a famous paper by Eugene Wigner, and ends his 
quotations with one from William Blake and one from Joseph Louis Lagrange. 

In other words the series editor makes it plain that the world is his oyster Gif he 
is a She, I apologize), and will not have his subject deprecated. He points out that 
geometric inequalities have important applications both to geometry and to areas 
well beyond the traditional areas of geometry, such as the theory of complex 
functions and the calculus of variations, to the theory of embedding theorems for 
function spaces, and more generally in providing a priori estimates in several 
areas, such as differential equations. 

In any triangle with sides a, b and c, we have the inequalities 


b+c>a, c+a>b, a+b>c. 


This fundamental set of inequalities, naturally called triangle inequalities, appears 
in Euclid, but certainly precede Euclid, being known to any donkey seeking a 
bundle of hay for rapid consumption: the shortest distance between any two points 
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is the straight line segment joining them. Chapter I of the book under review is 
concerned with the existence of a triangle satisfying more complex conditions, 
beginning with inequalities involving the radius of the circumcircle, the radius of 
the incircle, and the semi-perimeter. There are 48 references given at the end of 
this chapter, which sets the tone for the whole book, since essential historical 
descriptions are given in the text, links between theorems are traced, and short 
proofs are given where appropriate. MObius appears in the references, with an 
1852 date, and, to come to modern times, Paul Erd6s appears in a joint note with 
Murray Klamkin. 

Chapter II deals with “Duality Between Geometric Inequalities,” Chapter III 
with “Homogeneous Symmetric Polynomial Geometric Inequalities,’ Chapter 
IV with “Duality Between Different Triangle Inequalities and Triangle Inequali- 
ties with (R,r,s),’..., no aspect of geometric and related inequalities is ne- 
glected. 

In 1941, waiting for the inevitable daylight air-raid on Southampton, I was 
doodling with pencil and paper, and wondered whether the orthogonal projection 
of a given triangle into an equilateral triangle could be extended to the orthogonal 
projection of a given triangle into a triangle of given shape. In proving that this 
could always be done, I discovered a two-triangle inequality for triangles with 
respective sides a,b,c and a’, b’,c' and areas A and QA’: 


a’(-a” + b’2 + c’*) + b?(a”’ _ b’2 + c’*) + c*(a”’ + b’2 _ c'*) > 16AQ’, 


with equality iff the two triangles are similar. 

I thought that this was an interesting inequality, and it reminded me of the 
general inequality which connects the mixed area of two convex domains with their 
respective areas. I tried to have my inequality published, but some people at 
Oxford were disturbed by the term orthogonal projection and rejected publication. 
The late Beniamino Segre, a great geometer, a refugee from Italy, was in a 
depressed mood, and labelled it ““Mathematical Tripos” stuff (which was curious, 
because the Tripos in those days contained a lot of fine geometry). The inequality 
might have been forgotten if I had not discovered that outside Britain well-known 
geometers such as Hadwiger had been very interested in mere single-triangle 
inequalities. I used another geometrical method, not involving orthogonal projec- 
tion, for proving my inequality, and this was accepted as a Research Note for the 
Cambridge Philosophical Society Proceedings. 

J. Neuberg, a 19th-century geometer, had discovered the inequality, and re- 
marked that this was evidently an equality if the two triangles are similar, but had 
not proved that equality implied the similarity of the two triangles. For this reason 
the theorem was labelled the Neuberg-Pedoe inequality. 

In recent times this inequality has attracted a fair amount of attention. Two 


young mainland-Chinese geometers have generalized the inequality to n-simplexes 
in Euclidean n-space, incidentally showing that so-called college algebra should 
not be neglected. The book under review has a section of 15 pages on The 
Neuberg-Pedoe and the Oppenheim Inequalities, with 38 references. 

The last chapter of 100 pages is on inequalities in E”, with 129 titles listed at 
the end of the chapter. Before that, inequalities are considered for quadrilaterals, 
polygons, circles, and there are Some Particular Inequalities which, of course, 
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consider isoperimetric inequalities. Many of the references go back to the 19th 
century, but one special one caught my attention: 

S. A. J. Lhuilier: De relatione mutua capatitatis et terminorum figurarum, 
geometrice considerata: seu de maximis et minimis, Varsaviae, 1782. 

My 1941 doodling with the orthogonal projection of a given triangle into one of 
a given shape is, of course, Lhuilier’s theorem, that a 3-edged prism can be cut by 
a Suitable plane in a triangle of any given shape. Priority is acknowledged in 
‘“‘Homogeneous Coordinates and the Lhuilier Theorem,” Crux Mathematicorum, 
vol. 9, 1983, pp. 160-165, which was also a response to Dieudonné’s defiant “‘ Who 
ever uses homogeneous coordinates?” 

Lest it be thought that this remarkable book is too highbrow for the average 
reader, the authors stress that their book is intended for a wide circle of readers, 
for students attending high schools, colleges and universities, as well as for their 
professors. To show the great variety among those listed in the index of names in 
this book, I list those known to me, either personally, or by correspondence. 

Sir Alexander Oppenheim appointed me to the University of Singapore. He was 
knighted for his services to Commonwealth education, and, in particular, for 
running a prison-camp university when Singapore was overrun by the Japanese 
army. Hidetosi Fukagawa is a high-school teacher in Japan, in his late forties. I 
have collaborated with him over the past six years in writing Japanese Temple 
Geometry Problems (reviewed in this journal, April, 1991), the product of geometry 
enthusiasts in 18th- and 19th-century Japan who painted their theorems on 
wooden tablets and hung them in the precincts of shrines and temples. These 
enthusiasts came from all social classes in Japan. George Tsintsifas is the principal 
of a private high school in Saloniki, Greece. Walther Janous teaches at an Ursuline 
convent in Innsbriick, Austria. 

Jordan Tabov, whose wife is also a mathematician, runs mathematics competi- 
tions, in which he distinguished himself as a student, in Sofia, Bulgaria. L. Yang 
and J. Zh. Zhang are the two young mainland-Chinese mentioned earlier. They 
could be called the alpha and omega of geometry in China, producing remarkable 
new theorems which can be proved by the use of computers and Hilbert’s 
theorems on polynomials. They first appeared in Crux Mathematicorum, demon- 
strating the amazing constructions possible using only a “rusty” compass (one of 
fixed radius). Geometry in mainland China seems to have survived the “cultural” 
revolution. Murray Klamkin has worked for the Ford Motor Company. He is once 
again a university professor and a great collector and solver of problems. J. F. 
Rigby is an English academic, and one of England’s leading geometers. Donald 
Coxeter has lived in Canada for many years, and can be called the G.O.M. of 
geometry there. Roland Eddy hails from St. John’s University, Newfoundland, and 
was encouraged by his professor, W. J. Blundon, prominently featured in this 
book, to study geometry, and has justified his professor’s insight. The great Jordi 
Dou is a retired architect, and lives in Barcelona. Geometry is his natural element. 
Bernhard Neumann has lived in Australia for many years, and is a very distin- 
guished mathematician. The collected papers of Neumann and his late wife 
Hannah have just been published by the same press which published my book 
written with Fukagawa. The late I. M. Jaglom is well known because of the 
translations from the Russian of his books on geometry. T. J. Fletcher was an 
Inspector of Schools for Her Majesty’s Government some years ago, wrote excel- 
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lent textbooks and pioneered in the making of films on geometry. S. Iwata is the 
editor of a remarkable book, an Encyclopaedia of Geometry, in Japanese, and one 
of the circle of devotees who meet to discuss Japanese Temple Geometry Prob- 
lems. My collaborator, Hidetosi Fukagawa, owes much to Prof. Iwata’s encourage- 
ment over the years. V. C. Linis and Fred Maskell were strong supporters of Léo 
Sauvé, the great editor of Crux, in its early days. They are all, alas, departed, but 
not forgotten. L. J. Mordell is still well known for his work in number theory, and 
was professor in Manchester for many years. 

Dmitri Mavlo wrote charming letters from Moscow in excellent English to both 
Léo Sauvé and myself, with photographs, inviting us for a tour of the Soviet Union, 
and suggesting possible problems for Crux. He no longer writes, and there is a hint 
in his last letter, before the Gorbachev era, that he had been in trouble with the 
regime. Leon Bankoff is a dentist in Los Angeles and has dramatized the beauties 
of geometry for many years. Bob Osserman is an excellent geometer whose paper 
“Curvature in the Eighties’ was one of six in the special geometry issue of the 
MontTHLy, October, 1990. Van der Waerden is probably still remembered for his 
magnificent books on modern algebra, originally published in German, and began 
the formidable task of putting algebraic geometry on a sound footing, with a series 
of papers in Mathematische Annalen. Paul Erd6s, of course, is known to the whole 
world, and is unique in many ways. 

Last, but not least, J. C. (Chris.) Fisher, D. Ruoff and J. Shilleto are of the new 
generation of active geometers in North America who enjoy working and publish- 
ing together. 

It is clear that there will be another edition of the great work under review, 
perhaps in ten years from now. The list of names is bound to grow exponentially. 
The authors welcome comments and corrections, and are obviously sincere in this. 
Having written two books with a coauthor, I cannot even imagine the logistics of 
cooperation between three authors which produced this book. Perhaps, as is the 
case with the making of some movies, there will be a book which will explain how 
the book was produced. In the meantime, I feel that this may well be the most 
inspirational book of the decade which has just passed, setting a pattern for 
cooperation in the joys of mathematical discovery which few have ever dreamed of. 
A world congress on geometric inequalities, which is very possible, would be a 
major cultural event of our time. For the immediate future, however, this book 
should be (possibly chained!) in every university and college library, and, yes, in the 
library of every school which is intent on improving its mathematics teaching. 
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ham, Alfred Gray, Daniel R. Grayson, R.J. Gregorac, Ralph Grimaldi, William 


Gustafson, 


Peter Hagis, Heine Halberstam, Mary-Elizabeth Hamstrom, Frank Harary, Da- 
vid Harbater, Lester L. Helms, Joe Hemmeter, Douglas A. Hensley, Adolf Hilde- 
brand, Aimo Hinkkanen, Kevin Hockett, Robert V. Hogg, Thomas W. Hungerford, 
I. Martin Isaacs, John R. Isbell, Mourad E.H. Ismail, Nathan Jacobson, Gerald J. 
Janusz, Richard P. Jerrard, Carl G. Jockusch, Charles R. Johnson, Norman John- 
son, James P. Jones, D. Kalman, Gerald Kaminski, Kenneth Kaminsky, Robert 
P. Kaufman, Nicholas D. Kazarinoff, Clark H. Kimberling, Allan Kirch, Victor L. 
Klee, Daniel J. Kleitman, Neal Koblitz, Fred Kochman, Joseph Konhauser, R. Kop- 
perman, Tsit-Yuen Lam, Ray G. Langebartel, Charles F. Laywine, Felix Lazebnik, 
Arnold Lebow, Carl Lee, Myrna Lee, Derrick H. Lehmer, Joseph Lehmer, Emma 
Lehmer, W. Lucas, Carsten Lund, Frederick W. Luttmann, Russell D. Lyons, 


Hosam Mahmoud, Daniel Maki, Marvin Marcus, David E. Marker, M. Mar- 
telli, R. Daniel Mauldin, Jim Mauldon, Leon R. McCulloch, Lynn McLinden, 
George F. McNulty, Russel Merris, P. Meyer, Joseph B. Miles, Frank B. Miles, 
Hugh L. Montgomery, John C. Morgan, David E. Muller, D. Mullins, Melvyn B. 
Nathanson, Morris Newman, Karl K. Norton, Alec Norton, Andrew M. Odlyzko, J. 
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Marshall Osborn, Robert Osserman, James Owings, Donald Passman, N. Tenney 
Peck, G.W. Peck, Rhodes Peele, Richard E. Pfeifer, Richard S. Pierce, A. Pix- 
ley, Carl Pomerance, Horacio A. Porta, Stephen V. Portnoy, Jim Propp, George 
B. Purdy, Bruce A. Reznick, Norman Richert, David R. Richman, Bruce Richter, 
Joseph B. Roberts, Wayne Roberts, Derek J.S. Robinson, C. Ambrose Rogers, Ken- 
neth A. Ross, Bruce L. Rothschild, Joseph J. Rotman, David E. Rowe, Ranjan Roy, 
Lee A. Rubel, Walter Rudin, 


Donald G. Saari, Hans Sagan, Bill Sands, Jonathan Schaer, Edward Schein- 
erman, James H. Scherl, Andrzej Schinzel, Thomas Schmidt, Frank W. Schmidt, 
Eric Schmutz, Lowell Schoenfeld, Paul E. Schupp, David B. Secrest, John L. Self- 
ridge, M. Senechal, Richard D. Shafer, Jeffrey O. Shallit, Daniel Shanks, Daniel B. 
Shapiro, Louis Shapiro, S. Shariari, Donald R. Sherbert, Lance W. Small, J. Laurie 
Snell, William M. Snyder, Lawrence Somer, H.M. Srivastava, Richard P. Stanley, 
Ralph G. Stanton, Dennis Stanton, John Steinig, William Stenger, Dorothy M. 
Stone, Arthur H. Stone, Gilbert Strang, Robert Strichartz, Keith Stroyan, Lajos 
F. Takacs, Gaisi Takeuti, F. Tall, M. Townsend, Edward C. Turner, Daniel UIl- 
man, Stephen V. Ullman, Daniel J. Velleman, Samuel S. Wagstaff, Walter Wallis, 
John H. Walter, Edward T.H. Wang, Lawrence C. Washington, William C. Water- 
house, William A. Webb, Paul M. Weichsel, Elliott C. Weinberg, Howard J. Weiner, 
Gregory P. Wene, Alan Wessel, John E. Wetzel, Robert A. Wijsman, Jet Wimp, 
Andrew Woldar, Katherine Yerion, Peter Yff, William R. Zame, Peter Zvengrowski, 
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Telegraphic Reviews are designed to alert readers in a timely manner to new books and 
computer software appropriate to mathematics teaching and research. Special codes classify 
reviews by subject area and appropriate use: 


T: Textbook P: Professional Reading 
C: Computer Software L: Undergraduate Library 
S: Supplementary Reading 13: Grade Level 


1—4: Semesters 
**: Special Emphasis 
??: Questionable 


Readers are advised that price information is subject to change, that computer software is 
often available also on other machines, and that hardware variations often cause software 
incompatibilities. Selected books and software packages receive a second, more extensive 


review in the MONTHLY. 

General, S(15-17), L**. Introduction to Classi- 
cal Mathematics I: From the Quadratic Reciprocity 
Law to the Uniformization Theorem. Helmut Koch. 
Transl: John Stillwell. Math. & Its Applic., V. 
70. Kluwer Academic, 1991, xvii + 453 pp, $149. 
[ISBN: 0-7923-1231-7] A unique volume providing 
clear, concise highlights of nineteenth-century math- 
ematics (from Gauss to Riemann) in historical or- 
der but with consistent modern notation and proofs. 
Assuming two years of (European-level) university 
mathematics—which is summarized in twenty pages 
of appendices—the text covers virtually all of clas- 
sical mathematics with.refreshing economy. Exten- 
sive integrated historical notes provide narrative and 
historically-rooted rationale. No exercises. LAS 


General, P. Frontiers in Pure and Applied Mathe- 
matics. Ed: Robert Dautray. North-Holland (US 
Distr: Elsevier Science), 1991, 286 pp, $108.50. 
[ISBN: 0-444-88133-6] Nineteen research papers 
presented at a Paris symposium held to celebrate the 
60th birthday of Jacques-Louis Lions. Topics touch 
on a wide variety of structures and themes (e.g., nu- 
clear energy, Hamiltonian systems, turbulence, im- 
munology) reflecting the diverse influence of Lions’ 
work. Numerous photos from the symposium intro- 
duce the volume. LAS 

General, S(15-18), P, L. Symmetry of Dis- 
crete Mathematical Structures and Their Symmetry 
Groups: A Collection of Essays. Eds: K.H. Hof- 
mann, R. Wille. Res. & Expos. in Math., V. 15. Hel- 
dermann Verlag, 1991, vi + 168 pp, $68 (P). [ISBN: 
3-88538-215-6] Five essays, by distinguished math- 
ematicians, delivered at a large symposium on the 
role of symmetry in the arts, architecture, literature, 
music, the natural science, and in every-day life. The 
contributions of Bokowski, Schulte, and Wills exem- 
plify the function of symmetry in modern mathemat- 
ical research; those of Dress, Wille, and Hofmann 
(each in German) are scholarly essays on the con- 
cept of symmetry in mathematics, but are general in 
nature and require no training in mathematics. LCL 


Elementary, T(13), S. Practical Problems in 
Mathematics for Heating and Cooling Technicians, 


Second Edition. Russell Devore. Delmar, 1991, viii 
+ 248 pp, $10.95 (P). [ISBN: 0-8273-4062-1] Arith- 
metic, measurement, areas, arcs, volumes, etc., ap- 
plied to pipe and duct geometry, heat load calcula- 
tions, estimates and bills, and other common plumb- 
ing tasks. Workbook style, with space for answers to 
be written in. LAS 


Mathematics Appreciation, T(13-14). Math- 
ematical Modeling for the Marketplace: Applying 
Graph Theory in Liberal Arts and Social and Man- 
agement Sciences. Helen Christensen. Kendall 
Hunt, 1988, vii + 167 pp, $18.95 (P). [ISBN: 0- 
8403-4661-1] Graphs and digraphs provide tools 
and techniques to model and solve problems from 
a variety of liberal arts and social science disciplines. 
Intended for a “liberal arts mathematics” course; ac- 
cessible with minimal mathematical background, yet 
stimulating for better-prepared students. Includes 
examples of student projects, and chapter on induc- 
tive reasoning. MW 


Education, C, P. How to Use the Spreadsheet as 
a Tool in the Secondary School Mathematics Class- 
room. William J. Masalski. NCTM, 1990, 76 pp, 
$16.50 (P); 2 diskettes included. [ISBN: 0-87353-303- 
8} Thirty spreadsheet activities including magic 
squares, root-finding by successive approximation 
and Newton’s method, and probability simulations. 
Computer data files provided for Apple Works and 
Better Working Spreadsheets on Apple IIe’s, but 
book contains enough detail to allow translation to 
other spreadsheets. Both problem-solving and elec- 
tronic blackboard activities. Useful for methods 
courses and teacher-enhancement projects. MW 


Education, T, P. Teaching and Learning Com- 
puter Programming: Multiple Research Perspectives. 
Ed: Richard E. Mayer. Lawrence Erlbaum Assoc, 
1988, xvi + 320 pp, $49.95. [ISBN: 0-8058-0073-5] 
Reports of recent theory-based research studies on 
Logo, BASIC, and Pascal programming. Although 
early research did not support initial strong claims of 
discovery-learned programming as a vehicle for learn- 
ing problem solving skills, these studies suggest some 
transfer is achieved by more explicit instruction. MW 
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Education, T(15-17). Ezperiences in Math for 
Young Children, Second Edition. Rosalind Charles- 
worth, Deanna J. Radeloff. Delmar, 1991, vii + 454 
pp, $17.67. [ISBN: 0-8273-4630-1] For pre-service 
and in-service teachers of pre-school and primary age 
children. Background information on concept devel- 
opment a la Piaget, then sample tasks emphasizing 
a developmental sequence through naturalistic, in- 
formal, and structured learning. MW 


Education, T(14-16). Elementary Mathematics 
for Teachers, Second Edition. Donald F. Devine, 
Judith Olson, Melfried Olson. Wiley, 1991, xvii + 
820 pp, $51.95. [ISBN: 0-471-85947-8] Fairly stan- 
dard treatment with references (though often periph- 
eral) to NCTM Standards and recent learning re- 
search. Two chapters on problem solving, one on 
Logo, but little integration of either into the rest of 
the text. MW 


Education, S(16-17). Math Power in the Com- 
munity. Ed: Gerald Kulm. AAAS, 1990, xviii + 217 
pp, $9.95 (P), [ISBN: 0-87168-398-9]; Math Power at 
Home, xxiv + 217 pp, $9.95 (P), [ISBN: 0-87168-397- 
0]; Math Power in School, xxii + 235 pp, $9.95 (P); 
$19.95 set. [ISBN: 0-87168-396-2] Coordinated sets 
of enrichment activities for middle school students 
in home, school, and community settings. Leader 
notes and student sheets, some borrowed from other 
sources, guide exploration in patterns and symbols; 
rational numbers; geometric and spatial reasoning; 
and probability and statistics. Emphasis on problem 
solving, application, estimation, and conceptual un- 
derstanding. Permission to copy allows teachers to 
coordinate efforts by sending home targeted activi- 
ties. MW 


History. Paul Halmos: Celebrating 50 Years of 
Mathematics. Eds: John H. Ewing, F.W. Gehring. 
Springer-Verlag, 1991, ix + 320 pp, $49. [ISBN: 
0-387-97509-8] <A Festschrift published on the oc- 
casion of Halmos’s 75th birthday: half biographi- 
cal reminiscing by Halmos’s many friends and col- 
leagues, half mathematical exposition on topics re- 
lated to Halmos’s diverse mathematical interests. 
Contains many fascinating tales by and about the 
many men of mathematics who have been influenced 
by Halmos at different stages of his career. What 
shines through is not only Halmos’s well-known love 
for mathematics and writing, but also his dedication 
to teaching and his thorough sense of professional- 
ism. LAS 


Foundations, S(16-18), P, L. Aziomatic Set The- 
ory. Paul Bernays. Dover, 1991, viii + 227 pp, $6.95 
(P). [ISBN: 0-486-66637-9] An unabridged and un- 
altered republication of the 1968 Second Edition of 
the work first published by North-Holland in 1958. 
Includes an introduction to the original Zermelo- 
Fraenkel set theory and a detailed account of the 
author’s independent presentation of a formal sys- 
tem of axiomatic set theory. LCL 


Linear Algebra, T(15-16: 1, 2). Matriz The- 
ory with Applications. Jack L. Goldberg. McGraw- 
Hill, 1991, xxi + 520 pp. [ISBN: 0-07-557200-1] De- 
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signed to be used as a textbook in an upper-division 
course on matrix theory for students in engineer- 
ing and the physical sciences. The first six chapters 
provide the foundation for a detailed study of the 
eigenvalue-eigenvector problem; the remaining chap- 
ters cover more advanced material on unitary sim- 
ilarities, quadratic forms, and convex polyhedrons. 
Optional subsections and exercises illustrate the use 
of MATLAB. AO 


Linear Algebra, T(17), S, P, L. Generalized In- 
verses of Linear Transformations. S.L. Campbell, 
C.D. Meyer, Jr. Dover, 1991, xi + 272 pp, $8.95 
(P). [ISBN: 0-486-66693-X] Emphasis placed on util- 
ity by presenting many diverse applications, e.g., of 
the generalized inverse to electrical engineering and 
of the Drazin inverse to Markov chains. Elementary 
linear algebra assumed. (1979 Pitman edition, TR, 
December 1979.) KS 


Algebra, T(18: 1, 2), P. Ring Theory, Student 
Edition. Louis H. Rowen. Academic Pr, 1991, xxvil 
+ 623 pp, $55. [ISBN: 0-12-599840-6] An abridged 
edition of the author’s two-volume work (TRs Jan- 
uary 1989 and February 1989). More suitable as 
a graduate-level text in ring theory. Material not 
needed for proving essential theorems, especially in 
the later chapters, has been cut. Formating and 
numbering of results matches the original volumes, 
enabling the student to find easy access to further 
reading. LCL 


Algebra, P. Representation Theory of Finite 
Groups and Finite-Dimensional Algebras. Eds: G.O. 
Michler, C.M. Ringel. Progress in Math., V. 95. 
Birkhauser Boston, 1991, ix + 520 pp, $86.50. 
[ISBN: 0-8176-2604-2] Seven survey articles and 
seventeen research articles presented at the conclud- 
ing conference in Bielefeld, 1991, for the Deutsche 
Forschungsgemeinschaft, begun in 1984. JS 


Algebra, P. Computational Aspects of Lie Group 
Representations and Related Topics. Ed: A.M. Co- 
hen. CWI Tract, V. 84. Stichting Mathematisch Cen- 
trum, 1991, ii+ 142 pp, Dfl. 39 (P). [ISBN: 90-6196- 
395-8] Proceedings of the 1990 Computational Al- 
gebra Seminar at CWI, Amsterdam. LC 


Calculus, T(13). Brief Calculus with Applica- 
tions, Third Edition. Roland E. Larson, Robert P. 
Hostetler, Bruce H. Edwards. DC Heath, 1991, xvi + 
919 pp, $40 net. [ISBN: 0-669-21767-0] In this edi- 
tion chapters begin with an overview and end with 
discussion questions. Calculator exercises and new 
business applications are added; the treatment of 
probability is expanded. (First Edition, TR, March 
1984; Second Edition, TR, December 1987.) OJ 

Real Analysis, T(15-16: 1, 2), S, L. Fundamen- 
tals of Abstract Analysis. Andrew M. Gleason. Jones 
& Bartlett, 1991, xi + 404 pp, $35. [ISBN: 0-86720- 
209-2] “Were I starting from scratch, I would prob- 
ably write a different book, but not a very different 
book.” Re-issue of a 1966 Addison-Wesley text (TR, 
March 1969) that introduces budding mathematics 
majors to the set-theoretic point of view that has 
so profoundly shaped modern mathematics. Based 
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on naive set theory, this “bridge” text moves from 
sets and logic through construction of real and com- 
plex numbers, to metric spaces and analytic func- 
tions. A classic text that has withstood well the test 
of time. LAS 


Complex Analysis, S$(18), P. Geometric Func- 
tion Theory in Several Complez Vartables. Junjiro 
Noguchi, Takushiro Ochiai. Transl. of Math. Mono., 
V. 80. AMS, 1990, xi + 282 pp, $78. [ISBN: 0-8218- 
4533-0] An expanded and rewritten version of the 
Japanese original published in 1984. A comprehen- 
sive introduction, at a high level, to theory, meth- 
ods, and results of geometric function theory in sev- 
eral complex variables. Chapters treat hyperbolic 
manifolds, currents and plurisubharmonic functions, 
meromorphic maps, Nevanlinna theory, and value 
distribution of holomorphic curves. Presumes famil- 
iarity with basics of several complex variables and 
complex manifolds. PZ 


Complex Analysis, T**(17-18: 1-3), P**, L. 
Theory of Complez Functions. Reinhold Remmert. 
Grad. Texts in Math., V. 122. Transl: Robert 
B. Burckel. Springer-Verlag, 1991, xix + 453 
pp, $59. [ISBN: 0-387-97195-5] A readable, invit- 
ing, enthusiastic—sometimes downright exuberant— 
introduction (though at a high level) to the theory 
of complex functions of one variable. <A veritable 
treasure trove of mathematical, historical, and cul- 
tural gems: e.g., includes brief biographies of six key 
figures, a photo of Riemann’s tomb, line drawings 
of Euler, Riemann, Cauchy, and Weierstrass, etc., 
etc. Development traces three-fold approach to sub- 
ject: Cauchy’s, via function theory; Riemann’s, via 
geometric methods; Weierstrass’s, via power series. 
With a goodly number of exercises. A remarkable, 
generous, and rewarding book. PZ 


Differential Equations, P. Differential Equations: 
Stability and Control. Ed: Saber Elaydi. Lect. Notes 
in Pure & Appl. Math., V. 127. Marcel Dekker, 
1991, xvii + 532 pp, $110 (P). [ISBN: 0-8247-8404-9] 
Proceedings (fifty-two papers) of an international 
conference held at Colorado College, June 7-10, 1989. 
In addition to papers on stability and control, the 
volume contains chapters on recent developments in 
oscillation theory of differential and difference equa- 
tions, and the present status of the theory of equa- 
tions with impulses. LCL 


Differential Equations, T(15-16: 1, 2). Dzf- 
ferential Equations with Applications and Histori- 
cal Notes, Second Edition. George F. Simmons, 
John S. Robertson. McGraw-Hill, 1991, xxi + 629 
pp, $44.95. [ISBN: 0-07-057540-1] Main changes 
in this edition include more exercises; new chapters 
on Fourier series, on partial differential equations, 
and on numerical methods; new material on operator 
methods; and engineering applications of the Laplace 
transform. (First Edition, TR, June-July 1972; Ex- 
tended Reviews, October 1975 and November 1975.) 
LCL 


Differential Equations, T(18: 1, 2), P. Nonlin- 
ear Ordinary Differential Equations and Thetr Ap- 
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plications. P.L. Sachdev. Pure & Appl. Math., V. 
142. Marcel Dekker, 1991, xviii + 578 pp, $110. 
[ISBN: 0-8247-8364-6] A high-level, but compara- 
tively leisurely and informal, introduction to theory 
and applications of nonlinear ordinary differential 
equations, from the analytic point of view. Chapters 
1-4 are elementary; many results are classical. Chap- 
ters 5-9 survey modern results and techniques. Ap- 
plications are made to fluid dynamics, biomathemat- 
ical problems, and other areas. A wealth of exam- 
ples and diagrams motivate more theoretical treat- 
ments. PZ 


Numerical Analysis, T(15-17: 1), C, L*. Nu- 
merical Methods and Software. David Kahaner, 
Cleve Moler, Stephen Nash. Prentice Hall, 1989, 
xii + 495 pp. [ISBN: 0-13-627258-4] Written in 
a style similar to the now-classic Computer Meth- 
ods for Mathematical Computations by G. Forsythe, 
et al (TR, February 1978). Intermediate in level 
between a cookbook and a numerical analysis text- 
book, this volume discusses many of the topics most 
important for scientists and engineers who will use 
numerical methods in their work. Topics include 
computer arithmetic, solution of linear and nonlin- 
ear systems of equations, interpolation, numerical 
quadrature, linear and nonlinear least-squares, data 
fitting, integration of ordinary differential equations, 
random number generators and simulation, and the 
fast Fourier transform. AO 


Operator Theory, P. The Theory of Subnormal 
Operators. John B. Conway. Math. Surveys & 
Mono., V. 36. AMS, 1991, xv + 436 pp, $128. 
[ISBN: 0-8218-1536-9] A research monograph con- 
taining background material and exercises. Recent 
development of subnormal operator theory “oriented 
towards rational functions rather than polynomials.” 
Extensive bibliography. MLR 


Functional Analysis, S(18), P. Interpolation 
Functors and Interpolation Spaces, Volume I. Yu. 
A. Brudnyi, N. Ya. Krugljak. Math. Lib., V. 47. 
North-Holland (US Distr: Elsevier Science), 1991, 
xv + 718 pp, $157. (ISBN: 0-444-88001-1] Classi- 
cal interpolation theorems, interpolation spaces and 
interpolation functors, the real method, and selected 
questions of the theory of the real method. Volumes 
II and III to be on the complex method and appli- 
cations, respectively. Acquaintance with functional 
analysis and function theory required. KS 


Functional Analysis, P. Orthogonal Schauder 
Bases. Taqdir Husain. Pure & Appl. Math., V. 
143. Marcel Dekker, 1991, xvii + 283 pp, $99.75. 
[ISBN: 0-8247-8508-8] “Is each multiplicative lin- 
ear functional on a complex Fréchet algebra contin- 
uous?” This monograph contains an account of re- 
cent research on this long-standing open question of 
E. Michael. MLR 


Analysis, P. Progress in Approzimation Theory. 
Eds: Paul Nevai, Allan Pinkus. Academic Pr, 1991, 
xi + 916 pp, $189. [ISBN: 0-12-516750-4] Sixty-two 
research articles on pure and applied approximation 
theory and related areas. Note price. LC 
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Analysis, T(18: 1), S, P, L. Integral Equations: 
A Practical Treatment, From Spectral Theory to Ap- 
plications. David Porter, David S.G. Stirling. Texts 
in Appl. Math. Cambridge Univ Pr, 1990, xi + 372 
pp, $69.50; $27.95 (P). (ISBN: 0-521-33151-X; 0-521- 
33742-9] Written for a broader audience interested 
in applications as well as basic theory; assumes some 
background in real and functional analysis, differen- 
tial equations. Offers a readable treatment of such 
topics as compact and positive operators, approxima- 
tion methods, and singular integral equations. Plen- 
tiful exercises, commentary, appendices, index. JS 


Analysis, P. Inequalities: Fifty Years On from 
Hardy, Littlewood, and Pélya. Ed: W. Norrie 
Everitt. Lect. Notes in Pure & Appl. Math., V. 
129. Marcel Dekker, 1991, ix + 283 pp, $99.75 (P). 
[ISBN: 0-8247-8488-X] Thirteen (of the fourteen) ple- 
nary lectures given at an international conference 
held at the University of Birmingham, England, July 
13-17, 1987. The conference was devoted to encour- 
aging the study of inequalities and to acknowledge 
the indebtedness of the subject to the work of Hardy, 
Littlewood, and Pélya. LCL 


Algebraic Geometry, P. Number Theory III: Dio- 
phantine Geometry. Ed: Serge Lang. Springer-Ver- 
lag, 1991, xiii + 296 pp, $59. [ISBN: 0-387-53004-5] 
An up-to-date exposition of results and conjectures 
in the field. The wide-ranging text reaches such top- 
ics as heights, Faltings’ finiteness theorems, modular 
curves over Q, Arakelov theory, and diophantine ap- 
proximations. Proofs are mostly omitted. A valuable 
reference. SG 


Algebraic Geometry, P. Complez Abelian Vari- 
ettes and Theta Functions. George R. Kempf. Uni- 
versitext. Springer-Verlag, 1991, ix + 100 pp, $29. 
[ISBN: 0-387-53168-8] An exposition of the ana- 
lytic approach to complex abelian varieties. The 
book covers invertible sheaves and sections, and co- 
homology of abelian varieties, mappings to projec- 
tive space, theta functions, moduli spaces, modular 
functions, and Mumford’s theta group. SG 


Differential Geometry, P. D-modules and Spheri- 
cal Representations. Frédéric V. Bien. Math. Notes, 
V. 39. Princeton Univ Pr, 1990, 131 pp, $22.50 
(P). [ISBN: 0-691-02517-7] The author’s Ph.D. the- 
sis (advisor, Joseph Bernstein); the goal is to study 
the representations of reductive Lie groups which oc- 
cur in the space of smooth functions on an indefinite 
symmetric space. LCL 


Geometry, S**(15-18), P**, L***. Unsolved 
Problems in Geometry. Hallard T. Croft, Kenneth J. 
Falconer, Richard K. Guy. Problem Books in Math. 
Springer-Verlag, 1991, xv + 198 pp, $39.95. [ISBN: 
0-387-97506-3] An invaluable resource that will be 
appreciated at several levels: for research mathe- 
maticians there is a supply of problems with bibli- 
ography and current status; for general mathemati- 
cians, it is a source of interesting reading; and for 
laymen (and students), it is an introduction to geom- 
etry at a research level. The problems are grouped 
into seven chapters: convexity; polygons, polyhe- 
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dra, and polytopes; tiling and dissection; packing 
and covering; combinatorial geometry; finite sets of 
points; general geometric problems. We are indebted 
to the authors and other correspondents who have 
contributed to this volume over the past 30 years; a 
marvelous book! LCL 


Geometry, P. Vision Geometry. Eds: Robert A. 
Melter, Azriel Rosenfeld, Prabir Bhattacharya. Con- 
temp. Math., V. 119. AMS, 1991, xiii + 237 pp, $87 
(P). [ISBN: 0-8218-5125-X] Contains fourteen pa- 
pers from a Special Session on “Geometry Related 
to Computer Vision” at the AMS meeting in Hobo- 
ken, New Jersey, October 20-21, 1989. OJ 


Control Theory, T(17-18), P. Lecture Notes in 
Control and Information Sciences-153: Control The- 
ory in the Plane. O. Hajek. Springer-Verlag, 1991, 
x + 269 pp, $42 (P). [ISBN: 0-387-53553-5] An 
example-driven introduction to the study of control 
systems governed by ordinary differential equations. 
The first three chapters are general in nature; the 
subsequent four chapters are devoted to topics in the 
plane, with special reference to the two-dimensional 
phase plane as state space. Includes many exer- 
cises. LCL 


Systems Theory, T(16-17: 1), P, L. Kalman Fil- 
tering with Real-Time Applications, Second Edition. 
C.K. Chui, G. Chen. Ser. in Inform. Sci., V. 17. 
Springer-Verlag, 1991, xvi + 195 pp, $39.50 (P). 
[ISBN: 0-387-18395-7] A concise introduction to the 
mathematical theory of Kalman filtering, this edi- 
tion has an expanded section on “real-time system 
identification,” and a brief introduction to wavelet 
analysis. LCL 


Probability, P. Advances in Probability Distribu- 
tions with Given Marginals: Beyond the Copulas. 
Eds: G. Dall’Aglio, S. Kotz, G. Salinetti. Math. & 
Its Applic., V. 67. Kluwer Academic, 1991, xviii + 
231 pp, $94. [ISBN: 0-7923-1156-5] Contains eleven 
invited papers given at a conference in Rome, April 
1990. A breadth of topics reflects the wide variety of 
work in the area. TAV 


Probability, P. Probability Theory and tts Ap- 
plications in China. Eds: Yan Shi-Jian, Yang 
Chung-Chun, Wang Jia-Gang. Contemp. Math., 
V. 118. AMS, 1991, xv + 333 pp, $46 (P). [ISBN: 
0-8218-5126-8] Nineteen papers, mostly surveys of 
recent works, collected to demonstrate the breadth 
of current work in probability in China. The editors 
hope this collection will help spur collaborative work 
with Western researchers. TAV 


Stochastic Processes, P. Stochastic Analysis: 


Liber Amicorum for Moshe Zakai. Eds: Eddy 
Mayer-Wolf, Ely Merzbach, Adam Shwartz. Aca- 
demic Pr, 1991, xx + 532 pp, $69.95. [ISBN:0- 


12-481005-5] Twenty-nine papers in the fields of 
stochastic differential equations, nonlinear filtering, 
two parameter Martingales, Wiener spaces and re- 
lated topics—all areas of Zakai’s interests. Presented 
in Haifa in January 1991 in honor of Zakai’s 65th 
birthday. TAV 


Stochastic Processes, T(16), P*. The Theory 


1991] 


of Optimal Stopping. Y.S. Chow, Herbert Robbins, 
David Siegmund. Dover, 1991, xii + 141 pp, $6.95 
(P). [ISBN: 0-486-66650-6] A reprint of the 1971 
text (original title: Great Ezpectations, TR, Febru- 
ary 1972). The topic remains fresh and fascinating 
twenty years later. Contains numerous examples, ex- 
ercises, and a good (if dated) bibliography. TAV 


Stochastic Processes, P. Generalized Diffusion 
Processes. N.I. Portenko. Transl. of Math. Mono., 
V. 83. AMS, 1990, ix + 180 pp, $81. [ISBN: 0-8218- 
4538-1] The author’s (1978) doctoral dissertation, 
this treatment of diffusion processes in a medium 
moving in a nonregular way assumes a strong back- 
ground: Martingale theory, stochastic differential 
equations, Markov processes, and the main prop- 
erties of second order partial differential equations 
of parabolic type. With this background the reader 
finds a coherent and careful text. Contains a limited 
bibliography. TAV 

Statistics, P. Nonparametric Functional Estimation 
and Related Topics. Ed: George Roussas. NATO 
ASI Ser. C, V. 335. Kluwer Academic, 1991, xiii 
+ 708 pp, $215. [ISBN: 0-7923-1226-0] Collection 
of papers presented at a NATO-sponsored Advanced 
Study Institute on nonparametric functional estima- 
tion held in Greece in 1990. Includes sections on 
curve and functional estimation, parameter selection, 
regression models, dependent data, time series anal- 
ysis, and various related topics. List of attendants, 
index. Note price. JS 


Statistics, P, L. Statistical Reasoning with Impre- 
cise Probabilities. Peter Walley. Chapman & Hall, 
1991, xii + 706 pp, $137. [ISBN: 0-412-28660-2] 
The author adopts a behavioral interpretation of 
probability to justify a set of coherence axioms lead- 
ing to a modification of Bayes theory to allow for 
imprecise probabilities (including, but not limited to, 
upper and lower probabilities). Resulting theory is 
compared and contrasted with Bayes theory. Some 
discussion on the assessment and elicitation of im- 
precise probabilities. RWJ 


Statistics, T(17), S*, P. Probability, Statistical 
Optics, and Data Testing: A Problem Solving Ap- 
proach, Second Edition. B. Roy Frieden. Ser. in 
Inform. Sci., V. 10. Springer-Verlag, 1991, xx + 443 
pp, $59 (P). [ISBN: 0-387-53310-9] While the set- 
ting is statistical optics, the author’s intent is to pro- 
vide an extensive introduction to the power and ap- 
plicability of probabilistic and statistical modelling. 
The stochastic nature of many problems in optics 
gives a rich and relevant environment for this study. 
A well-conceived and executed text. TAV 


Statistics, T(15-17: 1), L. Statistical Methods: 
The Geometric Approach. David J. Saville, Gra- 
ham R. Wood. Texts in Stat. Springer-Verlag, 1991, 
xv + 560 pp, $49.50. [ISBN: 0-387-97517-9] Novel 
presentation of the main ideas of analysis of vari- 
ance, analysis of covariance, and regression analysis. 
Uses a geometric approach, assuming a background 
of elementary vector geometry and a first course in 
statistics. Contains numerous examples, which are 
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principally agricultural or biological. Computations 
are done using Minitab commands. RSK 


Computer Literacy, S(14-16), L*. Computeri- 
zation and Controversy: Value Conflicts and Social 
Choices. Eds: Charles Dunlop, Rob Kling. Aca- 
demic Pr, 1991, xviii + 758 pp, $34.95 (P). [ISBN: 0- 
12-224356-0] A thoughtfully selected and carefully 
edited anthology of nearly fifty readings on a va- 
riety of issues pertaining to the role of computers 
as a powerful factor for social change: technologi- 
cal utopianism, economic and organizational dimen- 
sions, transformation of work, social relationships, 
privacy, security, and ethical responsibilities. Each 
section opens with a very helpful introduction by 
the editors complete with references for further read- 
ing. LAS 

Computer Systems, S, C. Mathematica Help 
Stack. Robert Campbell, et al. Variable Sym- 
bols (2161 Shattuck Ave., Suite 202, Berkeley, CA 
94704-1313; 415-843-8701), 1990, $89. A Macin- 
tosh hyper-card application providing a visual tree- 
structured reference manual for the symbolic com- 
putation package Mathematica, Version 1.2. Open- 
ing “map” groups commands by type (e.g., calcu- 
lus, number theory, lists); next layer shows options 
within each type; third layer is a fully active “ref 
page” for the selected option, giving syntax, descrip- 
tion, examples, cross references, and offering an ac- 
tive window for user-added notes. Alternate com- 
mands and cross references are instantly available 
by a mouse-click. Requires 4MB to operate; to si- 
multaneously run Mathematica would require much 
more. LAS 


Computer Systems, S. Mathematica Quick Ref- 
erence. Nancy Blachman. Variable Symbols (2161 
Shattuck Ave., Suite 202, Berkeley, CA 94704-1313; 
415-843-8701), 1990, $14.95. A spiral-bound, slim (4 
x 8.5 in.) alphabetic list of all Mathematica built-in 
objects (functions, variables, options) and packages, 
giving exact syntax and brief description. Handy, 
easily portable, and very useful. LAS 


Computer Systems, S. UNIX RefGuide. McNulty 
Development. Prentice Hall, 1986, iv + 124 pp, (P). 
[ISBN: 0-13-938957-0] A spiral-bound card file con- 
taining a mixture of introductory examples and refer- 
ence charts for the most common UNIX commands. 
The thin selection of commands makes it more valu- 
able as a guide (for beginners) than as a reference. 
Somewhat dated, but still useful. LAS 


Artificial Intelligence, P. Neural Networks: The- 
ory and Applications. Eds: Richard J. Mammone, 
Yehoshua Y. Zeevi. Academic Pr, 1991, xv + 355 
pp, $39.95. [ISBN: 0-12-467050-4] Papers from a 
1990 conference at Rutgers giving overview of fea- 
tures of neural nets and promising approaches. Top- 
ics include self organization, learning, vision, charac- 
ter recognition, hybrid nets, nets to solve combina- 
torial problems. RM 

Artificial Intelligence, S(16), L. Computation & 
Cognition: Proceedings of the First NEC Research 
Symposium. Ed: C.W. Gear. SIAM, 1991, ix + 168 
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pp, $27.50. [ISBN: 0-89871-272-6] Eight research 
papers presented at a symposium held in Princeton, 
New Jersey in May 1989. Under the main theme 
of models for intelligent computation, it addressed 
a number of different research efforts in the area of 
computer design which are aimed at building com- 
puters for artificial intelligence and cognitive science. 
This includes such projects as cellular automata and 
neural networks. GMS 


Computer Science, S(15-16), P, L. Practical 
UNIX Security. Simson Garfinkel, Gene Spafford. 
O’Reilly & Assoc, 1991, xvii + 481 pp, $29.95 
(P). [ISBN:0-937175-72-2] UNIX is one of the most 
widely used and widely studied operating systems. 
One of the major complaints against it is that it is a 
very insecure system that can easily be broken into 
by the typical “hacker.” This book addresses that 
problem by defining what a secure operating system 
is and how to achieve relatively high levels within the 
typical UNIX implementation. GMS 


Computer Science, $(16), P, L. Taking Software 
Design Seriously: Practical Techniques for Human- 
Computer Interaction Design. Ed: John Karat. Aca- 
demic Pr, 1991, x + 357 pp, $44.50. [ISBN: 0-12- 
397710-X] Contains seventeen papers presented at a 
workshop on software engineering in April 1990. All 
of the papers address the cause of human-computer 
interface design, especially as it relates to the imple- 
mentation of practical real world software. Papers 
also address questions on interface design evaluation, 
tools, and training techniques. GMS 


Computer Science, S(15-16), P, L. POSIX Pro- 
grammer’s Guide: Writing Portable UNIX Programs 
with the POSIX.1 Standard. Donald A. Lewine. 
O’Reilly & Assoc, 1991, xxvii + 597 pp, $34.95 (P). 
[ISBN: 0-937175-73-0] POSIX stands for Portable 
Operating System based on UNIX. It is an attempt 
to create a single universal UNIX standard so that 
UNIX-based applications will be portable to a wide 
range of systems. Book contains a detailed technical 
description of the POSIX standard as established by 
OSF, the Open Software Foundation. GMS 


Applications, P. Transactions of the Eighth Army 
Conference on Applied Mathematics and Comput- 
ing. US Army Research Office (POB 12211, Research 
Triangle Park, NC 27709), 1990, xxi + 925 pp, (P). 
Technical papers on a wide variety of topics, notably 
turbulance, variational problems, and symbolic com- 
putation, from a conference held at Cornell Univer- 
sity during June 1990. LAS 

Applications (Biological Science), S(17-18). 
Theory of Heart: Biomechanics, Biophysics, and 
Nonlinear Dynamics of Cardiac Function. Eds: Leon 
Glass, Peter Hunter, Andrew McCulloch. Inst. for 
Nonlinear Sci. Springer-Verlag, 1991, xvii + 611 pp, 
$59. (ISBN: 0-387-97483-0] A collection of lengthy 
papers covering diverse topics in cardiology from a 
physical science perspective. Most of the papers con- 
tain little if any mathematics; however, a few demon- 
strate an impressive command of higher mathemat- 
ics. MU 
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Applications (Economics), P. Control Theory 
and Dynamic Games in Economic Policy Analysis. 
Maria Luisa Petit. Cambridge Univ Pr, 1990, xiv 
+ 338 pp, $49.50. [ISBN: 0-521-38523-7] Mathe- 
matical techniques applied to economic policy anal- 
ysis (stabilization and control of economic systems) 
based on theory of dynamic deterministic economic 
models, and continuous time analysis. Addressed to 
economists. RM 

Applications (Physics), T(18: 2), S, P, L. Lie 
Algebras, Part 1: Finite and Infinite Dimensional 
Ine Algebras and Applications in Physics. G.G.A. 
Bauerle, E.A. de Kerf. Stud. in Math. Physics, V. 1. 
North-Holland (US Distr: Elsevier Science), 1990, 
xvi + 394 pp, $105.75. [ISBN: 0-444-88776-8] Writ- 
ten to bridge the gap between formal rigorous de- 
velopment of theory of Lie algebras and its avail- 
ability for application by theoretical physicists; as- 
sumes only a background of undergraduate linear al- 
gebra (and considerable motivation). Primary goal is 
the structure theory for complex semi-simple finite- 
dimensional Lie algebras followed by construction 
and classification of Kac-Moody algebras. Necessary 
definitions and sufficient theorems developed to make 
results intelligible. Few proofs, many references. Nu- 
merous remarks, examples, exercises. References, in- 
dex. JS 

Applications (Physics), T(16-18: 1, 2), S, L. 
An Introduction to General Relativity. L.P. Hugh- 
ston, K.P. Tod. London Math. Soc. Student Texts, 
V. 5. Cambridge Univ Pr, 1990, 183 pp, $49.50; 
$19.95 (P). [ISBN: 0-521-32705-9; 0-521-33943-X] It 
is difficult to imagine a suitable audience for this 
overly ambitious text. Many significant gaps exist in 
the development of the mathematical formulae. Thus 
many readers who are not already fluent with the at- 
tendant mathematics will require extensive collateral 
instruction. Unfortunately, the authors continue the 
use of the Einstein summation convention, which cre- 
ates ambiguity and will frustrate many mathematical 
readers. MU 

Applications (Physics), S(17-18). Mechanics, 
Analysts and Geometry: 200 Years after Lagrange. 
Ed: Mauro Francaviglia. Delta Ser. North-Holland 
(US Distr: Elsevier Sci), 1991, xi + 559 pp, $137. 
[ISBN: 0-444-88958-2] Solicited articles by eminent 
scientists on dynamical systems, integrable systems 
and quantum groups, analytical mechanics and cal- 
culus of variations, global analysis, differential geom- 
etry, relativity and field theory, and history of math- 
ematics. MU 


Reviewers 


LC: Laura Chihara, St. Olaf; SG: Steven Galovich, Car- 
leton; OJ: Ockle Johnson, St. Olaf; RWJ: Roger W. John- 
son, Carleton; RSK: Richard S. Kleber, St. Olaf; LCL: Loren 
C. Larson, St. Olaf; RM: Richard Molnar, Macalester; AO: 
Arnold Ostebee, St. Olaf; MLR: Margaret L. Reese, St. Olaf; 
KS: Karen Saxe, Macalester; GMS: G. Michael Schneider, 
Macalester; JS: John Schue, Macalester; LAS: Lynn Arthur 
Steen, St. Olaf; MU: Milton Ulmer, Carleton; TAV: Theodore 
A. Vessey, St. Olaf; MW: Marthe Wallace, St. Olaf; PZ: Paul 
Zorn, St. Olef. 
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After inventing the @raph- 
ic Calculator and watching® it 
enhance the learning process. 
Casio has gone a step further. 
We've created The Next 
Generation, 

When students use the 
fx-7700G in tandem with a 
teacher's OH-7700G;, their minds 
are focused on the logic and 
purpose behind calculations, 
not just the mechanics of 
doing them. 

And Casio didi t just 
top Casio, we topped our 
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graphs. And 
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Plus, compared to 
our leading competitor. 
we have almost twice the 
memory. More zoom 
modes. 4-way scrolling (up 
and down, . 
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menu-Key 
approach that 
reduces key- 
board clutter 
while increas- 
ing function- 
ality. And a power- 
ful, “spread-sheet 
-like” 9X9 matrix 
that’s easy to view The OH-7700G. 
and edit. 
Casio’s Next Generation. A sure 
way to assist the teaching and im- 
prove the learning of America’s next 
generation. 
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New From 


Steven G. Krantz 


COMPLEX ANALYSIS: 
THE GEOMETRIC VIEWPOINT 


Steven G. Krantz 


Geometric methods have been used in 
complex analysis since the 1930s when 
Lars Ahlfors discovered that they give a nice 
way to look at the Schwarz lemma. Since 
that time they have become a central part of 
the research activities of complex analysis. 
However these important techniques have 
never found their way into a text accessible 
to a broad audience. 


Steven G. Krantz, a leading worker in com- 
plex analysis and a well-known mathemati- 
cal expositor, has written the first book ex- 
plaining how complex analysis can be stud- 
ied using methods of geometry. Assum- 
ing no background in Riemannian geometry, 
and only one semester of complex analysis, 
Krantz explains the role of Hermitian met- 
rics and of curvature in understanding the 
Schwarz lemma, normal families, Picard’s 
theorems, conformal mappings, and many 
other topics. A minimum of geometric for- 
malism is used to gain a maximum of ge- 
ometric and analytic insight. The climax of 
the book is an introduction to several com- 
plex variables from the geometric viewpoint. 
Poincaré's theorem, that the ball and bidisc 
are biholomorphically inequivalent, is dis- 
cussed and proved. 
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Except for the minimal background require- 
ments, the book is self-contained. A review 
of relevant topics in the classical theory of 
one complex variable is provided. The style 
is light and inviting. The book is a must for 
anyone with an interest in complex analysis. 
Take a glance at the main chapter headings 
and order your copy today 

™ Principal Ideas of Classical Function 
Theory, 

Basic Notions of Differential Geometry, 
Curvature and Applications, 

Some New Invariant Metrics, 

A Glimpse of Several Complex Vari- 
ables 


210 pp., 1990, Hardbound, 
ISBN 0-88385-026-5 


List $22.00 MAA Member $18.50 
Catalog Number CAM-23 


ORDER FROM 


(Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D. C. 20036 


Introducing E.Z. Math, E.Z. Algebra and E.Z. Arithmetic for the HP 48SX 


E.Z. Math, E.Z. Algebra and E.Z. Arithmetic are programs for the Hewlett Packard 48SX calculator conceived, written 
and programmed by Raymond La Barbera and the E.Z. Software Company. Each program comes on a 128K plug-in ROM 
card accompanied by an easy-to-understand, well-written, detailed manual loaded with lots of specific examples and _is 

esigned for use by students, teachers. parents and business people. Each program features an easy-to-use, logically 


organized, user-friendly interface which enables those who consider themselves to be calculator and computer illiterates, 
as well as those who don’t like to read manuals, to have full access to all program features quickly and easily. Since the 
HP 48S X is essentially an impressive, sharp-looking, 8 ounce pocket computer, students are easily motivated to take it 
along with them to study, practice, drill and master math in a study hall, on a train or bus, in a car, on line, on vacation, 
on a break—in short, for self-study at any time and in any place. 


What Can Be Done With E.Z. Math 


K.Z. Math effectively solves problems involving graphs, numbers, loans and savings. With k.Z. Math, anyone can: 
¢ Master the entire high school and college graphing curriculum, from algebra to calculus, with 188 families of equations, inequal- 
ities, functions, and systems, all arranged in an easy-to-use, user-friendly system of menus to make graphic analysis a snap! 
¢ Get extensive help with calculations involving fractions, whole numbers, complex numbers and number sequences 
¢ Easily do savings and loan calculations and generate complete amortization tables. 
e Learn many basic concepts including those involving sets, variables, graphing, solving, numbers, loans and savings 


What Can Be Done With E.Z. Algebra 
E.Z. Algebra is a comprehensive ninth grade high school basic algebra course as well as a high school and college remedial 
algebra course that builds a solid algebra foundation. With E.Z. Algebra, anyone can: 

¢ [Learn about sets, operations, variables, relations and other concepts essential to a real understanding of algebra. 

¢ Understand the sets of natural numbers, whole numbers, integers, rational numbers and real numbers. 

¢ Master the meaning and properties of the operations of addition, subtraction, multiplication, division, power and root. 

¢ Do all kinds of problems involving algebraic expressions, numerical phrases, equations and inequalities. 


What Can Be Done With E.Z. Arithmetic 


E.Z. Arithmetic is a comprehensive elementary school basic arithmetic course as well as a high school and college 
remedial arithmetic course that makes solving most arithmetic problems a snap! With E.Z. Arithmetic, anyone can: 
¢ Learn how to add, subtract, multiply, divide and order whole numbers, fractions, decimals, percents and integers. 
¢ Master the meaning, terminology and conversion methods for whole numbers, fractions, decimals and percents. 
e Drill and be graded on endless ; varied, randomly selected sets of problems involving whole numbers, fractions, decimals and 
integers, with the difficulty level, number of problems, operation and type of number user selectable. 


How To Order Copies or Get Further Information 
Each E.Z. Software program costs $130.00 ($125.00 retail, plus $5.00 shipping and handling). Take a 10% discount when 
ordering ten or more units. We accept payment by check, money order, COD, VISA, MC, AE and purchase order. If 
within 30 days you find that any E.Z. Software program fails to meet your expectations, we'll gladly take back your copy 
for a prompt, courteous refund. To order copies, either individually or bundled with HP 48SX calculators, please contact: 
SMI Corporation, 250 West New Street, Dept MM2, Kingsport, Tennessee 37660 
(800) 234-0123 or (615) 378-4821 or (615) 245-8982 (Fax). 


ADVENTURES OF A MATHEMATICIAN 


$.M. ULAM 

WITH A NEW INTRODUCTION BY DANIEL HIRSCH & WILLIAM G. MATHEWS, 
AND AN EPILOGUE BY FRANCOISE ULAM & JAN MYCIELSKI 

NOW IN PAPERBACK—The autobiography of mathematician 
Stanislaw Ulam, one of the great minds of the twentieth century, 
is rich with amazingly prophetic speculations and peppered with 
lively anecdotes. 384 pages, 25 photographs, $15.95 paper 
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WEIGHT MULTIPLICITIES, AND 


BRANCHING RULES 


VOLUME 1 & VOLUME 2 
$. KASS, R. V. MOODY, J. PATERA, & R. SLANSKY 


“This is mathematics the way it should be written, with lots of 


At bookstores 

or order toll-free 
1-800-822-6657. 
Visa/MasterCard only. 


examples. ... This book is unique at this time... . The authors 
have performed a service to the math/physics community.” 
—Pierre Ramond, University of Florida 
Los Alamos Series in Basic and Applied Sciences 
840 pages in 2 volumes, 600 tables, 
$75.00 cloth (2 vol. set) 
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Stories About Maxima 


and Minima 


V.M. Tikhomirov, 
Translated by Abe Shenitzer 


We are pleased to announce the first volume ina 
new series of expository books translated from the 
Russian KVANT Library, published jointly with the 
American Mathematical Society. 


Throughout the history of mathematics, maximum 
and minimum problems have played an important 
role. Many beautiful and important problems have 
appeared in a variety of branches of mathematics 
and physics. The greatest scientists of the past— 
Euclid, Archimedes, Heron, the Bernoullis and 
Newton, took part in seeking solutions to these 
concrete problems. The solutions stimulated the 
development of the theory, and, as a result, tech- 
niques were elaborated that made possible the 
solution of a tremendous variety of problems by a 
single method. 


This book presents fifteen “stories” designed to 
acquaint the reader with the central concepts of 
the theory of maxima and minima as well as with 
its illustrious history. 


In Part One, the author familiarizes readers with 
concrete problems that lead to discussion of the 
work of some of the greatest mathematicians of all 
time. Part Two introduces a method for solving 
maximum and minimum problems that originated 
with Lagrange. While the content of this method 
has varied constantly, its basic concept has en- 
dured for over two centuries. The final story is 
addressed primarily to those who teach math- 
ematics, for it impinges on the question of how to 
teach. The author strives to show how the analysis 
of diverse facts gives rise to a general idea, how 
this idea is transformed, how it is enriched by new 
content, and how it remains the same in spite of 
these changes. 


Contents 


Ancient maximum and minimum problems: 1. 
Why do we solve maximum and minimum prob- 
lems?; 2.The oldest problem—Dido’s problem; 3. 
Maxima and minima in nature (optics); 4. Maxima 
and minima in geometry;5. Maxima and minima in 
algebra and in analysis; 6. Kepler’s problem; 7. 
The brachistochrone; 8. Newton’s aerodynamical 
problem;Methods of solution of extremal prob- 
lems; 9. What is a function?; 10. What is an 
extremal problem?; 11. Extrema of functions of 
one variable; 12. Extrema of functions of many 
variables. Lagrange’s principle; 13. More problem 
solving; 14. What happened later in the theory of 
extremal problems; 15. The fifteenth story, or 
rather, a discussion. 


187 pp., 1991, Paperbound 
ISBN 0-8218-0165-1 


List: $23.00 MAA Member: $18.00 


Catalog Number: MAXIM 


ORDER FROM: 

Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, DC. 20036 

(FAX) (202) 265-2384 


Prepaid orders sent postage & 
handling free. Visa and Mastercard 
orders accepted. (Please give the card 
number and expiration date on credit 
card orders) We will bill for orders 
over $10.00. 


Topics in 
Noncommutative 


Geometry 
Yuri |. Manin 


There is a well-known correspondence 
between the objects of algebra and geometry: 
a space gives rise to a function algebra; a 
vector bundle over the space corresponds 
to a projective module over this algebra; 
cohomology can be read off the de Rham 
complex; and so on. In this book Yuri Manin 
addresses a variety of instances in which the 
application of commutative algebra cannot be 
used to describe geometric objects, emphasiz- 
ing the recent upsurge of activity in studying 
noncommutative rings as if they were function 
rings on “noncommutative spaces.” 


M. B. Porter Lectures, 
Rice University, Department of Mathematics 


Cloth: $35.00 ISBN 0-691-08588-9 
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Princeton Uni 


Introduction to 
Algebraic and 
Constructive 
Quantum Field 
Theory 


John C. Baez, Irving E. Segal, 
and Zhengfang Zhou 


The authors present a rigorous treatment 
of the first principles of the algebraic and 
analytic core of quantum field theory. Their 
aim is to correlate modern mathematical 
theory with the explanation of the observed 
process of particle production and of particle- 
wave duality that heuristic quantum field 
theory provides. Many topics are treated here 
in book form for the first time, from the origins 
of complex structures to the quantization of 
tachyons and domains of dependence for 
quantized wave equations. 
Princeton Series in Physics 
Philip W. Anderson, Arthur S. Wightman, and 
Sam B. Treiman, Editors 
$59.50 ISBN 0-691-08546-3 


To Infinity and 


Beyond 
A Cultural History of the Infinite 
Eli Maor 


Eli Maor describes, with abundant 
examples, the mathematician’s fascination 
with infinity. 

“Fascinating and enjoyable .... places 
the ideas of infinity in a cultural context 
and shows how they have been espoused 
and moided by mathematics.”—Science 
Over 160 illustrations 


Paper: $16.95 ISBN 0-691-02511-8 
Available from Princeton only in the U.S. and Canada 
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Revised 
and 
Updated 


THE LAST PROBLEM 


E. T. Bell 
Revised and updated by Underwood Dudley 


What Eric Temple Bell calls the last prob- 
lem is the problem of showing that Pierre 
Fermat was not mistaken when he wrote 
in the margin of a book, almost 350 years 
ago, that x” + y”™ = z” has no solution in 
positive integers when n > 3. The orig- 
inal text of THE LAST PROBLEM traced 
the problem from Babylonia in 2000 B.C. 
to seventeenth-century France. Along the 
way we learn quite a bit about history, and 
just as much about mathematics. Under- 
wood Dudley's notes bring Us up-to-date on 
recent attempts to solve the problem. 


The book is unique in that it is a biogra- 
phy of a famous problem. The book fits 
no categories. It is not a book of mathe- 
matics. Pages go by without an equation 
appearing. It is not a history of number the- 
ory because it includes too much about the 
history of the western world, and it is not 
a history of western civilization because its 
focus is on mathematics. It is too entertain- 
ing to be scholarly and contains too much 
mathematics to be widely popular. It is an 
unusual book. 


What T.A.A. Broadbent said about Bell's 
work applies to THE LAST PROBLEM. 
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His style is clear and exuberant, his 
opinions, whether we agree with them 
or not, are expressed forcefully, often 
with humor and a little gentile malice. 
He was no uncritical hero-worshipper, 
being as quick to mark the opportunity 
lost as the ground gained, so that from 
his books we get a vision of mathemat- 
ics as a high activity of the questing 
human mind, often fallible, but always 
pressing on the neverending search for 
mathematical truth. 


This is a rich and varied, wide-ranging book, 
written with force and vigor by someone with 
a distinctive style and point of view. It will 
provide hours of enjoyable reading for any- 
one interested in mathematics. 


328 pp., Paperbound, 1990 
ISBN-0-88385-451 -1 


List: $17.50 MAA Member: $13.50 
Catalog Number TLP 


ORDER FROM 


Lap Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D. C. 20036 


The Ohio State University 
invites applications for 
TRANSIT 


An NSF project to establish School/University Teams as 
regional technology training centers. Regional team training 
will be provided through summer in-service sessions and 
academic year follow-up conferences at Ohio State University. | 
Local living expenses with stipend support for pre-college team | 
| members is available. Regional teams will help create and/or | 
revise inservice training modules. Regional center teams will 
begin training teachers as school technology specialists at their | 
regional sites during Summer 1993. 


Deadline for completed applications is February 15, 1992. 


Write TRANSIT, c/o Frank Demana & Bert Waits, The Ohio 
State University, Mathematics Department, 231 West 18th 
Avenue, Columbus, OH 43210. 


University of Arizona 
GRADUATE FELLOWSHIPS 
IN THE 


MATHEMATICAL SCIENCES 

Up to 12 fellowships for outstanding new graduate students in the mathematical sciences may be 
available in 1992-93. Fellowship applicants should be seeking the Ph.D. and planning careers in teaching and/or 
fundamental research. Anticipated stipends are $12,000 or more for 12 months with both in-state and non- 
resident tuition waived. 

These fellowships are restricted to citizens, permanent residents, or individuals who have established 
intent to become citizens or permanent residents of the United States. Applications from U.S. women and 
students belonging to U.S. minority groups are particularly invited. Currently one-fourth of the U.S. graduate 
students in pure and applied mathematics at Arizona are women. 

The University of Arizona has excellent programs in traditional pure and applied mathematics, and 
is a leading institution in interdisciplinary applied mathematics. This presents a wealth of opportunities for 
graduate study encompassing such areas as dynamical systems, number theory, computational science, geometry, 
nonlinear partial differential equations, mathematical physics, probability, and applications of mathematics in 
the physical, biological, social, and engineering sciences. In addition, outstanding computational facilities for 
graduate study and research are available to the over 170 graduate students in the mathematica! sciences at the 
University of Arizona. 

Fellowship applicants of superior quality will be among the students invited to the Sixth Annual 
Workshop for Advanced Undergraduates on Current Ideas in Nonlinear Science, February 29 - March 3, 1992. 
Limited support is available for attendees. (The deadline for Workshop applications is February 1, 1992.) The 
workshop is designed to communicate topics in current active research in three areas: (i) Geometrical and 
Computational Aspects of Analysis and Number Theory, (ii) Analytical Approaches to Applied Problems, and 
(iii) Numerical Modelling. 

For information and application materials contact: 
W. M. Greenlee or T. W. Secomb 
Department of Mathematics/Program in Applicd Mathematics 
University of Arizona - Tucson, AZ 85721 - (602) 621-2068 
The University of Arizona is an Equal Opportunity Employer 


sources that will help you in your teaching. 
Extensive descriptions of advising programs 
that work is included, along with sugges- 
ache tions for teaching that describe a wide range 

of instructional techniques. You will learn 
about how to use computers in your teach- 
ing, and how to evaluate your performance 


A SOURCE BOOK FOR 
as well as that of your students. 


COLLEGE MATHEMATICS 
TEACHING 


Alan Schoenfeld, Editor. 
Prepared by the Committee on the 
Undergraduate Teaching of Mathematics 


Every faculty member concerned about teach: 
ing should read this book. Every admin- 
istrator with responsibility for the quality of 
mathematics programs should have a copy. 


Do you want a broader, deeper, more suc- 80 pp., 1990, Paper, 
cessful mathematics program? This Source ISBN 0-88385-068-0 
Book points to the resources and perspec- List $10.00 


tives you need. Catalog Number SRCE 
This book provides the means for improv- 


ing instruction, and describes the broad 
spectrum of mathematical skills and per- 
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spectives our student should develop. The 
curriculum recommendations section shows 
where to look for reports and course re- 


Studies in Numerical 
Analysis, 
Gene H. Golub, Editor 


MAA Studies in 
Mathematics #24 


415 pp. Cloth, 1985, 
ISBN-0-88385-1 26-1 
List: $46.50 

MAA Member: $34.50 


This volume is a collection of papers describing 
the wide range of research activity in numerical 
analysis. The articles describe solutions to a 
variety of problems using many different kinds 
of computational tools. Some of the computa- 
tions require nothing more than a hand-held 
calculator; others require the most modern com- 
puter. While the papers do not cover all of the 
problems that arise in numerical analysis, they 
do offer an enticing and informative sampling. 
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Derive Version 1 6 


The DERIVEs 
program 
solves both 
symbolic 

and numeric 
problems, 

and it plots 
beautifully too. 


2000 Years of . 
Mathematical Knowledge 
on a Disk 


¢ Symbolic math from algebra through 
calculus. 


¢ Plots in both 2-D and 3-D. 

* Simple, letter-driven menu interface. 
¢ Solves equations exactly. 

¢ Understands vectors and matrices. 


¢ Split or overlay algebra and plot 
windows. 


¢ Displays accepted math notation. 

¢ Performs arithmetic to thousands of 
digits. 

¢ Simplifies, factors and expands 
expressions. 


¢ Does exponential, logarithmic, 
trigonometric, hyperbolic and 
probability functions. 


Soft Warchouse: 


DERIVE®, A Mathematical Assistant is now available for palmtops through 486-based PCs. 


a compact card. 


DERIVE is a registered trademark of Soft Warehouse, Inc. 


Taylor and Fourier series 
approximations. 


4 
Permits recursive and iterative 
programming. 


Can generate Fortran, Pascal and 
Basic statements. 


System requirements 


PC version: MS-DOS 2.1 or later, only 
512Kb RAM and one 3.5" or 5.25" disk 
drive. Suggested retail price is $250. 


ROM-card version: Hewlett-Packard 
95LX Palmtop computer. Suggested 
retail price is $289. 


Contact Soft Warehouse for a list of 
dealers. Or, ask at your local computer 
store, software store or HP calculator 
dealer. Dealer inquires are welcome. 


Soft Warehouse, Inc. « 3660 Waialae Avenue 
Suite 304 ¢ Honolulu, Hl, USA 96816-3236 
Phone: (808) 734-5801 « Fax: (808) 735-1105 


MATHEMATICS BOOKS FROM BCS 
NUMERICAL LINEAR ALGEBRA by Willy Brandal (1991) 


Designed as an undergraduate text. Carefully written with the appropriate level of rigor 
to make it work. Various algorithms are mathematically developed and compared. Care is 
taken with regard to which results hold in the field of reals and which hold in the complex 
numbers. Topics include: Gaussian elimination, LU decomposition, Cholesky decomposition, 
the Gauss-Seidel, power, and QR methods, Aitken acceleration, and others. Includes over 
350 exercises distributed at the end of sections, and 5 computer assignments. 

ISBN 0-914351-05-2 Paperback only 212pp US$30.00 


Note. In order to keep the cost to the student as reasonable as possible: 
We do not intend to periodically float new editions. 
2) We do not send examination copies— we think your library (or department) should 
purchase a copy. 


Translations into English of classics, both old and new. 


FUNDAMENTALS OF GRAPH THEORY by A.A.Zykov (1990) 
Translation, assisted by the author, of the 1987 Russian edition (M.:Nauka.Gl.red.Fiz.- 
mat.lit-). Includes a Glossary-Index-Bibliography that should be of immense value to the 
researcher and student of Graph Theory. An impressive number of exercises of varying 
difficulty are included at the end of each section. This was written as a textbook, and is ad- 
mirable suited for that purpose. A sample of some topics included in this vast work: Simple, 
labeled, multi-, topological and directed graphs: numerous invariants— optimum and crit- 
ical graphs; the isomorphism problem and Vizing’s construction; reconstruction problems; 
matchings; embeddings in surfaces, planarity; Hadwiger’s conjecture; various colorings; per- 
fect graphs: reachability; kernels; and a great many others. There is an Appendix on Boolean 
methods in graph theory. 

ISBN 0-914351-04-4 Paperback only 371pp US$48.00 


THEORY OF CONVEX BODIES by T.Bonnesen and W.Fenchel (1987) 
Originally published as volume 3 of Springer-Verlag’s “Ergebnisse” series in 1934. 
ISBN 0-914351-02-8 Paperback only 183pp § US$30.00 


GRAPH THEORY: A Development from the 4-Color Problemby Martin Aigner (1987) 
An exceptionally readable and aptly titled development of graph theory. The original German 
edition was published in 1984 by B.G. Teubner. 

ISBN 0-914351-03-6 Paperback only 233pp US$36.00 


TOPOLOGY OF SURFACES by André Gramain 1984 


A beautiful introduction to the concepts and methods of differential topology. Originally 
published in 1974 by Presses Univeritaires de France. 


ISBN 0-914351-01-X Paperback only 227 pp US$24.00 
KNOT THEORY by Kurt Reidemeister (1983) 
A translation of Springer-Verlag’s 1932 German edition. 

ISBN 0-914351-00-1 Paperback only 158pp US$20.00 


Individual members of the American Mathematics Society or Association who cite this ad may take a 
discount of 10% from the above prices. 

Sorry, but we can only accept prepaid orders from individuals and do not accept credit cards. Orders 
from Canada, Mexico and the USA will be sent 4th class bookrate. Please include $1.75 for postage and 
handling for the first book, and $.50 for each additional book up to a maximum postage and handling 
fee of $4.75. Foreign orders must be paid in US dollars. 

If a insist, we will ship UPS with the following shipping and handling fees: $5.00 for the first book, 
and $.75 for each additional book up to a maximum shipping and handling fee of $9.50. 

Foreign orders except from Canada or Mexico please add the following postage and handling charges: 
$3.30 (Surface Mail) for the first book and $1.00 for each additional book. Foreign Airmail will be at 
corps $1.00 per order (cost is approximately $12.00 for the first book and $6.00 for each additional 
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